Page 1 of 1

Bug: Other charsets

PostPosted: Fri Dec 16, 2005 8:17 am
by Tomi
The system replaces all non-English characters with a ? (for example, čž, etc.).

If is to become a recognised community-based project, than this must be improved.



PostPosted: Wed Jun 07, 2006 3:49 am
by paulhaeder
I agree. I'd love to see Zürich, instead of Z?rich etc. Please try to solve this.


PostPosted: Thu Jun 08, 2006 8:01 am
by robocoder
Is that an actual example? It looks like Zürich is stored as Zurich in the database.

I did, however, notice that Beaupré, QC is coming up as Beaupr, QC.

There are a number of ways to approach this, including:
  • store the umlauts natively in the database
  • add an encode/decode step
  • use lookup tables (e.g., Zurich -> Zürich)
Also, the soundex (used in "Make a correction") would need to handle umlauts (if it doesn't already do so).


PostPosted: Mon Jun 12, 2006 6:50 am
by paulhaeder
That is an actual example.

It looks like Zürich is stored as Zurich in the database.

Yes and no. That's another problem I've come across. Cities are stored with different spellings: Zurich and Zürich. Rome is aka. Roma (Italian) and Rom (German). Which one is returned, depends on the IP...

What's the status of this bug?

PostPosted: Tue Jan 16, 2007 2:36 pm
by sneba
I would like to know the status of this bug before I consider implementing the hostip system into my project. Are special characters in citynames still not shown correctly? Can anyone tell me?


PostPosted: Tue Jul 03, 2007 10:56 am
by Coval
Thanks for a great free service! Great job!

There is a huge problem with polish city names. Most of them which includes in their names our "national characters" are corrupted.

ś, Ś, Ł, ł, ą, Ą, ź, Ź, ż, Ż, ę, Ę

In what charset is hostinfo database encoded?