[GRASS-dev] new v.in.geonames: problems with UTF-8 Unicode text
Hamish
hamish_b at yahoo.com
Mon Jun 30 11:39:31 EDT 2008
Markus:
> I am writing v.in.geonames to easily read in data from
> http://download.geonames.org/export/dump/
>
> The script is essentially using v.in.ascii to read in the
> CSV file encoded in UTF-8 Unicode text.
I think I may have found a buffer overflow in their DB... (NZ.zip export)
2181761 Taumatawhakatangihangakoauauotamateapokaiwhenuakitanatahu Taumatawhakatangihangakoauauotamateapokaiwhenuakitanatahu
Taumata-whakatangihanga-koauau-a-Tamatea-pokai-whenua-ki-tana-tahu,Taumata-whakatangihanga-kÅauau-a-Tamatea-pÅkai-whenua-ki-tana-tahu,Taumatauakatangiangakoauauotamateaturipukakapikimaungakhoronukupokanuehnuakitanatakhu,Taumatawhakatangihangakoauau,Taumatawhakatangihangakoauauotamateapokaiwhenuakitanatahu,Taumatawhakatangihangakoauauotamateaturipukakapikimaungahoronukupokaiwhenuakitanatahu,Taumatawhakatangihangakoauotamateaturipukakapikimaungahoronukupokaiwhenuakitanataha,Tetaumatafakatangikhangakoauaotamateaurekhaeaturipukapikhimaungakhoronukupokaifenuaakitanarakhu,Tetaumatawhakatangihangakoauaotamateaurehaeaturipukapihimaungahoronukupokaiwhenuaakitanarahu,ТаÑмаÑаÑакаÑангиангакоаÑаÑоÑамаÑеаÑÑÑипÑкакапикимаÑнгаÑ
оÑонÑкÑпоканÑÑнÑакÐ
¸ÑанаÑаÑ
Ñ,ТеÑаÑмаÑаÑакаÑангиÑ
ангакоаÑаоÑамаÑеаÑÑеÑ
аеаÑÑÑипÑкапиÑ
имаÑнгаÑ
оÑонÑкÑпокаиÑенÑаакиÑанаÑаÑ
Ñ,ã¿ã¦ãã¿ãã¡ã«ã¿ã³ã®ãã³ã¬ã³ã¢ã¦ã¢ã¦ãªã¿ããã¢ãã«ã¤ãã§ãã¢ãã¿ãã¿ã,å¡ä¹çå¡æ³å¡å¡å°¼åå¨å¯é¿ä¹é¿ä¹æ¬§å¡çæäºå¡å¯è´¹åªåå¥å¡å¨å¡è¡ -40.35 176.55 T HLL NZ NZ 00 0 191 Pacific/Auckland 2007-02-17
v.in.ascii chokes there for obvious reasons.
Move over small welsh towns!
http://en.wikipedia.org/wiki/Taumatawhakatangihangakoauauotamateapokaiwhenuakitanatahu
[checkout the translation(s)]
Hamish
More information about the grass-dev
mailing list