[GRASS-dev] new v.in.geonames: problems with UTF-8 Unicode text

Markus Neteler neteler at osgeo.org
Mon Jun 30 06:05:24 EDT 2008


Hi,

I am writing v.in.geonames to easily read in data from
http://download.geonames.org/export/dump/

The script is essentially using v.in.ascii to read in the CSV file encoded
in UTF-8 Unicode text. There are placenames in various languages including
Japanese.
v.in.ascii isn't able to read them properly and fails on such lines, example:

3165456 Torre del Greco Torre del Greco Torre d%27%27o Grieco,Torre
d''o Grieco,Torre d''o Grieco,Torre del Greco,トッレ・デル・グレーコ
40.7839209532791        14.3708038330078        P       PPL     IT
         04      NA      063084          90607           72
Europe/Rome     2008-06-28

(I have slightly improved the v.in.ascii error message, not yet submitted):

ERROR: Unparsable latitude value in column <4>: 'o Grieco,Torre d''o
       Grieco,Torre del Greco,トッレ・デル・グレーコ

How to fix this problem?

thanks
markus


More information about the grass-dev mailing list