[GRASS-dev] new v.in.geonames: problems with UTF-8 Unicode text
Markus Neteler
neteler at osgeo.org
Mon Jun 30 06:05:24 EDT 2008
Hi,
I am writing v.in.geonames to easily read in data from
http://download.geonames.org/export/dump/
The script is essentially using v.in.ascii to read in the CSV file encoded
in UTF-8 Unicode text. There are placenames in various languages including
Japanese.
v.in.ascii isn't able to read them properly and fails on such lines, example:
3165456 Torre del Greco Torre del Greco Torre d%27%27o Grieco,Torre
d''o Grieco,Torre d''o Grieco,Torre del Greco,トッレ・デル・グレーコ
40.7839209532791 14.3708038330078 P PPL IT
04 NA 063084 90607 72
Europe/Rome 2008-06-28
(I have slightly improved the v.in.ascii error message, not yet submitted):
ERROR: Unparsable latitude value in column <4>: 'o Grieco,Torre d''o
Grieco,Torre del Greco,トッレ・デル・グレーコ
How to fix this problem?
thanks
markus
More information about the grass-dev
mailing list