[GRASS-dev] new v.in.geonames: problems with UTF-8 Unicode text

Glynn Clements glynn at gclements.plus.com
Mon Jun 30 11:26:23 EDT 2008


Markus Neteler wrote:

> I am writing v.in.geonames to easily read in data from
> http://download.geonames.org/export/dump/
> 
> The script is essentially using v.in.ascii to read in the CSV file encoded
> in UTF-8 Unicode text. There are placenames in various languages including
> Japanese.
> v.in.ascii isn't able to read them properly and fails on such lines, example:
> 
> 3165456 Torre del Greco Torre del Greco Torre d%27%27o Grieco,Torre
> d''o Grieco,Torre d''o Grieco,Torre del Greco,トッレ��デル・グレーコ
> 40.7839209532791        14.3708038330078        P       PPL     IT
>          04      NA      063084          90607           72
> Europe/Rome     2008-06-28
> 
> (I have slightly improved the v.in.ascii error message, not yet submitted):
> 
> ERROR: Unparsable latitude value in column <4>: 'o Grieco,Torre d''o
>        Grieco,Torre del Greco,トッレ・デル・グレーコ
> 
> How to fix this problem?

Can you please provide accurate and sufficient information about the
problem?

I.e. the exact data being fed to v.in.ascii (*before* it has been
mangled by the various components of the email chain), the v.in.ascii
command which is failing, etc.

-- 
Glynn Clements <glynn at gclements.plus.com>


More information about the grass-dev mailing list