[postgis-users] problems with shp2pgsql
Paul Ramsey
pramsey at refractions.net
Tue May 1 08:45:37 PDT 2007
Bruce,
It sounds like you've come across a genuine bug (assuming those are
valid UTF-8 code points). It may be some time before we get to fixing
this one, though a patch would be accepted. In the meanwhile, can you
file it on http://postgis.refractions.net/bugs/ so we don't lose it?
thanks,
Paul
Bruce Rusk wrote:
> I'm having a problem using shp2pgsql to convert a set of shapefiles.
> Some of the records created have values that are not properly enclosed
> in quotation marks: there is any open quote but not closing quote for
> one of the VALUES.
>
> The data I'm working with is UTF-8; there is a particular string in it
> that causes problems. When a field has the following sequence of
> characters, shp2pgsql fails to add the second quote:
>
> U+6682 U+65E0
>
> In the data, this is a common value for one of the text fields. In the
> .sql file, it converts to a byte sequence that somehow ends up 'eating'
> the close quote. From the opening quote, the offending bytes are, ending
> with the comma:
>
> 27 e6 9a 82 e6 97 a7 2c
>
> One hint: in the WIN-1252 codepage, 82 is U+201A SINGLE LOW-9 QUOTATION
> MARK.
>
> There is at least one other string that causes this type of problem,
> though I haven't yet figured out what the original .text is there. Any
> suggestions of fixes/causes?
>
> I'm running the latest POSTGIS release on Win XP SP2.
>
> Thanks,
>
> Bruce Rusk
> _______________________________________________
> postgis-users mailing list
> postgis-users at postgis.refractions.net
> http://postgis.refractions.net/mailman/listinfo/postgis-users
--
Paul Ramsey
Refractions Research
http://www.refractions.net
pramsey at refractions.net
Phone: 250-383-3022
Cell: 250-885-0632
More information about the postgis-users
mailing list