[gdal-dev] Re: WFS and -where with non-ASCII characters

Mateusz Łoskot mateusz at loskot.net
Tue Jan 3 10:39:51 EST 2012

On 3 January 2012 15:22, Even Rouault <even.rouault at mines-paris.org> wrote:
>> Even,
>> Your test program works for me as expected for text with
>> all Polish diacritics included
> As expected, really ? I can see on the photo that there are 17 characters in the
> string and that it prints 17. But I'd say it is *not* the expected result. If it
> was UTF-8, it would be more than 17 because strlen() will/should return the
> number of bytes.


You are right. I wrongly assumed the result was correct: I thought the
Polish diacritics would each fit in a single byte in UTF-8, as they do in
the 8-bit extended-ASCII encodings.
Of course, each of them is 2 bytes long in UTF-8.
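
To illustrate the point, here is a minimal sketch in C. It is not the test
program from this thread: the sample word "zażółć", the helper
utf8_codepoints() and the array names are made up for the example. The bytes
are written as explicit escapes so the result does not depend on the encoding
of the source file, and the helper assumes valid UTF-8 input.

```c
#include <stddef.h>
#include <string.h>

/* Count code points in a UTF-8 string by skipping continuation bytes
   (bytes of the form 10xxxxxx). Hypothetical helper; it assumes the
   input is valid UTF-8 and does no validation. */
size_t utf8_codepoints(const char *s)
{
    size_t n = 0;
    for (; *s != '\0'; ++s) {
        if (((unsigned char)*s & 0xC0) != 0x80)
            ++n;
    }
    return n;
}

/* "zażółć" encoded as UTF-8: each of the four diacritics takes 2 bytes. */
const char sample_utf8[] = "za\xC5\xBC\xC3\xB3\xC5\x82\xC4\x87";

/* The same word encoded as ISO-8859-2: one byte per character. */
const char sample_latin2[] = "za\xBF\xF3\xB3\xE6";
```

With these definitions, strlen(sample_utf8) is 10 while
utf8_codepoints(sample_utf8) is 6, and strlen(sample_latin2) is 6. So a
strlen() result equal to the number of visible characters suggests the
string is in a single-byte encoding rather than UTF-8.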

Best regards,
Mateusz Loskot, http://mateusz.loskot.net
