[gdal-dev] Reading GeoJSON with diacritics via GDAL C# bindings — identifiers vs data values
Rahkonen Jukka
jukka.rahkonen at maanmittauslaitos.fi
Tue Jan 13 23:50:00 PST 2026
Hi,
I don't have an answer to your question, but I checked the JSON specification https://www.rfc-editor.org/rfc/rfc8259 and it says:
"JSON text exchanged between systems that are not part of a closed ecosystem MUST be encoded using UTF-8"
-Jukka Rahkonen-
________________________________________
Lähettäjä: gdal-dev <gdal-dev-bounces at lists.osgeo.org> käyttäjän John Smith via gdal-dev <gdal-dev at lists.osgeo.org> puolesta
Lähetetty: Keskiviikko 14. tammikuuta 2026 7.59
Vastaanottaja: Even Rouault <even.rouault at spatialys.com>; David Klaus via gdal-dev <gdal-dev at lists.osgeo.org>
Aihe: [gdal-dev] Reading GeoJSON with diacritics via GDAL C# bindings — identifiers vs data values
I am using GDAL 3.9.3 and reading GeoJSON files (saved in ANSI/Windows-1252 encoding) via the GDAL C# bindings. I want to clarify how SQL queries with diacritics are handled:When a SQL query contains identifiers (table/column names) with diacritics, e.g., "Gelände", are the bytes looked up exactly as-is in the dataset, without any UTF-8 conversion?When the query contains data values with non-English characters, e.g., 'CCüri kom XXCX', does GDAL interpret these as UTF-8 (or according to the dataset encoding) internally?In other words, is it correct to assume that identifiers are matched byte-for-byte, while data values are parsed according to GDAL’s encoding rules, when passing SQL queries via C# bindings?Any clarification would be appreciated.BRJohn
More information about the gdal-dev
mailing list