[gdal-dev] Issue converting geo parquet data
Even Rouault
even.rouault at spatialys.com
Tue Jul 30 12:38:34 PDT 2024
Hi Michael,
This error comes very deep inside libparquet, actually in libthrift. The
more natural assumption would be that it would be due to a corrupted
Parquet file. If you disabled multithreading (GDAL_NUM_THREADS=1) and
enabled --debug on, perhaps this would happen on the same file ? I was
wondering also if that couldn't come from RAM exhaustion, but that
doesn't seem likely (did you monitor the RAM consumption?). It would be
interesting to see if that would also occur after fetching locally the
files under overturemaps-us-west-2/release/2024-07-22.0/theme=buildings
and running the conversion from the local files
Even
Le 30/07/2024 à 20:37, Michael Smith via gdal-dev a écrit :
>
> Hi all,
>
> I was converting the overture maps parquet data to geopackage and got
> this error using gdal master (via conda).
>
> GDAL_NUM_THREADS=ALL_CPUS CPL_TMPDIR=/data2 ogr2ogr -f gpkg
> /data/overture_buildings.gpkg
> /vsis3/overturemaps-us-west-2/release/2024-07-22.0/theme=buildings
> "theme=buildings" -progress -nlt PROMOTE_TO_MULTI -nln buildings
> -skipfailures
>
> ERROR 1: ReadNext() failed: Couldn't deserialize thrift:
> TProtocolException: Exceeded size limit
>
> Not sure what size limit it exceeded? I’ve had this happen several
> times but at very different points in the process.
>
> Ideas?
>
> Mike
>
> --
>
> Michael Smith
>
> US Army Corps of Engineers
>
> Remote Sensing/GIS Center
>
>
> _______________________________________________
> gdal-dev mailing list
> gdal-dev at lists.osgeo.org
> https://lists.osgeo.org/mailman/listinfo/gdal-dev
--
http://www.spatialys.com
My software is free, but my time generally not.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.osgeo.org/pipermail/gdal-dev/attachments/20240730/19d47e57/attachment.htm>
More information about the gdal-dev
mailing list