[OSGeoLive] Parquet format in Debian

Brian M Hamlin maplabs at light42.com
Thu May 9 07:09:08 PDT 2024


there has been some discussion online recently regarding Debian and the 
Parquet data format

   Are Parquet data drivers available in ordinary Debian repos ?

   Does GDAL build for #osgeolive include a Parquet data access driver ?

     --Brian M Hamlin    /  MAPLABS  /  Berkeley, California


jgreen writes:

Pandas is in Debian but it cannot read parquet files itself, it uses 3rd 
party "engines" for that purpose and those are not available in Debian

|Python 3.10.12 (main, Nov 20 2023, 15:14:05) [GCC 11.4.0] on linux Type 
"help", "copyright", "credits" or "license" for more information. >>> 
import pandas >>> pandas.read_parquet('sample3.parquet') Traceback (most 
recent call last): File "<stdin>", line 1, in <module> File 
"/usr/lib/python3/dist-packages/pandas/io/parquet.py", line 493, in 
read_parquet impl = get_engine(engine) File 
"/usr/lib/python3/dist-packages/pandas/io/parquet.py", line 53, in 
get_engine raise ImportError( ImportError: Unable to find a usable 
engine; tried using: 'pyarrow', 'fastparquet'. A suitable version of 
pyarrow or fastparquet is required for parquet support.|
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.osgeo.org/pipermail/osgeolive/attachments/20240509/85f55159/attachment.htm>


More information about the osgeolive mailing list