<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=UTF-8">
</head>
<body>
<p>There has been some discussion online recently regarding Debian
and the Parquet data format.</p>
<p>Are Parquet data drivers available in the ordinary Debian repos?</p>
<p>Does the GDAL build for #osgeolive include a Parquet data access
driver?</p>
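<p>One way to answer the GDAL question on a given machine is to ask the
Python bindings directly. This is only a sketch: it assumes the
<code>osgeo</code> bindings are installed and that the driver's short name
is "Parquet", and the helper name <code>gdal_has_parquet_driver</code> is
made up for illustration.</p>

```python
def gdal_has_parquet_driver(driver_name="Parquet"):
    """Return True/False if GDAL is importable, or None if it is not.

    Assumption: the driver is registered under the short name "Parquet".
    """
    try:
        from osgeo import gdal
    except ImportError:
        # The GDAL Python bindings are not installed, so nothing can be said.
        return None
    # Make sure all compiled-in drivers are registered before looking one up.
    gdal.AllRegister()
    return gdal.GetDriverByName(driver_name) is not None


if __name__ == "__main__":
    result = gdal_has_parquet_driver()
    if result is None:
        print("GDAL Python bindings not found")
    elif result:
        print("This GDAL build includes a Parquet driver")
    else:
        print("This GDAL build has no Parquet driver")
```

<p>A <code>None</code> result only means the bindings are missing; it says
nothing about the underlying GDAL library.</p>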
<p> --Brian M Hamlin / MAPLABS / Berkeley, California</p>
<p><br>
</p>
<p>jgreen writes:</p>
<p>Pandas is in Debian, but it cannot read Parquet files itself; it
uses third-party "engines" for that purpose, and those are not
available in Debian.</p>
<pre style="overflow: auto; padding: 2px; white-space: pre-wrap; overflow-wrap: anywhere; color: rgb(0, 0, 0); font-size: 13.3333px; font-style: normal; font-variant-ligatures: normal; font-variant-caps: normal; font-weight: 400; letter-spacing: normal; orphans: 2; text-align: start; text-indent: 0px; text-transform: none; widows: 2; word-spacing: 0px; -webkit-text-stroke-width: 0px; background-color: rgb(246, 246, 239); text-decoration-thickness: initial; text-decoration-style: initial; text-decoration-color: initial;"><code>Python 3.10.12 (main, Nov 20 2023, 15:14:05) [GCC 11.4.0] on linux
Type "help", "copyright", "credits" or "license" for more information.
>>> import pandas
>>> pandas.read_parquet('sample3.parquet')
Traceback (most recent call last):
  File "&lt;stdin&gt;", line 1, in &lt;module&gt;
  File "/usr/lib/python3/dist-packages/pandas/io/parquet.py", line 493, in read_parquet
    impl = get_engine(engine)
  File "/usr/lib/python3/dist-packages/pandas/io/parquet.py", line 53, in get_engine
    raise ImportError(
ImportError: Unable to find a usable engine; tried using: 'pyarrow', 'fastparquet'.
A suitable version of pyarrow or fastparquet is required for parquet support.</code></pre>
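<p>The ImportError above names the two engines pandas tried. A quick way
to see whether either one is importable on a system, without triggering
the traceback, might look like the sketch below. The helper name
<code>available_parquet_engines</code> is made up for illustration, and
being importable is necessary but not sufficient: pandas also requires a
suitable version.</p>

```python
import importlib.util

# Engine module names taken from the ImportError message above.
PARQUET_ENGINES = ("pyarrow", "fastparquet")


def available_parquet_engines(candidates=PARQUET_ENGINES):
    """Return the candidate modules that can be found on the import path.

    This checks presence only; it does not verify that the installed
    version is one that pandas accepts.
    """
    return [
        name
        for name in candidates
        if importlib.util.find_spec(name) is not None
    ]


if __name__ == "__main__":
    found = available_parquet_engines()
    if found:
        print("Parquet engines found:", ", ".join(found))
    else:
        print("No Parquet engine found; pandas.read_parquet will raise ImportError")
```

<p>An empty result matches the situation jgreen describes, where pandas
is installed but neither engine is.</p>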
</body>
</html>