-
I am unable to add CDC records to a snapshot.
Environment:
```
emr 7.2.0
AmazonCloudWatchAgent 1.300032.2,
Hive 3.1.3,
Spark 3.5.1,
Zeppelin 0.10.1
```
Spark command:
```
spark-shell …
```
-
DuckDB allows use of [encrypted Parquet files](https://duckdb.org/docs/data/parquet/encryption.html).
The key is set via a PRAGMA statement like `PRAGMA add_parquet_key('key128', '0123456789112345'…
-
**Describe the bug**
I noticed this while investigating https://github.com/apache/datafusion/issues/7845#issuecomment-2370455772.
The suggestion from @jayzhan211 and @alamb was that `datafusion.…
-
Request:
Support loading Parquet as a source data type in vega-loader.
Envisioned solution:
```javascript
{
"data": [
{
"name": "my_data",
"format": {"type": "parquet"},
…
```
-
### Description
# Normal Data Page Types
Velox Type | Parquet LogicalType | Parquet ConvertedType | Parquet Storage Type | Supported?
-- | -- | -- | -- | --
BOOLEAN | | | BOOLEAN (1 …
-
### Checks
- [X] I have checked that this issue has not already been reported.
- [X] I have confirmed this bug exists on the [latest version](https://pypi.org/project/polars/) of Polars.
### Reprodu…
-
This needs more investigation. If the integrations that are already available (e.g. JDBC) can read Parquet, we should document that. Otherwise, it may be worth adding a new dataset type to support Parquet.
### Discussed in https://g…
-
### Describe the enhancement requested
Background: [Parquet Metadata evolution](https://docs.google.com/document/d/1PQpY418LkIDHMFYCY8ne_G-CFpThK15LLpzWYbc7rFU/edit)
Should we just do a POC for th…
-
TL;DR: the primary pain point here is huge row groups (in terms of total uncompressed byte size). Writing the PageIndex, reducing row group sizes, or perhaps both, would help a lot.
Basically, the d…
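
To make the "reduce row group sizes" option concrete, here is a minimal sketch of how a row limit per row group could be derived from an uncompressed byte budget. The function names, the 2 KiB average row size, and the 128 MiB target are all hypothetical and not taken from the issue; the arithmetic is only an estimate, since real row sizes vary.

```python
def rows_per_row_group(avg_row_bytes: int, target_group_bytes: int) -> int:
    """Largest row count whose estimated uncompressed size fits the target."""
    if avg_row_bytes <= 0 or target_group_bytes <= 0:
        raise ValueError("sizes must be positive")
    # Always allow at least one row, even if a single row exceeds the target.
    return max(1, target_group_bytes // avg_row_bytes)


def plan_row_groups(total_rows: int, rows_per_group: int) -> list[int]:
    """Split total_rows into consecutive row-group sizes."""
    full, rest = divmod(total_rows, rows_per_group)
    return [rows_per_group] * full + ([rest] if rest else [])


# Hypothetical numbers: 2 KiB average row, 128 MiB uncompressed budget per group.
limit = rows_per_row_group(avg_row_bytes=2048, target_group_bytes=128 * 1024 * 1024)
sizes = plan_row_groups(total_rows=200_000, rows_per_group=limit)
```

A limit computed this way could then be passed to whatever row-group size setting the writer in question exposes; which setting that is depends on the writer and is not specified here.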
-
It would be great if:
- you could select which columns to read from Parquet or, even better, select from the schema hierarchy in general for more deeply structured datasets
- you allow reading row-group X …