parquet Search Results - Githubissues

1000+ results
for parquet

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

apache/hudi #11785

Unable to merge cdc records to Hudi snapshot.

I am unable to add cdc records to a snapshot. environment ``` emr 7.2.0 AmazonCloudWatchAgent 1.300032.2, Hive 3.1.3, Spark 3.5.1, Zeppelin 0.10.1 ``` spark command ``` spark-shell …

rcc1101 updated 1 week ago
6
duckdb/dbt-duckdb #419

reading from encrypted parquet file

DuckDB allows use of [encrypted Parquet files](https://duckdb.org/docs/data/parquet/encryption.html). The key is set via a PRAGMA statement like `PRAGMA add_parquet_key('key128', '0123456789112345'…

jb8628 updated 2 months ago
5
apache/arrow-rs #6454

`parquet::column::reader::GenericColumnReader::skip_records`…

**Describe the bug** I noticed this while investigating https://github.com/apache/datafusion/issues/7845#issuecomment-2370455772. The suggestion from @jayzhan211 and @alamb was that `datafusion.…

samuelcolvin updated 4 days ago
4
vega/vega #3961

Support parquet as a loader format

Request: Support loading parquet as a source datatype in vega-loader Envisioned solution: ```javascript { "data": [ { "name": "my_data", "format": {"type": "parquet"}, …

kszlim updated 1 month ago
5
facebookincubator/velox #9767

Parquet Reader Decoder Support Status

### Description # Normal Data Page Types Velox Type | Parquet LogicalType | Parquet ConvertedType | Parquet Storage Type | Supported? -- | -- | -- | -- | -- BOOLEAN | | | BOOLEAN (1 …

yingsu00 updated 1 week ago
1
pola-rs/polars #18293

The output of `hive_partitioning=False` differs in `scan_ipc…

### Checks - [X] I have checked that this issue has not already been reported. - [X] I have confirmed this bug exists on the [latest version](https://pypi.org/project/polars/) of Polars. ### Reprodu…

etiennebacher updated 2 weeks ago
4
ERDDAP/erddap #196

Add EDDTableFromParquet reader?

This needs more investigation. If already available integrations (jdbc) can read we should document that. Otherwise it may be worth a new dataset type to support parquet. ### Discussed in https://g…

ChrisJohnNOAA updated 3 weeks ago
1
apache/arrow #43695

[C++][Parquet] Proof-of-concept: Trying to using FlatBuffer …

### Describe the enhancement requested Background: [Parquet Metadata evolution](https://docs.google.com/document/d/1PQpY418LkIDHMFYCY8ne_G-CFpThK15LLpzWYbc7rFU/edit) Should we just do a POC for th…

mapleFU updated 1 month ago
11
huggingface/datatrove #263

Optimize parquet output for remote reading

TLDR: the primary pain point here is huge (in terms of total uncompressed byte size) row groups - writing the PageIndex OR reducing row group sizes, perhaps both, would help a lot. Basically, the d…

H-Plus-Time updated 1 month ago
2
kylebarron/arro3 #195

Feature requests

It would be great: - you could select columns for reading from parquet, or, even better, select from the schema hierarchy in general for deeper structured datasets - you allow reading row-group X …

martindurant updated 3 days ago
1

上一页 1...8 9 10 11 12 13 14...100 下一页

1000+ results for parquet

1000+ results
for parquet