parquet Search Results - Githubissues

1000+ results
for parquet

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

apache/arrow-rs #6733

Add Option To Coerce List Type on Parquet Write

**Describe the bug** arrow-rs generated .parquet files where the schema implies a nested structure should call the list item `element` as of parquet specifications: https://github.com/apache/parquet…

ggreco updated 1 week ago
3
mukunku/ParquetViewer #121

[BUG]

**Parquet Viewer Version** 3.1.0, also tried/used 2.8. **Where was the parquet file created?** Apache Spark org.apache.spark.timeZone GMT org.apache.spark.legacyINT96 org.apache.spark.version 3.…

veranscoto updated 1 week ago
2
apache/iceberg #11614

java.io.IOException: can not read class org.apache.iceberg.s…

### Apache Iceberg version 1.4.3 ### Query engine Spark ### Please describe the bug 🐞 ``` CALL spark_catalog.system.rewrite_data_files( table => '${DATABASE_NAME}.${TABLE_NAME}'…

wardlican updated 3 days ago
4
rapidsai/cudf #16968

[BUG] Incorrect read_parquet on spark distributed parquet fi…

When I read in a parquet dataset saved with Spark on a databricks catalog I get lots of . I tried ``` import glob cudf_dfs = [cudf.read_parquet(file) for file in glob.glob("/Volumes/path/*.parquet…

matt7salomon updated 1 month ago
2
devinrsmith/deephaven-parquet-viewer #77

IntLogicalType, isSigned=false, bitWidth=64 not supported

I'm encountering the following error while trying to load some parquet files (using docker latest): ```log Initiating shutdown due to: Uncaught exception in thread main java.lang.RuntimeException…

corani updated 5 hours ago
1
evidence-dev/evidence #2836

[Bug]: SQLite: Will not eagerly load files larger than 32 Me…

### Describe the bug I'm trying to load an SQLite database that's around 100MB. Seems like I'm hitting this line when trying to access a table in the db that's bigger than 32MB: https://github.com/e…

luanmuniz updated 1 day ago
9
prestodb/presto #23840

[parquet] Use deterministic data in parquet.batchreader.deco…

Currently, all of these tests utilize randomness to generate page data for decoder verification. This can introduce test flakiness. We should re-write these tests to use a deterministic set of valu…

ZacBlanco updated 2 weeks ago
1
slingdata-io/sling-cli #435

Parquet invalid encoding / not valid UTF8

## Issue Description - Description of the issue: When exporting a MSSQL table to parquet I get a parquet file where DuckDB complains about string encoding issues. "select * from output.parquet…

OneCyrus updated 2 weeks ago
2
duckdb/duckdb #14898

ROW_GROUP_SIZE parameter doesn't work for some parquet files

### What happens? When using `COPY ... TO ...` to generate a new parquet file from another parquet file the `ROW_GROUP_SIZE` parameter doesn't work. The final row group size is very low (under 1…

nbc updated 6 days ago
4
paradedb/pg_analytics #173

Querying Hive partitioning parquets is slow

### What happens? Recently, we tried this extensions instead of using a standalone duckdb instance. When we run a simple `SELECT` query on parquet files, it's 2-20 times slower than DuckDB. Profil…

xqe2011 updated 2 weeks ago
1

上一页 1...2 3 4 5 6 7 8...100 下一页

1000+ results for parquet

1000+ results
for parquet