-
As of now, the documentation for the function http://spark.rstudio.com/reference/spark_write_parquet.html does not make clear what the acceptable values for the `mode` parameter are, or how to use the…
-
**Describe the bug**
The `dataframe.repartition()` function doesn't work as expected.
**To Reproduce**
Using the `tpch` binary from the benchmarks, convert the `.tbl` (CSV) files to Parquet format using t…
-
Hi,
When computing `connectedComponents` using the GraphFrames algorithm, I get the following error:
```
File "/root/.ivy2/jars/graphframes_graphframes-0.5.0-spark2.1-s_2.11.jar/graphframes/graphfr…
```
-
**Is your feature request related to a problem or challenge? Please describe what you are trying to do.**
Currently, we can only read Parquet files from the local file system. It would be nice to add s…
-
full_sorting_merge is a very effective way to merge two large tables when it is known that the tables are already sorted.
But…
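For intuition, the core idea of a full sorting merge can be sketched in plain Python: when both inputs are already sorted on the join key, they can be joined in a single linear pass with no hash table on either side. This is a simplified inner-join sketch under the assumption of unique keys per side, not the actual engine implementation:

```python
def sorted_merge_join(left, right):
    """Inner-join two lists of (key, value) pairs, both pre-sorted by key.

    Runs in O(len(left) + len(right)) time and builds no hash table,
    which is why a sort-merge strategy suits large pre-sorted tables.
    Simplification: assumes keys are unique on each side.
    """
    out = []
    i = j = 0
    while i < len(left) and j < len(right):
        lk, lv = left[i]
        rk, rv = right[j]
        if lk == rk:
            # Matching keys: emit a joined row and advance both cursors.
            out.append((lk, lv, rv))
            i += 1
            j += 1
        elif lk < rk:
            # Left key is smaller: it can never match, skip it.
            i += 1
        else:
            # Right key is smaller: skip it.
            j += 1
    return out
```

The linear pass also means neither table needs to fit in memory at once; each side can be streamed in sorted order.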
-
### Description
As per the streaming API documentation (https://pola-rs.github.io/polars/user-guide/concepts/streaming/#when-is-streaming-available), streaming now supports: scan_csv, scan_parquet, sc…
-
Hi,
I have the following issue.
I have a record where one of the fields has the `time-millis` logicalType.
When saving to Avro format, the column in the resulting table has TIMESTAMP type.
Unfortunately, when saving to Pa…
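For context, Avro's `time-millis` logical type encodes a time of day as the number of milliseconds after midnight, which is why mapping it to a TIMESTAMP (a point on the timeline) is surprising. A small decoding sketch in plain Python, independent of any Avro library:

```python
from datetime import time

def time_millis_to_time(millis_of_day: int) -> time:
    """Decode an Avro `time-millis` value (milliseconds since midnight)
    into a wall-clock time.

    Note the result is a time-of-day, not a timestamp, so storing it
    in a TIMESTAMP column loses that distinction.
    """
    seconds, millis = divmod(millis_of_day, 1000)
    minutes, seconds = divmod(seconds, 60)
    hours, minutes = divmod(minutes, 60)
    # datetime.time takes microseconds, so scale the millisecond part.
    return time(hours, minutes, seconds, millis * 1000)
```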
-
Hi, I had a problem loading a Parquet file from S3 when there's a space in the path. I tried `%20`, but that doesn't work either.
Example path:
```
s3://my-bucket/trusted/receita_socios/version…
```
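One thing worth checking when debugging this is how the object key is actually percent-encoded, since different layers (URL parsing, the S3 client) sometimes encode or decode the space at different points. A quick sketch with the Python standard library; the key below is made up for illustration:

```python
from urllib.parse import quote, unquote

# Hypothetical S3 key containing a space (not the actual path from the issue).
key = "trusted/receita socios/part-0.parquet"

# Percent-encode everything except the path separators:
# a space becomes %20, and decoding restores the original key.
encoded = quote(key, safe="/")
assert unquote(encoded) == key
print(encoded)
```

If the reader decodes `%20` back to a space before handing the key to the S3 client, the encoded form will fail in exactly the way described above.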
-
### Problem description
`scan_parquet` today doesn't support adding the partition columns for directory-partitioned Parquet (and CSV). Without this, the user has to work out the logic of adding columns…
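The logic the user currently has to reimplement is roughly the hive-style `key=value` path parsing sketched below. This is a plain-Python illustration with a hypothetical helper name, not Polars code:

```python
from pathlib import PurePosixPath

def partition_columns(path: str) -> dict:
    """Extract hive-style partition columns from a file path.

    Each directory segment of the form key=value contributes one
    column, e.g. "data/year=2023/month=07/part-0.parquet"
    yields {"year": "2023", "month": "07"}.
    """
    cols = {}
    for segment in PurePosixPath(path).parts[:-1]:  # skip the file name
        if "=" in segment:
            key, _, value = segment.partition("=")
            cols[key] = value
    return cols
```

A reader with built-in support would additionally cast the string values to proper dtypes and attach them as constant columns to each file's rows.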
-
### Description
Currently, the Parquet reader does not seem to support the TIMESTAMP data type. We ran into an exception [here](https://github.com/facebookincubator/velox/blob/main/velox/dwio/parquet/read…