columnar-storage-format Search Results

487 results
for columnar-storage-format

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

apache/datafusion #2962

Can't filter rowgroup for parquet prune for some data type

**Describe the bug** In the `RowGroupPruningStatistics`, we use the statistics to prune the row group for parquet file. In the below logical: https://github.com/apache/arrow-datafusion/blob/f386…

liukun4515 updated 2 years ago
2
NVIDIA/spark-rapids #5720

Question about stream time metric

hi， I run a SQL which contains four stages, and the 1st stage aims to scan the parquet files and prepare shuffle write data for the next stage and the mean time of tasks is about 4s. To reduce the …

cfangplus updated 2 years ago
3
duckdb/duckdb #5675

FSST string compression failed due to incorrect size calcula…

### What happens? When trying to create a table like this ```sql CREATE TABLE xxx AS SELECT tbl.*, '12345' AS dedup_group FROM read_parquet('path/glob/*.snappy.parquet') AS tbl…

RXminuS updated 1 year ago
39
apache/pinot #4230

NULL value support for all data types

Currently in Pinot we don't have real NULL value support, but use some special default values for NULL. For dimensions, the default value is the minimum value for numeric types, "null" for STRING, emp…

Jackie-Jiang updated 1 year ago
34
ClickHouse/ClickHouse #23516

Support of dynamic subcolumns in tables.

Inroduce new data type `Object()`, which will get the name of format for semi-structured data (`JSON`, `XML`, etc.). Initially it will work only with `MergeTree` tables. Maybe later will add some oth…

CurtizJ updated 2 years ago
43
pandas-dev/pandas #5913

HDF5 Select with Filter gives incorrect results when using I…

Linked Issue: https://github.com/PyTables/PyTables/issues/319 Hi, I have a dataframe saved in HDF5 with 6.7 million records (about 425MB). As you can see below, it gives an incorrect result when it …

CarstVaartjes updated 2 years ago
21
protomaps/PMTiles #62

Alternative dir structure for a more compact storage and pro…

I would like to propose some changes to the directory structure, but these might be totally irrelevant due to my misunderstanding. Current directory entry is fixed at 17bytes, stores x,y as individ…

nyurik updated 2 years ago
41
apache/arrow #13875

How does arrow parse its ipc format?

Hi there, I am trying to hack the arrow IPC format, I am confused about how does arrow differentiate between different type in record batch buffers and parse it. for example, now I store a data fra…

liusitan updated 2 years ago
7
open-telemetry/opentelemetry-specification #2726

Mistake while applying the logic in #2617 from #2589: proces…

The issue https://github.com/open-telemetry/opentelemetry-specification/issues/2589 explains why the direction for `disk` is not the right thing, which makes total sense. But we got that blindly an…

bogdandrutu updated 2 years ago
27
pandas-dev/pandas #5902

PERF: Load data (create Series, Dataframe) in a more functio…

I've been trying to find out the process of creating DataFrames in order to try to solve #2305 with minimal memory use. I've made some tests that put in a IPython notebook: http://nbviewer.ipython.org…

tinproject updated 2 years ago
21

上一页 1...27 28 29 30 31 32 33...49 下一页

487 results for columnar-storage-format

487 results
for columnar-storage-format