-
I want to look up the terms for a given document, that is, `Document -> Field -> Terms`, something like what term vectors provide in Lucene. (However, I see positions are already [stored a little …
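To make the requested `Document -> Field -> Terms` lookup concrete, here is a minimal sketch of deriving such a forward mapping from an inverted index. The index layout, the sample data, and the name `build_forward_index` are all hypothetical and only for illustration; this is not Lucene's API or this project's storage format.

```python
from collections import defaultdict

# Hypothetical inverted index: field -> term -> set of document ids.
inverted = {
    "title": {"columnar": {1, 2}, "storage": {1}},
    "body": {"parquet": {2}, "columnar": {2}},
}


def build_forward_index(inverted_index):
    """Invert the mapping to doc id -> field -> set of terms."""
    forward = defaultdict(lambda: defaultdict(set))
    for field, terms in inverted_index.items():
        for term, doc_ids in terms.items():
            for doc_id in doc_ids:
                forward[doc_id][field].add(term)
    return forward


forward = build_forward_index(inverted)
# Terms of document 2, grouped by field.
body_terms = sorted(forward[2]["body"])
title_terms = sorted(forward[2]["title"])
```

Building the forward mapping once trades extra storage for direct per-document lookups, which is roughly the trade-off term vectors make.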
-
Parquet is a columnar storage file format that is commonly used in the context of big data processing frameworks, such as Apache Spark and Apache Hive. The format is designed to be highly efficient fo…
-
**Summary**
Enable data compression for the rapid engine, and support scanning compressed data blocks without decompressing them.
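As a rough illustration of scanning without decompressing, the sketch below evaluates a predicate once per run of a run-length-encoded column instead of once per row. Run-length encoding is an assumption chosen for simplicity; the request does not say which compression scheme is intended, and `rle_encode` and `count_matching` are hypothetical helper names.

```python
from typing import Callable, List, Tuple

Run = Tuple[int, int]  # (value, run_length)


def rle_encode(values: List[int]) -> List[Run]:
    """Compress a column into (value, run_length) pairs."""
    runs: List[Run] = []
    for v in values:
        if runs and runs[-1][0] == v:
            runs[-1] = (v, runs[-1][1] + 1)
        else:
            runs.append((v, 1))
    return runs


def count_matching(runs: List[Run], predicate: Callable[[int], bool]) -> int:
    """Count rows satisfying the predicate without expanding the runs."""
    return sum(length for value, length in runs if predicate(value))


column = [7, 7, 7, 3, 3, 9, 9, 9, 9]
runs = rle_encode(column)
matches = count_matching(runs, lambda v: v > 5)
```

The predicate runs three times here (once per run) rather than nine times (once per row), which is where the scan saves work.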
```
"Lightweight Indexing on Compressed Data"
aut…
-
[ClickHouse](https://clickhouse.tech/) is a columnar store that can be queried with SQL.
-
In [Apache Arrow vs. Parquet and ORC: Do we really need a third Apache project for columnar data representation?](http://dbmsmusings.blogspot.com/2017/10/apache-arrow-vs-parquet-and-orc-do-we.html), c…
-
What to do if one disruptively needs a disruptive™ solution for semi-mutual non-exclusive recursion for an enterprise multi-cloud ready Hello™ World™ deployment? This question is entirely OS agnostic…
-
This issue is meant to be a running list of ideas to improve performance in the columnar format.
- [x] ~Don't separate suffix into wall time / logical time / untyped version. The hypothesis is that…
-
SAP HANA is a proprietary DB that doesn't speak the MySQL or Postgres dialects and is a columnar store, so I'd love to see the ability to use Quokka to query it.
-
**Description**
Support direct writes into TiFlash without a TiKV replica.
**Category**
Feature
**Value**
Enable a lot more pure columnar format oriented scenarios without bothering storing as r…
-
# Data Format Changes
## Introduction:
The current data format used in our application has some limitations, such as a lack of clarity and the need for expensive calculated columns to be generated…