lweides / column-store


Benchmark #1

Open lweides opened 11 months ago

lweides commented 11 months ago
Eliasrpx commented 11 months ago

optional: introduce custom metrics:

lweides commented 10 months ago

Use cases to benchmark (may be extended in the future):

All benchmarks will be performed with JMH.

Filter for ids

Measure the performance of filtering for sparsely occurring ids in seconds and bytes read (if possible).
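One possible way to obtain the "bytes read" figure, sketched here under the assumption that the store reads its data through a `java.io.InputStream` (this thread does not say how reads are performed). `CountingInputStream` and `drain` are hypothetical names, not part of the project:

```java
import java.io.FilterInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.UncheckedIOException;

// Hypothetical helper: counts the bytes actually pulled from the wrapped stream.
class CountingInputStream extends FilterInputStream {
    private long bytesRead = 0;

    CountingInputStream(InputStream in) {
        super(in);
    }

    @Override
    public int read() throws IOException {
        int b = super.read();
        if (b != -1) {
            bytesRead++; // one byte consumed
        }
        return b;
    }

    @Override
    public int read(byte[] buf, int off, int len) throws IOException {
        int n = super.read(buf, off, len);
        if (n > 0) {
            bytesRead += n; // n bytes consumed by this bulk read
        }
        return n;
    }

    long bytesRead() {
        return bytesRead;
    }

    // Reads the whole stream and returns how many bytes were consumed.
    static long drain(InputStream source) {
        try (CountingInputStream in = new CountingInputStream(source)) {
            byte[] buf = new byte[4096];
            while (in.read(buf, 0, buf.length) != -1) {
                // keep reading until end of stream
            }
            return in.bytesRead();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }
}
```

Bytes passed over with `skip` are not counted in this sketch. If the store uses memory-mapped files or channels instead of streams, a different counting hook would be needed.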

Filter for strings

Filter for strings with the supported operations provided by our API (IS, STARTS_WITH, ENDS_WITH, CONTAINS). Measure the performance in seconds and bytes read (if possible).
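To pin down what each operation is expected to match, here is a rough reference implementation. The enum `StringOp` and its methods are hypothetical; only the operation names come from the list above, and the real API's types and null handling are not shown in this thread:

```java
import java.util.List;

// Hypothetical enum mirroring the four operations named above.
enum StringOp {
    IS, STARTS_WITH, ENDS_WITH, CONTAINS;

    // Returns true if `value` satisfies this operation for `operand`.
    boolean matches(String value, String operand) {
        if (value == null) {
            return false; // assumption: a null cell never matches
        }
        switch (this) {
            case IS:
                return value.equals(operand);
            case STARTS_WITH:
                return value.startsWith(operand);
            case ENDS_WITH:
                return value.endsWith(operand);
            case CONTAINS:
                return value.contains(operand);
            default:
                throw new AssertionError(this);
        }
    }

    // Counts the matching cells of an in-memory column; useful as a
    // correctness baseline for the benchmarked filter.
    static long countMatches(List<String> column, StringOp op, String operand) {
        long hits = 0;
        for (String value : column) {
            if (op.matches(value, operand)) {
                hits++;
            }
        }
        return hits;
    }
}
```

A naive scan like this can also serve as the baseline the column-store filter is compared against.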

Filter by timeframe

Filter on two long columns, treating them as start_time and end_time. The available filters are STARTS_IN, ENDS_IN, and OVERLAPS. Measure the performance in seconds and bytes read (if possible).
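The exact semantics of the three timeframe filters are not spelled out here. The sketch below shows one plausible reading, assuming a query window `[from, to]` with inclusive bounds; `TimeFilter` and its parameter names are hypothetical:

```java
// Hypothetical enum; the inclusive-bounds semantics are an assumption.
enum TimeFilter {
    STARTS_IN, ENDS_IN, OVERLAPS;

    // start/end: the record's start_time and end_time columns.
    // from/to:   the query window, both bounds inclusive.
    boolean matches(long start, long end, long from, long to) {
        switch (this) {
            case STARTS_IN:
                return from <= start && start <= to;
            case ENDS_IN:
                return from <= end && end <= to;
            case OVERLAPS:
                // Two closed intervals overlap if each starts before the other ends.
                return start <= to && end >= from;
            default:
                throw new AssertionError(this);
        }
    }
}
```

For a record spanning 10 to 20 and a window of 15 to 30, this reading gives STARTS_IN = false, ENDS_IN = true, and OVERLAPS = true.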

Read a small subset of available columns

Read all records, projecting only a small subset of the available columns, and measure the performance in seconds and bytes read (if possible).

Read all available columns

Read all records with every available column and measure the performance in seconds and bytes read (if possible).

Effectiveness of compression

Measure the compression ratio, i.e. input bytes divided by output bytes.
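As a small worked example of the ratio, the sketch below divides input bytes by output bytes. GZIP from the JDK is used purely as a stand-in compressor to produce an output size; it is not claimed to be what the column store uses, and `CompressionRatio` is a hypothetical helper name:

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.UncheckedIOException;
import java.util.zip.GZIPOutputStream;

// Hypothetical helper for the compression benchmark.
class CompressionRatio {

    // Ratio of input bytes to output bytes; values above 1.0 mean the data shrank.
    static double ratio(long inputBytes, long outputBytes) {
        if (outputBytes <= 0) {
            throw new IllegalArgumentException("outputBytes must be positive");
        }
        return (double) inputBytes / outputBytes;
    }

    // Stand-in compressor: returns the GZIP-compressed size of `input`.
    static long gzipSize(byte[] input) {
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        try (GZIPOutputStream gz = new GZIPOutputStream(out)) {
            gz.write(input);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        // The stream is closed here, so the GZIP trailer has been written.
        return out.size();
    }
}
```

For example, 1000 input bytes stored in 250 output bytes give a ratio of 4.0.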

Ingest data with a relatively stable column set

Ingest a dataset where the majority of records consist of the same columns. Measure both ingest performance in seconds and output bytes.

Ingest data with a relatively unstable column set

Ingest a dataset where the majority of records consist of different columns, or where many columns have no data (null values). Measure both ingest performance in seconds and output bytes.
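Both ingest scenarios could be driven by one synthetic data generator. The sketch below is only one way to model it: each column is present in a record with probability `presence`, so a value near 1.0 approximates the stable case and a low value approximates the unstable case with many nulls. The class, method, and column names are all hypothetical:

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.Random;

// Hypothetical generator for ingest benchmark input.
class IngestDataGenerator {

    // Builds `records` records over `columns` possible columns.
    // A column missing from a record's map models a null value.
    // A fixed seed keeps benchmark input reproducible across runs.
    static List<Map<String, Long>> generate(int records, int columns,
                                            double presence, long seed) {
        Random rnd = new Random(seed);
        List<Map<String, Long>> out = new ArrayList<>(records);
        for (int r = 0; r < records; r++) {
            Map<String, Long> record = new LinkedHashMap<>();
            for (int c = 0; c < columns; c++) {
                if (rnd.nextDouble() < presence) {
                    record.put("col" + c, rnd.nextLong());
                }
            }
            out.add(record);
        }
        return out;
    }
}
```

For instance, `generate(1_000_000, 50, 0.95, 42L)` might stand in for the stable case and `generate(1_000_000, 50, 0.2, 42L)` for the unstable one; the sizes and probabilities are illustrative, not values agreed in this thread.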

lweides commented 10 months ago

Examples for the use cases described above:

Eliasrpx commented 8 months ago

Elias: