uccross / skyhookdm-ceph-cls

Skyhook Data Management: Storage and management of tabular data in Ceph.
https://www.skyhookdm.com
GNU Lesser General Public License v2.1

arrow-cls: work on benchmarks #62

Open ivotron opened 4 years ago

ivotron commented 4 years ago

Create a benchmark based on the one described at https://blog.thedataincubator.com/2018/05/sqlite-vs-pandas-performance-benchmarks/ (SQLite vs. pandas). Initially, we'll partition a dataset into Parquet and IPC files and use those partitions to benchmark the IPC implementation against the Parquet one. We'll then run the same benchmark on RADOS (once it's ready).
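For illustration only, here is a rough sketch of what the timing side of such a benchmark could look like. It is not the benchmark from this repository: the function names `time_reader` and `compare_formats` are hypothetical, and the harness assumes each format is represented by a list of partition files plus a reader callable that loads one file.

```python
import statistics
import time
from pathlib import Path
from typing import Callable, Dict, Iterable, List


def time_reader(
    read: Callable[[Path], object],
    paths: Iterable[Path],
    repeats: int = 3,
) -> List[float]:
    """Return the wall-clock seconds taken by each full pass over `paths`."""
    paths = list(paths)
    timings = []
    for _ in range(repeats):
        start = time.perf_counter()
        for path in paths:
            read(path)  # result is discarded; only the read time matters
        timings.append(time.perf_counter() - start)
    return timings


def compare_formats(
    readers: Dict[str, Callable[[Path], object]],
    partitions: Dict[str, List[Path]],
    repeats: int = 3,
) -> Dict[str, float]:
    """Return the median seconds per pass for each format name in `readers`."""
    return {
        name: statistics.median(time_reader(read, partitions[name], repeats))
        for name, read in readers.items()
    }
```

With pyarrow installed, the readers could be something like `pyarrow.parquet.read_table` for the Parquet partitions and `lambda p: pyarrow.ipc.open_file(p).read_all()` for the IPC ones. Using the median over several passes is a design choice to dampen the effect of a cold page cache on the first pass.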

JayjeetAtGithub commented 3 years ago

The IPC vs Parquet benchmarks can be found here.