h2oai / db-benchmark

reproducible benchmark of database-like ops
https://h2oai.github.io/db-benchmark
Mozilla Public License 2.0
326 stars 88 forks source link

add R's disk.frame as a solution #88

Open xiaodaigh opened 5 years ago

xiaodaigh commented 5 years ago

See https://github.com/xiaodaigh/disk.frame/issues/36

Happy to help out. I try to work on this on Sunday mornings.

MichaelChirico commented 5 years ago

Here's an article w some sample syntax:

http://diskframe.com/articles/vs-dask-juliadb.html

jangorecki commented 5 years ago

@xiaodaigh your pkgdown seems to suffer from the same problem as in-dev data.table pkgdown site, see https://github.com/r-lib/pkgdown/issues/1120 for details

xiaodaigh commented 5 years ago

Thanks @jangorecki. It's been fixed. It's actually cos I didn't upload the image. But it helped me fixed another issue.

xiaodaigh commented 5 years ago

https://www.brodrigues.co/blog/2019-09-03-disk_frame/

A blogpost showing it's much faster than spark.

jangorecki commented 4 years ago

@xiaodaigh any plans to contribute test script for disk.frame? if so, please work on top of https://github.com/h2oai/db-benchmark/tree/cmd-launcher which should greatly help in reproducibility of a single solution benchmark, so also help in developing new solution