scicloj / tablecloth

Dataset manipulation library built on top of tech.ml.dataset
https://scicloj.github.io/tablecloth
MIT License

Refactor joins to use `pd-merge` #45

Open genmeblog opened 3 years ago

genmeblog commented 3 years ago

https://techascent.github.io/tech.ml.dataset/tech.v3.dataset.join.html#var-pd-merge

Currently, joins on multiple columns are performed by calling tmd/left-join (or right-join) iteratively, column by column. Switching to pd-merge, which accepts a list of join columns in a single call, should allow a more efficient path.
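
For illustration, here is a rough sketch of what a single multi-column merge might look like when calling `pd-merge` directly. The datasets, the column names (`:a`, `:b`, `:lv`, `:rv`) and the `ds-join` alias are made up for this example, and the option keys (`:on`, `:how`) follow the linked `pd-merge` docs. This is not the proposed tablecloth implementation, only the shape of the call it could delegate to.

```clojure
(require '[tech.v3.dataset :as ds]
         '[tech.v3.dataset.join :as ds-join])

;; Two small, hypothetical datasets that share the key columns :a and :b.
(def left
  (ds/->dataset {:a  [1 1 2]
                 :b  ["x" "y" "x"]
                 :lv [10 20 30]}))

(def right
  (ds/->dataset {:a  [1 2 2]
                 :b  ["x" "x" "y"]
                 :rv [100 200 300]}))

;; One call joining on both key columns at once,
;; instead of a separate join per column.
(ds-join/pd-merge left right {:on [:a :b] :how :left})
```

If the key columns are named differently on each side, the same docs describe `:left-on` and `:right-on` as alternatives to `:on`.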

PRs welcome.