pyjanitor-devs / pyjanitor

Clean APIs for data cleaning. Python implementation of R package Janitor
https://pyjanitor-devs.github.io/pyjanitor
MIT License
1.37k stars 170 forks source link

[ENH] More control on creating the output dataframe for `conditional_join` - PR no 1 #1396

Closed samukweku closed 2 months ago

samukweku commented 2 months ago

This is the first part of a series of PRs to improve conditional_join, with the final aim being to support aggregations within conditional_join. Supporting aggregations within conditional_join means we can avoid materializing the dataframe before aggregating, and should generally improve performance for such operations.

PR Description

Please describe the changes proposed in the pull request:

This PR relates to #1269 .

Please tag maintainers to review.

ericmjl commented 2 months ago

🚀 Deployed on https://deploy-preview-1396--pyjanitor.netlify.app