rapidsai / cudf

cuDF - GPU DataFrame Library
https://docs.rapids.ai/api/cudf/stable/
Apache License 2.0
8.02k stars 871 forks source link

[FEA] Support multiple joins at once #6256

Open bryevdv opened 3 years ago

bryevdv commented 3 years ago

Is your feature request related to a problem? Please describe.

Pandas supports passing multiple dataframes to join at once, cudf does not.

ref: https://pandas.pydata.org/pandas-docs/stable/user_guide/merging.html#joining-multiple-dataframes

Describe the solution you'd like

To be able to join multiple dataframes efficiently in one call:

result = left.join([right, right2])

Describe alternatives you've considered

Can currently use cudf.concat in some cases:

df_wide = cudf.concat([...], axis=1)

however concat does not accept a join keyword to specify the kind of join to perform.

github-actions[bot] commented 3 years ago

This issue has been marked rotten due to no recent activity in the past 90d. Please close this issue if no further response or action is needed. Otherwise, please respond with a comment indicating any updates or changes to the original issue and/or confirm this issue still needs to be addressed.

jrhemstad commented 2 years ago

@shwina would this be relevant for using the object-based join from libcudf?

shwina commented 2 years ago

Yes, I think it absolutely would.