ray-project / ray

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
https://ray.io
Apache License 2.0
34.23k stars 5.81k forks source link

[data] Documentation is formatted incorrectly #48974

Open richardliaw opened 3 days ago

richardliaw commented 3 days ago

The Ray Data groupby docs seems to be broken in formatting https://docs.ray.io/en/latest/data/api/doc/ray.data.Dataset.groupby.html, it doesn't clarify how it works. image

But I assume that grouping all columns meaning grouping everything together right, essentially === doing nothing ? If we apply the same idea to sort, then it also means we do nothing when it is set to None, wdyt? Both are explicit instead of implicitly grouping/sorting based on some hidden factors. It might be counterintuitive for Pandas users if we do the opposite behavior against Pandas.

Originally posted by @Superskyyy in https://github.com/ray-project/ray/issues/48926#issuecomment-2504769148

Superskyyy commented 3 days ago

Thanks, I will fix the doc next.