Open bdewilde opened 9 months ago
Hello burton, I'd like to work on this issue! TIA.
hi @Akshi22, don't let me get in your way! though it looks like @ujjawal-khare-27 has already submitted a PR to fix this issue. maybe you can help there?
For what it's worth, I just ran into this issue again, only this time in the context of `Dataset.groupby(col)`. It's the same error message, and presumably the same code under the hood. Just a bummer.
Hi, is this issue still open? If so, I'd like to get started contributing to Ray.io!
What happened + What you expected to happen
I wanted to get the unique values in a given column of my dataset, but some of the values are null for unavoidable reasons. Calling `Dataset.unique(colname)` on such data raises a TypeError, with differing specifics depending on how the column dtype is specified. This behavior was surprising since the equivalent operation on a `pandas.Series` works just fine, as does getting unique values via Python built-ins.
Here are two versions of the type error I got, seemingly from the same line of code:
and
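For comparison, here is a minimal sketch of the pandas and built-in behavior described above. It is not the original reproduction script and does not call Ray at all; the sample values are made up for illustration, and the column is forced to `object` dtype as an assumption so the result doesn't depend on pandas' default string handling.

```python
import pandas as pd

# Hypothetical sample column: strings with some nulls mixed in.
values = ["a", None, "b", "a", None]

# pandas: unique() on a Series containing nulls does not raise;
# the null is simply reported as one of the distinct values.
series_uniques = pd.Series(values, dtype="object").unique()

# Python built-ins: a set deduplicates the values, None included.
builtin_uniques = set(values)

print(series_uniques)
print(builtin_uniques)
```

Both approaches give three distinct values here: the two strings plus a single null entry.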
Versions / Dependencies
macOS 14.1, Python 3.9, ray == 2.9.0, pandas == 2.1.0
Reproduction script
Issue Severity
Medium: It is a significant difficulty but I can work around it.