Closed tvdboom closed 10 months ago
I agree that string and arrow string should be recognized as categorical.
Even the categorical type itself is currently not recognized as such.
https://github.com/scikit-learn-contrib/category_encoders/blob/80b4a9b9d85ac449fcb7a3543c6ee4353013f41f/category_encoders/utils.py#L35
That's the function that needs to be adjusted (and renamed).
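For illustration, here is a minimal sketch of the kind of dtype check being discussed. It is not the library's code: the helper names `is_categorical_like` and `categorical_like_columns` are hypothetical, and the sketch only assumes that object, pandas string, and category dtypes should all be treated as categorical.

```python
import pandas as pd
from pandas.api.types import is_object_dtype


def is_categorical_like(dtype) -> bool:
    """Hypothetical check: True for object, string-extension and category dtypes."""
    return (
        is_object_dtype(dtype)
        # pd.StringDtype covers both "string" and "string[pyarrow]"
        or isinstance(dtype, pd.StringDtype)
        or isinstance(dtype, pd.CategoricalDtype)
    )


def categorical_like_columns(df: pd.DataFrame) -> list:
    """Return the names of the columns whose dtype passes the check above."""
    return [col for col, dtype in df.dtypes.items() if is_categorical_like(dtype)]


df = pd.DataFrame(
    {
        "obj": ["a", "b", "a"],
        "str": pd.Series(["a", "b", "a"], dtype="string"),
        "cat": pd.Series(["a", "b", "a"], dtype="category"),
        "num": [1, 2, 3],
    }
)
print(categorical_like_columns(df))  # ['obj', 'str', 'cat']
```

A `string[pyarrow]` column would pass the same `pd.StringDtype` check; it is left out of the example only because creating one requires pyarrow to be installed.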
Alright, I'll make a PR.
thanks
Expected Behavior
Category encoders should recognize pandas `string` and `string[pyarrow]` types.
Actual Behavior
The column isn't recognized as categorical, and the dataframe is returned as is.
Steps to Reproduce the Problem
produces output:
Specifications