Closed Eh2406 closed 7 years ago
travis is complaining because diff has been removed.
Our test show that this works but does not make a huge difference.
Good to know about the category data type. I can definitely see this being useful. Does this particular change still make sense even though there's not much performance difference?
Sorry, just got back from a trip.
I don't think it matters much, but I think it makes sense the point of the function is to make categories that may as well be "Categorical."
So just came across Categorical Data in pandas, and this blog post on how it dramatically improves performance on data with text categories.
This is not yet tested, but I think it makes sense. What do you think? Are there other places that need it?