Open dimenwarper opened 7 years ago
Sounds promising. Please submit a PR with the new functionality along with unit tests to demonstrate how it works.
I've implemented a draft of this but realized it may clash with the functionality of converting all text to numerical values. I wonder how to proceed, as I see it there are two options:
>=50
and >=50'
get encoded to the same label.One way to proceed would be to go with 1 and then tackle 2 in a later issue.
Thanks for this awesome tool! I was wondering if we could include some sanity checking/cleanup for badly behaved text (e.g. all those invalid unicode characters). Could be as simple as running ftfy on all text columns. I'd volunteer to integrate this into datacleaner.