JasonKessler / scattertext

Beautiful visualizations of how language differs among document types.
Apache License 2.0

Clean_function docstring unclear #21

Closed David-Herman closed 6 years ago

David-Herman commented 6 years ago

https://github.com/JasonKessler/scattertext/blob/e2cfa882ed1d456f239ec8148453fd8452091371/scattertext/CorpusFromPandas.py#L20

Hello,

The docstring for the clean function may be unclear. I understood it to describe a step applied post-tokenization rather than one applied to each document's raw text. Perhaps the docstring could be improved to clarify this point.

Thank you

JasonKessler commented 6 years ago

Sure. The fix will come in the next release, 0.0.2.13.
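
For readers landing on this issue, here is a minimal sketch of the distinction being discussed: a clean function that receives each document's raw text and runs before tokenization. This is not scattertext's implementation. The names `clean_text` and `tokenize` are hypothetical stand-ins, and the cleaning rule (whitespace collapsing) is only an assumption chosen for illustration.

```python
import re
from typing import List


def clean_text(raw_text: str) -> str:
    """Hypothetical cleaner: takes one document's raw string, returns a string."""
    # Collapse runs of whitespace into single spaces and trim both ends.
    return re.sub(r"\s+", " ", raw_text).strip()


def tokenize(text: str) -> List[str]:
    """Hypothetical tokenizer standing in for a real NLP pipeline."""
    return re.findall(r"\w+", text.lower())


docs = ["  Hello,\n\nWorld!  ", "Second\tdocument here."]

# The cleaner sees the raw per-document text; tokenization happens afterwards.
cleaned = [clean_text(d) for d in docs]
tokens = [tokenize(d) for d in cleaned]
```

Because the cleaner takes and returns a plain string, it never sees token lists, which is the point the docstring needs to make explicit.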