jbesomi / texthero

Text preprocessing, representation and visualization from zero to hero.
https://texthero.org
MIT License
2.88k stars 239 forks source link

Getting started: preprocessing #144

Open jbesomi opened 4 years ago

jbesomi commented 4 years ago

Task: write the "Getting started: preprocessing" doc page

Advice/Tips to the technical writer

Good to know:

Concept useful to have clear in mind:

Things to keep in mind when writing:

To stay in the technical discussion loop:

Page

aim: learn how to preprocess text-based dataset with Texthero

content:

Iota87 commented 4 years ago

I am structuring this part as follows (based on review of similar contexts, including Texthero "Getting Started" structure): Overview/Intro Why is pre-processing crucial and what are the benefits of having a standardized/customizable pipeline Clean What it does and how Custom Pipeline Why and how you should take control of the pre-processing steps More details Including pre-processing API functionalities

Please let me know if something is not clear or if you have any additional suggestions.