SennaMa / poke_haiku

MIT License

Clean input and save results in CSV #8

Closed by SennaMa 10 months ago

SennaMa commented 10 months ago

What: After doing some research, settling on a set of parameters, and pulling the data, we want to clean the raw data, specifically the descriptions.

Why: Cleaned data will produce better results.

How: Nothing fancy. Remove newline characters (\n), form feed characters (\u000c), and null bytes (\x00). Save the cleaned result as a new column in the CSV so the original records are preserved.
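
A minimal sketch of the cleaning step, assuming Python and the standard csv module. The column names `description` and `description_clean` and both function names are hypothetical, not taken from the repo. One deliberate deviation: newlines and form feeds are replaced with a space rather than deleted outright, so adjacent words are not glued together; null bytes are simply dropped.

```python
import csv
import io

# Characters named in the issue. Newline and form feed become a space
# (an assumption, to keep words separated); null bytes are removed.
_REPLACEMENTS = {"\n": " ", "\u000c": " ", "\x00": ""}


def clean_description(text: str) -> str:
    """Return text without newlines, form feeds, or null bytes."""
    for old, new in _REPLACEMENTS.items():
        text = text.replace(old, new)
    # Collapse any runs of whitespace left behind by the replacements.
    return " ".join(text.split())


def add_clean_column(src, dst, source_col="description",
                     clean_col="description_clean"):
    """Copy CSV rows from src to dst, appending a cleaned column.

    src and dst are text file objects opened with newline="".
    The original column is left untouched to preserve old records.
    """
    reader = csv.DictReader(src)
    fieldnames = list(reader.fieldnames or []) + [clean_col]
    writer = csv.DictWriter(dst, fieldnames=fieldnames)
    writer.writeheader()
    for row in reader:
        row[clean_col] = clean_description(row[source_col])
        writer.writerow(row)


# Small in-memory demo with a made-up row.
demo_in = io.StringIO(
    'name,description\r\nexample,"first line\nsecond\x0cline"\r\n', newline=""
)
demo_out = io.StringIO(newline="")
add_clean_column(demo_in, demo_out)
```

With real files, open both with `newline=""` as the csv documentation recommends. If the csv reader in your Python version rejects null bytes in the input, strip them from the raw text before parsing instead of after.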