Squishy123 / CPS803-Project

Fake News Detection
GNU Affero General Public License v3.0
0 stars 0 forks source link

Clean dataset #1

Open dahkimr opened 3 years ago

dahkimr commented 3 years ago

Code to remove unnecessary punction, etc.

Resource: https://towardsdatascience.com/nlp-in-python-data-cleaning-6313a404a470

Squishy123 commented 3 years ago

We can also look to combine datasets into multiple csvs to make it easier I think having 3 training datasets and 3 testing datasets would be optimal

Something like a small, medium and large dataset.