arendakessian / spring2020-ml-project

fake review detection system

Fix vectorizing/downsampling order #11

Closed guidopetri closed 4 years ago

guidopetri commented 4 years ago

We currently downsample after vectorizing, but computationally it makes more sense to downsample before vectorizing. The vectorizer is then fit on fewer documents, so we get a smaller vocabulary (fewer features) and thus a lower memory load when training models.
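
A rough sketch of the proposed order, not the project's actual code: `downsample` and `fit_vocabulary` are hypothetical helpers, the reviews are made-up toy data, and `fit_vocabulary` is only a plain-Python stand-in for whatever vectorizer the pipeline really uses. It assumes binary labels where the majority class is at least as large as the minority class.

```python
import random


def downsample(texts, labels, majority_label, seed=0):
    """Randomly drop majority-class rows until both classes have equal counts."""
    rng = random.Random(seed)
    majority = [i for i, y in enumerate(labels) if y == majority_label]
    minority = [i for i, y in enumerate(labels) if y != majority_label]
    # Keep every minority row plus an equally sized random sample of majority rows.
    kept = sorted(rng.sample(majority, len(minority)) + minority)
    return [texts[i] for i in kept], [labels[i] for i in kept]


def fit_vocabulary(texts):
    """Toy stand-in for a vectorizer's fit step: map each token to a column index."""
    tokens = sorted({tok for text in texts for tok in text.lower().split()})
    return {tok: col for col, tok in enumerate(tokens)}


# Hypothetical toy data: 0 = genuine review (majority), 1 = fake review (minority).
texts = [
    "great food friendly staff",
    "slow service cold soup",
    "lovely patio nice view",
    "parking was difficult",
    "tasty burgers cheap drinks",
    "quiet corner table",
    "best place ever amazing",
    "amazing amazing best ever",
]
labels = [0, 0, 0, 0, 0, 0, 1, 1]

# Current order: the vocabulary is fit on every review, including rows dropped later.
vocab_full = fit_vocabulary(texts)

# Proposed order: downsample first, then fit the vocabulary on what remains.
small_texts, small_labels = downsample(texts, labels, majority_label=0)
vocab_small = fit_vocabulary(small_texts)

print(len(vocab_full), len(vocab_small))
```

In this toy data, the vocabulary fit after downsampling is strictly smaller because tokens that only appear in dropped reviews never become feature columns. How much this saves on the real dataset would depend on the class imbalance and how much vocabulary the dropped reviews contribute.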