twitte01 / 232R_GroupProject

UCSD Spring 2024 232R Big Data Analytics Using Spark Group Project
0 stars 2 forks source link

Split Train and Test Dataset #56

Closed twitte01 closed 4 months ago

twitte01 commented 4 months ago

After variable encoding etc. split training, validation and test datasets

CanIGetAnAman commented 4 months ago

Code for splitting datasets into train/test is complete. make sure to keep the same seed (this will ensure we get the same reproducible split)