twitte01 / 232R_GroupProject

UCSD Spring 2024 232R Big Data Analytics Using Spark Group Project
0 stars 2 forks source link

Determine Feature Expansion Needed #9

Closed twitte01 closed 6 months ago

CanIGetAnAman commented 6 months ago

We could combine all of the utility costs into 1 feature "utilities"

twitte01 commented 6 months ago

Combine education attainment with if currently in school & potentially private vs public but private vs public may be dependent on location

twitte01 commented 6 months ago

citizenship status with race may be interesting

twitte01 commented 6 months ago

potentially combine work variables (detailed class of work, looking for work) could potentially include if looking for work, hours worked and income

CanIGetAnAman commented 6 months ago

combine individual utility costs into 1 column "COSTUTIL", and created a column called "FULLTIME" that is a 1 if they work 40+ hours a week, and a 0 otherwise