wingyiuc / dsw-project

Reducing dimension for one-hot encoding #13

Closed wingyiuc closed 4 months ago

wingyiuc commented 4 months ago

There are too many property types, so one-hot encoding produces a lot of columns (35 are listed below). We may need to reduce the dimension using PCA.

'one_hot_encoder__property_type_Apartment', 'one_hot_encoder__property_type_Bed & Breakfast', 'one_hot_encoder__property_type_Boat', 'one_hot_encoder__property_type_Boutique hotel', 'one_hot_encoder__property_type_Bungalow', 'one_hot_encoder__property_type_Cabin', 'one_hot_encoder__property_type_Camper/RV', 'one_hot_encoder__property_type_Casa particular', 'one_hot_encoder__property_type_Castle', 'one_hot_encoder__property_type_Cave', 'one_hot_encoder__property_type_Chalet', 'one_hot_encoder__property_type_Condominium', 'one_hot_encoder__property_type_Dorm', 'one_hot_encoder__property_type_Earth House', 'one_hot_encoder__property_type_Guest suite', 'one_hot_encoder__property_type_Guesthouse', 'one_hot_encoder__property_type_Hostel', 'one_hot_encoder__property_type_House', 'one_hot_encoder__property_type_Hut', 'one_hot_encoder__property_type_In-law', 'one_hot_encoder__property_type_Island', 'one_hot_encoder__property_type_Lighthouse', 'one_hot_encoder__property_type_Loft', 'one_hot_encoder__property_type_Other', 'one_hot_encoder__property_type_Parking Space', 'one_hot_encoder__property_type_Serviced apartment', 'one_hot_encoder__property_type_Tent', 'one_hot_encoder__property_type_Timeshare', 'one_hot_encoder__property_type_Tipi', 'one_hot_encoder__property_type_Townhouse', 'one_hot_encoder__property_type_Train', 'one_hot_encoder__property_type_Treehouse', 'one_hot_encoder__property_type_Vacation home', 'one_hot_encoder__property_type_Villa', 'one_hot_encoder__property_type_Yurt'
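
For comparison, here is a minimal sketch of a simpler alternative to PCA: lumping rare property types into the existing "Other" category before encoding. This is not what the project does; the sample data, the `min_count` threshold, and the helper names `lump_rare` and `one_hot` are all hypothetical and only illustrate how the column count shrinks.

```python
from collections import Counter


def lump_rare(values, min_count):
    """Replace categories seen fewer than min_count times with 'Other'."""
    counts = Counter(values)
    return [v if counts[v] >= min_count else "Other" for v in values]


def one_hot(values):
    """One-hot encode a list of labels; returns (column_names, rows)."""
    categories = sorted(set(values))
    names = [f"property_type_{c}" for c in categories]
    rows = [[1 if v == c else 0 for c in categories] for v in values]
    return names, rows


# Hypothetical sample: a few common types plus several one-off types.
property_types = (
    ["Apartment"] * 6
    + ["House"] * 4
    + ["Condominium"] * 3
    + ["Yurt", "Castle", "Lighthouse", "Tipi"]
)

names_before, _ = one_hot(property_types)
lumped = lump_rare(property_types, min_count=3)
names_after, rows_after = one_hot(lumped)

print(len(names_before), "columns before lumping")
print(len(names_after), "columns after lumping:", names_after)
```

Unlike PCA, the resulting columns stay interpretable, but the threshold is a judgment call and the rare types become indistinguishable from each other.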

wingyiuc commented 4 months ago

https://www.r-bloggers.com/2022/11/pca-for-categorical-variables-in-r/#:~:text=PCA%20won't%20be%20effective,data%20set%20with%20categorical%20variables.

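PCA is not well suited to categorical variables (see the link above). Maybe something like FAMD (factor analysis of mixed data) or MCA (multiple correspondence analysis) instead.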
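
To make the MCA idea concrete, here is a rough sketch using only NumPy: it builds the indicator (one-hot) matrix for several categorical columns and takes an SVD of its standardized residuals. The function name `mca_row_coordinates`, the `room_type` column, and the sample rows are assumptions for illustration, not taken from the project, and this is not necessarily how the issue was resolved.

```python
import numpy as np


def mca_row_coordinates(columns, n_components=2):
    """Multiple correspondence analysis via SVD of the indicator matrix.

    columns: list of equal-length lists of category labels, one per variable.
    Returns an (n_rows, n_components) array of principal row coordinates.
    """
    # Build the indicator matrix Z: one block of 0/1 columns per variable.
    blocks = []
    for col in columns:
        cats = sorted(set(col))
        blocks.append(np.array([[float(v == c) for c in cats] for v in col]))
    Z = np.hstack(blocks)

    P = Z / Z.sum()            # correspondence matrix
    r = P.sum(axis=1)          # row masses
    c = P.sum(axis=0)          # column masses
    expected = np.outer(r, c)  # values expected under independence
    S = (P - expected) / np.sqrt(expected)  # standardized residuals

    U, s, _ = np.linalg.svd(S, full_matrices=False)
    # Row coordinates are D_r^{-1/2} U Sigma, truncated to n_components.
    return (U[:, :n_components] * s[:n_components]) / np.sqrt(r)[:, None]


# Hypothetical listings with two categorical variables.
property_type = ["Apartment", "Apartment", "House", "House",
                 "Loft", "Apartment", "Villa", "House"]
room_type = ["Entire home", "Entire home", "Private room", "Entire home",
             "Entire home", "Private room", "Entire home", "Private room"]

coords = mca_row_coordinates([property_type, room_type], n_components=2)
print(coords.shape)
```

MCA needs more than one categorical variable to be useful; applied to `property_type` alone, every component carries the same inertia, so nothing is gained by truncating. FAMD extends the same idea to a mix of categorical and numeric columns.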

vanessadada commented 4 months ago

solved in the latest pull request