cshjin / CS412Yelp

1 stars 3 forks source link

SelectKBest #4

Closed cshjin closed 8 years ago

cshjin commented 8 years ago

use SelectKBest function to select 20 best features

cshjin commented 8 years ago
'comp_cool'
'comp_cute'
'comp_funny'
'comp_hot'
'comp_list'
'comp_more'
'comp_note'
'comp_photos'
'comp_plain'
'comp_profile'
'comp_writer'
'elite'
'fans'
'friends'
'review_count'
'votes_cool'
'votes_funny'
'votes_useful',
cshjin commented 8 years ago

ANOVA scores for all the features:

('average_stars', 72147.05495138168), 
('stars', 42477.091361091814), 
('elite', 8523.7779180089328), 
('yelping_since', 4893.7518151340146), 
('votes_useful', 2634.3057101360305), 
('fans', 2107.0973370240094), 
('friends', 1961.0363279232472), 
('votes_cool', 1875.4106215988809), 
('votes_funny', 1654.1594296454862), 
('attr_touristy', 1470.7707442876811), 
('Caters', 1284.0163113155565), 
('review_count', 1165.0412688604752), 
('goodfor_latenight', 944.3208538770192), 
('Unnamed: 0', 918.83501565367521), 
('parking_garage', 831.63748410106882), 
('longitude', 706.53052690033837), 
('Noise Level', 606.72953105375871), 
('latitude', 513.50763529842118), 
('Wi-Fi', 498.62221185721899), 
('open', 482.58764400952953), 
('parking_lot', 452.06695565591542), 
('Attire', 439.27433756984129), 
('Wheelchair Accessible', 419.86774343210919), 
('Dogs Allowed', 392.78209984714988), 
('Has TV', 364.96849693260106), 
('attr_casual', 341.88033057508869), 
('Alcohol', 328.30330966876807), 
('comp_note', 320.66133340356436), 
('Delivery', 314.75431682570536), 
('BYOB/Corkage', 306.68145380547776), 
('Drive-Thru', 292.82989158000748), 
('attr_divey', 289.74828341673566), 
('comp_plain', 281.94875909741313), 
('goodfor_breakfast', 259.86446711602156), 
('comp_cool', 254.27095888516726), 
('Price Range', 252.71561815433063), 
('Good For Kids', 243.52179901873319), 
('Waiter Service', 241.09764373313331), 
('Corkage', 235.96532641198638), 
('Coat Check', 225.86060655165622), 
('BYOB', 210.18341157027839), 
('music_playlist', 195.35037419681314), 
('goodfor_brunch', 183.58962025465598), 
('comp_writer', 180.18841094528537), 
('state', 161.7802051572927), 
('Open 24 Hours', 160.22963357607907), 
('Good For Groups', 158.5928423151457), 
('Order at Counter', 152.86105153414181), 
('Good For Dancing', 151.01731246215593), 
('music_jukebox', 149.15489305721098), 
('Smoking', 141.43392369026697), 
('Happy Hour', 133.71131560543191), 
('goodfor_dessert', 132.34046636098418), 
('comp_funny', 125.65503168256301), 
('music_live', 123.57921538902372), 
('comp_hot', 122.13232950936487), 
('res_halal', 118.67359662517477), 
('res_gluten-free', 112.72839055535043), 
('res_kosher', 110.91505860611609), 
('music_dj', 106.74903151386562), 
('res_dairy-free', 106.46808376842471), 
('res_soy-free', 104.54055422879706), 
('music_video', 103.55182618987888), 
('res_vegetarian', 103.062840418447), 
('Takes Reservations', 95.860535090945547), 
('pt_mastercard', 88.981655607878551), 
('res_vegan', 87.631196838510078), 
('attr_intimate', 87.579088735947309), 
('pt_visa', 82.468145980897233), 
('pt_cash_only', 82.468145980897233), 
('pt_discover', 73.729571550008643), 
('pt_amex', 72.636699684951239), 
('parking_valet', 67.697639571566754), 
('parking_validated', 66.687333867602447), 
('attr_trendy', 66.21392451055533), 
('parking_street', 60.373992290882263), 
('attr_classy', 58.905851633262131), 
('comp_more', 56.144931507454281), 
('goodfor_lunch', 47.226163193430274), 
('attr_upscale', 42.401229381830042), 
('Take-out', 34.445900611201203), 
('music_karaoke', 29.700442497314899), 
('Ages Allowed', 24.547714011364381), 
('attr_romantic', 22.218516629870052), 
('city', 20.566991061701607), 
('Good for Kids', 19.757135240987107), 
('music_background_music', 17.717940776822832), 
('goodfor_dinner', 16.480012731688365), 
('Outdoor Seating', 15.658521638970504), 
('comp_cute', 15.383581555766252), 
('comp_photos', 14.978906779067374), 
('attr_hipster', 10.127722894606908), 
('comp_list', 8.539942984443611), 
('By Appointment Only', 7.0721823181309977), 
('Accepts Insurance', 6.4301352637243969), 
('comp_profile', 4.0831483690204919)