Closed andreaschandra closed 3 years ago
Average tweets per account
average retweet dari total tweet
average hashtag per user
average tweet contains hashtag
Dictionary hashtag yang perlu di label https://drive.google.com/file/d/1ggzzbHiz5cKXct7xzR_MrKsyVRLJwFjZ/view?usp=sharing
@rubentea16 @AndhikaS97
@rubentea16 @AndhikaS97 ini plot dari hashtag yang dilabelin. imbalance
Kalo imbalance gitu gimana? Harus dilanjut labelin lagi kah sampe balance?
hmm... coba gw modeling dulu ya, kalo semisal ga perform, either emang susah buat diseperate karena kurang data, atau labelnya yang ga konsisten
Baseline
BernouliNB
SVM
Random Forest
AdaBoost
Gradient Boosting
close eda ya guys @AndhikaS97 @rubentea16
boundaries yang jelas antara buzzer dan non buzzer kalo pake rule based mungkin bakal:
Task: