jakartaresearch / adi-buzzer

Analyzing and Detecting Indonesia Buzzer in Twitter About Politics and Social Issues
3 stars 0 forks

EDA labeled account #19

Closed andreaschandra closed 3 years ago

andreaschandra commented 3 years ago

If we go rule-based, the clear boundaries between buzzer and non-buzzer accounts would probably be:

Task:

andreaschandra commented 3 years ago

Average tweets per account: [image: average-tweets-per-account]

Average retweets out of total tweets: [image: average_rt_per_account]

Average hashtags per user: [image: average_hashtag_per_user]

Average tweets containing a hashtag: [image: average_tweet_contain_hashtag]
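For reference, the four per-account statistics above could be computed roughly as follows. This is only a minimal sketch: the `(account, text, is_retweet)` row layout, the sample rows, and the function names are hypothetical, and hashtags are detected naively as whitespace-separated tokens starting with `#`. The retweet and hashtag figures are computed here as per-account shares, which is one possible reading of the plot titles.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical sample rows: (account, tweet text, is_retweet).
# These are made up for illustration, not data from this project.
tweets = [
    ("user_a", "RT @x: contoh #politik", True),
    ("user_a", "halo dunia", False),
    ("user_b", "#satu #dua tagar", False),
    ("user_b", "RT @y: tanpa tagar", True),
    ("user_b", "biasa saja", False),
]


def count_hashtags(text):
    """Naively count whitespace-separated tokens that start with '#'."""
    return sum(1 for tok in text.split() if tok.startswith("#") and len(tok) > 1)


def account_stats(rows):
    """Aggregate tweet, retweet, and hashtag counts per account."""
    per_account = defaultdict(
        lambda: {"tweets": 0, "retweets": 0, "hashtags": 0, "with_hashtag": 0}
    )
    for account, text, is_rt in rows:
        stats = per_account[account]
        n_tags = count_hashtags(text)
        stats["tweets"] += 1
        stats["retweets"] += int(is_rt)
        stats["hashtags"] += n_tags
        stats["with_hashtag"] += int(n_tags > 0)
    return dict(per_account)


def summarize(rows):
    """Average the per-account figures across all accounts."""
    stats = list(account_stats(rows).values())
    return {
        "avg_tweets_per_account": mean([s["tweets"] for s in stats]),
        "avg_rt_share_per_account": mean([s["retweets"] / s["tweets"] for s in stats]),
        "avg_hashtags_per_user": mean([s["hashtags"] for s in stats]),
        "avg_share_tweets_with_hashtag": mean(
            [s["with_hashtag"] / s["tweets"] for s in stats]
        ),
    }


print(summarize(tweets))
```

Running the same summary separately on the labeled buzzer and non-buzzer accounts would give comparable numbers for each group.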

andreaschandra commented 3 years ago

Dictionary of hashtags that need to be labeled: https://drive.google.com/file/d/1ggzzbHiz5cKXct7xzR_MrKsyVRLJwFjZ/view?usp=sharing

@rubentea16 @AndhikaS97

andreaschandra commented 3 years ago

@rubentea16 @AndhikaS97 here is the plot of the hashtags that have been labeled. The classes are imbalanced.

[image: hashtag_label]
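To put a number on the imbalance in the plot, something like the following could be used. It is a small sketch only: the label names `class_a`, `class_b`, `class_c` and their counts are placeholders, since the actual label set is not listed in this thread.

```python
from collections import Counter

# Placeholder labels; the real label names and counts are not given in this thread.
labels = ["class_a"] * 6 + ["class_b"] * 3 + ["class_c"] * 1


def label_distribution(labels):
    """Return the fraction of samples per label."""
    counts = Counter(labels)
    total = sum(counts.values())
    return {label: n / total for label, n in counts.items()}


def imbalance_ratio(labels):
    """Return the size of the largest class divided by the smallest class."""
    counts = Counter(labels)
    return max(counts.values()) / min(counts.values())


print(label_distribution(labels))
print(imbalance_ratio(labels))
```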

AndhikaS97 commented 3 years ago

What do we do if it's imbalanced like that? Do we need to keep labeling until it's balanced?

andreaschandra commented 3 years ago

Hmm... let me try modeling first. If it doesn't perform, then either the classes really are hard to separate because there isn't enough data, or the labels are inconsistent.

andreaschandra commented 3 years ago

Baseline

andreaschandra commented 3 years ago

Update: character trigrams

[5 images]
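For anyone following along, character trigrams are just every run of three consecutive characters in a string. A minimal sketch of extracting them is below. The function names are hypothetical, lowercasing is an assumption, and the thread does not say which text (hashtags or full tweets) the features were built from or which model they were fed to.

```python
from collections import Counter


def char_trigrams(text, n=3):
    """Return all overlapping character n-grams (default: trigrams) of text."""
    text = text.lower()  # assumption: case is not informative
    return [text[i:i + n] for i in range(len(text) - n + 1)]


def trigram_counts(text):
    """Count how often each character trigram occurs in text."""
    return Counter(char_trigrams(text))


print(char_trigrams("#2019"))
print(trigram_counts("aaaa"))
```

Strings shorter than three characters simply yield no trigrams.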

andreaschandra commented 3 years ago

Closing the EDA, guys @AndhikaS97 @rubentea16