mackncheesiest / FakeNewsClassifier

Uses Labeled Latent Dirichlet Allocation (LLDA) to attempt to classify news from two merged datasets as "real" or "fake" based on either article title or article title + hostname
0 stars 0 forks source link

FakeNewsClassifier

This repository consists of files used for an exploratory study of binary classification of "real" or "fake" news from a given dataset using a supervised variant of a topic modeling technique known as latent drichlet allocation(LDA).

Spoiler alert: it mostly just ends up learning to distinguish the difference in popular news topics between 2014 and 2016. Kanye West got married in 2014, and the 2016 election was in 2016 ¯\_(ツ)_/¯

Try it with datasets from the same year and maybe things'll turn out better.

[1] https://www.kaggle.com/mrisdal/fake-news [2] https://www.kaggle.com/uciml/news-aggregator-dataset