Use a script to collect the titles from select/specified subreddits and use this collection to learn and predict the likelihood of a given title. This is used to predict which subreddit a chosen title should belong too.
1
stars
0
forks
source link
Add start/end characters and track the distributions accordingly #4
Don't have a mechanism to say this is the first word or this is the last word. That should be included in the bigram distribution ie
P(word | prevword) = P(What | )
Don't have a mechanism to say this is the first word or this is the last word. That should be included in the bigram distribution ie P(word | prevword) = P(What |)