flairNLP / fundus

A very simple news crawler with a funny name
MIT License
288 stars 74 forks source link

[Question] Should we classify the publishers? (Examples might include satire, sport news, general news, ...) #186

Open Weyaaron opened 1 year ago

Weyaaron commented 1 year ago

Parsers like #185 raise a question: Should we add publisher types? While they increase the complexity, I am in favor because not adding it might lead to a nasty surprise for users down the line. #80 is related as well, me might add a type like 'Sport News'

MaxDall commented 1 year ago

Hmm, very interesting point. Did you stumble upon anything which could reliably classify article types?

Weyaaron commented 1 year ago

I would not attempt this for each article on its own, but classify the publishers instead. I have changed the title to better reflect this.