Open frankier opened 3 years ago
Might want to take a look also at Embeddia's tools https://github.com/ezosa/cross-lingual-linking https://github.com/sfermoy/TeMoCo and https://github.com/SkBlaz/rakun seem relevant
I like the ideas you have proposed for the task at hand. I plan to approach it as follow:
There are a few things that can be done here, but as well as potentially finding multilingual topic clusters, another related issue is whether the story is mainly about covid or just mentions it. e.g. if there are covid keywords in the headline or lede then probably it is, otherwise maybe it's just mentioned in relation to something else (XXX elected president... blah blah.. they'll have a lot on their plate including covid...)