sjbitcode / daca-news

A news aggregation site specifically for DACA related articles, updated hourly. Built with Django ❤️
https://daca.news
4 stars 0 forks source link

Implement fuzzy matching for similar titled articles #16

Open sjbitcode opened 3 years ago

sjbitcode commented 3 years ago

Problem

As of 12/28/20, I came across three similarly titled articles, the difference is that some titles have the source included in it, ex:

In this function of the article pipeline, I implement a function to check article titles 15 days prior and ahead for the same title.

Some solutions: