KeithCu / LinuxReport

Customizable Linux news site based on Python / Flask
https://covidreport.keithcu.com/
GNU Lesser General Public License v3.0

Smarter refreshing via machine learning #18

Open KeithCu opened 4 years ago

KeithCu commented 4 years ago

Currently the system fetches each feed on a fixed schedule: usually every hour, or every 6 hours for sites that usually update just once per day. It does this 24/7.

It should be possible to have the system learn, per feed, when the site usually updates, and then only make requests around then. This could be done with a simple heuristic (keeping track of a week's worth of updates), or by applying a machine learning algorithm. It would be great if it could keep learning over time.
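A rough sketch of the "keep track of a week's worth of updates" idea, not existing LinuxReport code. `UpdateHistogram`, `record_update`, `end_of_week`, `likely_hours`, and the `decay` and `threshold` values are all hypothetical names and numbers picked for illustration. It counts which hours of the day new items show up for one feed, and fades old counts so the pattern can shift over time:

```python
from datetime import datetime


class UpdateHistogram:
    """Hypothetical per-feed tracker of which hours of the day new items appear."""

    def __init__(self, decay=0.9):
        # decay < 1 shrinks old counts so recent behavior matters more (assumed value).
        self.decay = decay
        self.weights = [0.0] * 24  # one bucket per hour of the day

    def record_update(self, when: datetime) -> None:
        """Call when a fetch finds new content; `when` is the item's timestamp."""
        self.weights[when.hour] += 1.0

    def end_of_week(self) -> None:
        """Fade older observations, e.g. once per week."""
        self.weights = [w * self.decay for w in self.weights]

    def likely_hours(self, threshold=0.5):
        """Hours whose weight is at least `threshold` times the busiest hour's."""
        peak = max(self.weights)
        if peak == 0:
            # Nothing learned yet: fall back to fetching at every hour.
            return list(range(24))
        return [h for h, w in enumerate(self.weights) if w >= threshold * peak]


# Example: a feed that posted at 09:xx on seven days and once at 15:xx.
hist = UpdateHistogram()
for day in range(1, 8):
    hist.record_update(datetime(2024, 1, day, 9, 30))
hist.record_update(datetime(2024, 1, 3, 15, 5))
busy_hours = hist.likely_hours()
```

The scheduler would then only fetch during the hours in `busy_hours`. A real version would probably still need an occasional fetch outside those hours so it can notice when a site changes its habits.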

This would also be better for the sites that update once per day, because the system could try to fetch new posts soon after they usually appear, rather than up to 6 hours later.
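For the once-per-day case, something like the following might work. This is only a sketch under assumptions: `next_fetch_time` and `delay_minutes` are made-up names, and it assumes the feed's posting times do not straddle midnight (a plain median of minutes-since-midnight breaks in that case). It takes the median time of day of recent posts and schedules the next fetch shortly after it:

```python
from datetime import datetime, timedelta
from statistics import median


def next_fetch_time(now, post_times, delay_minutes=15):
    """Return the next datetime to fetch a once-per-day feed.

    post_times: non-empty list of datetimes when past posts appeared (assumed input).
    delay_minutes: grace period after the usual posting time (assumed value).
    """
    # Reduce each past post to minutes since midnight, then take the median.
    minutes = [t.hour * 60 + t.minute for t in post_times]
    target = int(median(minutes)) + delay_minutes

    # Today's candidate fetch time; if it has already passed, use tomorrow's.
    midnight = now.replace(hour=0, minute=0, second=0, microsecond=0)
    candidate = midnight + timedelta(minutes=target)
    if candidate <= now:
        candidate += timedelta(days=1)
    return candidate


# Example: posts seen at 08:00, 08:10 and 08:20 on previous days.
history = [
    datetime(2024, 1, 1, 8, 0),
    datetime(2024, 1, 2, 8, 10),
    datetime(2024, 1, 3, 8, 20),
]
early = next_fetch_time(datetime(2024, 1, 4, 7, 0), history)
late = next_fetch_time(datetime(2024, 1, 4, 9, 0), history)
```

With this, a feed that usually posts around 08:10 would be checked at about 08:25 instead of whenever the 6-hour timer happens to fire.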