lemon24 / reader

A Python feed reader library.
https://reader.readthedocs.io
BSD 3-Clause "New" or "Revised" License
431 stars 28 forks source link

Entry number unbounded, no way of removing old entries #96

Open lemon24 opened 5 years ago

lemon24 commented 5 years ago

A database with ~3000 entries takes about 21M, which is perfectly acceptable. However, at the moment there is no way to remove old entries, and the database can grow arbitrarily.

lemon24 commented 5 years ago

How other people handle this:

Akregator has 4 archive settings (can be configured globally, or per feed) (update: unchanged as of 2022):

Also, not deleting important articles can be turned off.

Tiny Tiny RSS can purge articles after X days (can be configured globally, or per feed); some details:

An interesting (but somewhat unrelated feature) is the Archived feed, which keeps starred articles from deleted feeds and share-anything articles (you can add articles that have no feed). Articles in the Archived feed are not purged.

lemon24 commented 4 years ago

Presumably, it would be also nice to mark a whole feed as important ("don't delete"). This could also be implemented as a plug-in that marks each new entry as important, but it may pollute individual important entries.

lemon24 commented 1 year ago

Requirements:

Open questions:

lemon24 commented 1 year ago

TODO: kinds of duplicates (broadly) × deduplication mechanisms matrix

Kinds of duplicates:

Deduplication mechanisms: