thedod / feed2twister

Simple cron script to post RSS/ATOM items to Twister
GNU General Public License v3.0

Some feeds do not work due to db file / Feature request - checksum of posts #11

Open RealVegOs opened 8 years ago

RealVegOs commented 8 years ago

Hi TheDod, I have noticed that some feeds only work once, or only after the db file is deleted. I do not know what is written into the db file. The headlines of posts? I know that one of these failing feeds is quite stupid: it comes from a CMS, but the owner does not create new posts. He always opens the same post and adds content, so the RSS output is always the same post. In that case I am not surprised that feed2twister does not work. But there are other feeds that look OK and show the same behavior of only working once. Perhaps a solution would be to calculate a checksum of each post and put that into the db file. Then even the stupid feed above would work, as the content changes with each repost. Thanks for reading, and greetings. Br.
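To make the checksum idea concrete, here is a minimal sketch. It is not how feed2twister works today; the function names, the choice of SHA-256, and the dict-like `seen` store standing in for the db file are all assumptions made for illustration.

```python
import hashlib

def post_checksum(title, link, content):
    """Return a SHA-256 hex digest over the fields that define a post."""
    h = hashlib.sha256()
    for field in (title, link, content):
        data = field.encode("utf-8")
        # Length-prefix each field so ("ab", "c") and ("a", "bc") hash differently.
        h.update(str(len(data)).encode("ascii") + b":" + data)
    return h.hexdigest()

def is_new(seen, title, link, content):
    """Record the post's checksum in `seen`; return True if it was not there before.

    `seen` is any dict-like object (hypothetical stand-in for the db file).
    """
    digest = post_checksum(title, link, content)
    if digest in seen:
        return False
    seen[digest] = True
    return True
```

With this approach a post whose content grows gets a new checksum and is posted again, while an unchanged post is skipped on later runs.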

black-puppydog commented 8 years ago

The problem is that this would repost the same post over and over again if the author only adds small bits or corrects a typo. Otherwise it would require some more sophisticated heuristics.
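One possible heuristic, sketched here only as an illustration: compare the stored text of a post with the new text and repost only when they differ substantially. The function name and the `threshold` default are assumptions, and the cutoff would need tuning against real feeds.

```python
import difflib

def changed_enough(old_text, new_text, threshold=0.9):
    """Return True when the two texts are less similar than `threshold`.

    The similarity ratio is between 0.0 (nothing in common) and 1.0 (identical).
    """
    ratio = difflib.SequenceMatcher(None, old_text, new_text).ratio()
    return ratio < threshold
```

This requires keeping the previous text of each post (not just a checksum) in the db file, which is the price of telling a typo fix apart from a real update.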

RealVegOs commented 8 years ago

Hi Daan,

I would not mind the reposting. A timestamp together with the title would do as an identifier for posts. BTW, some CMSs do not repeat an old post's RSS output if the post is edited.
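A rough sketch of what I mean by timestamp plus title as identifier. The entry keys `title` and `published`, the function names, and the in-memory set standing in for the db file are assumptions, not feed2twister's actual code.

```python
import hashlib

def post_id(title, published):
    """Build a stable identifier from a post's title and publication timestamp."""
    key = published.strip() + "\n" + title.strip()
    return hashlib.sha1(key.encode("utf-8")).hexdigest()

def unseen_posts(entries, seen_ids):
    """Yield entries whose identifier is not yet in `seen_ids`, recording each one.

    `entries` is an iterable of dicts with hypothetical "title" and "published" keys.
    """
    for entry in entries:
        pid = post_id(entry["title"], entry["published"])
        if pid not in seen_ids:
            seen_ids.add(pid)
            yield entry
```

If the CMS bumps the timestamp on every edit, the edited post is posted again; if it keeps the timestamp, edits to the body are ignored.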

Cheers,

Br.

On Tue, 22 Mar 2016 08:52:24 -0700 Daan Wynen notifications@github.com wrote:

DW> problem is, this would repost the same post over and over again
DW> if the author only adds small bits or corrects a typo. or it
DW> would require some more sophisticated heuristics.
