greghendershott / feeds2gmail

Atom/RSS/RDF feeds --> Gmail using IMAP APPEND.

warning and error during -i, -u #5

Open dontdieych opened 9 years ago

dontdieych commented 9 years ago

https://gist.github.com/1e5db5bac28a8fde1740

feeds: list of feed URLs
log-i: output of racket main.rkt -i feeds
log-u: output of racket main.rkt -u

Last log from racket main.rkt -u:

racket main.rkt -u
http://www.markshuttleworth.com/feed ... 304 Not Modified
http://learnyouahaskell.com/rss.php ... 200 OK
Bad RSS item
Bad RSS item
Bad RSS item
Bad RSS item
Bad RSS item
Bad RSS item
Bad RSS item
Bad RSS item
Bad RSS item
Bad RSS item
http://0pointer.net/blog/index.atom ... 304 Not Modified
http://dailyvim.tumblr.com/rss ... 200 OK
http://feeds.launchpad.net/bugs/731874/bug.atom ... 200 OK
http://feeds.feedburner.com/Torrentfreak ... 200 OK
https://github.com/blog.atom ... 200 OK
http://articles.dappergentlemen.com/rss/ ... 200 OK
http://allthatbtc.blog.me/rss ... 200 OK
New: "[코빗 뉴스 클리핑] San Jose Earthquakes 축구팀, 비트코인 받을 예정"
#(struct:exn:fail check-ok: server error: (NO |[TRYCREATE]| Folder |doesn't| exist. (Failure)) #<continuation-mark-set>)

dontdieych commented 9 years ago

I'm wondering:

  1. How can I ensure that all feeds were properly imported?
  2. What is the last update status of all feeds, or of a specific feed? I'd like to be able to check the health of the periodic update.

dontdieych commented 9 years ago

Thanks for this convenient tool! It's exactly what I was looking for.

dontdieych commented 9 years ago

I've researched a little bit more.

$ ag '^ {6}\("http' ~/.feeds2gmail.cache.rktd | wc -l
33
$ wc -l feeds
37 feeds
$ racket main.rkt -u
http://www.markshuttleworth.com/feed ... 304 Not Modified
http://learnyouahaskell.com/rss.php ... 200 OK
Bad RSS item
Bad RSS item
Bad RSS item
Bad RSS item
Bad RSS item
Bad RSS item
Bad RSS item
Bad RSS item
Bad RSS item
Bad RSS item
http://0pointer.net/blog/index.atom ... 304 Not Modified
http://dailyvim.tumblr.com/rss ... 200 OK
http://feeds.launchpad.net/bugs/731874/bug.atom ... 200 OK
http://feeds.feedburner.com/Torrentfreak ... 200 OK
https://github.com/blog.atom ... 200 OK
http://articles.dappergentlemen.com/rss/ ... 200 OK
http://allthatbtc.blog.me/rss ... 200 OK
New: "[코빗 뉴스 클리핑] San Jose Earthquakes 축구팀, 비트코인 받을 예정"
#(struct:exn:fail check-ok: server error: (NO |[TRYCREATE]| Folder |doesn't| exist. (Failure)) #<continuation-mark-set>)
$ echo $?
0
$
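The count comparison above (33 cache entries against 37 feeds) shows that some feeds are missing but not which ones. A minimal Python sketch for listing them follows. It assumes that the feeds file holds one URL per line and that each cached feed appears on its own line as six spaces followed by `("http...`, the same pattern the `ag` command matches. The actual cache layout is not documented in this thread, and a feed whose URL was redirected or normalized could be reported as missing even though it was imported. All function names and sample URLs are hypothetical.

```python
import re

# Assumption: a cached feed starts a line as six spaces, then ("http...",
# which is the pattern used by the `ag` command in this thread.
CACHE_ENTRY = re.compile(r'^ {6}\("(http[^"]*)"', re.MULTILINE)


def cached_urls(cache_text):
    """Return the set of feed URLs found in the cache file's text."""
    return set(CACHE_ENTRY.findall(cache_text))


def feed_urls(feeds_text):
    """Return the non-blank lines of the feeds file, in order."""
    return [line.strip() for line in feeds_text.splitlines() if line.strip()]


def missing_feeds(feeds_text, cache_text):
    """Return feed URLs listed in the feeds file but absent from the cache."""
    cached = cached_urls(cache_text)
    return [url for url in feed_urls(feeds_text) if url not in cached]


# Small demonstration with made-up data (not taken from the real cache).
sample_cache = '      ("http://a.example/feed" 1 2)\n      ("https://b.example/rss" 3 4)\n'
sample_feeds = "http://a.example/feed\nhttp://c.example/atom\nhttps://b.example/rss\n"
for url in missing_feeds(sample_feeds, sample_cache):
    print(url)  # prints http://c.example/atom
```

To run it against real files, read `feeds` and `~/.feeds2gmail.cache.rktd` as text and pass both strings to `missing_feeds`.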

greghendershott commented 9 years ago

So IIUC there are two distinct problems here:

  1. Some feeds aren't being parsed. I think this will be an ongoing battle. I remember hearing that much of the work Google Reader had to do was dealing with buggy feed files. I wish Racket had a dedicated package for reading feeds -- maintained by someone who wanted to keep refining it to handle new examples of buggy feeds. Meanwhile I just have feeds.rkt. When I have time I'll take a look at these example feeds and see if I can change it to handle them, too.
  2. It looks like an exn:fail exception is being caught in the process of making the HTTP request. I'll take a look at this. I think, however, that it is probably correct for feeds2gmail to catch the error and keep going. The next time feeds2gmail is run, it will try that server again. Do you disagree?

greghendershott commented 9 years ago

Oh, for 2, I think you mean you simply want it to exit with (say) 1 instead of 0, if there were any such server errors? And you would use that somehow? I could do that.

Personally, I used to run this on an hourly schedule, so I didn't really care what it returned each time, if just one RSS server had failed. But I guess you want to use it differently.
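The behavior discussed here (catch each per-feed error, keep going, and report failure through the exit status) can be sketched as follows. This is an illustration of the control flow only, written in Python rather than Racket, and it is not feeds2gmail's actual code. `update_all`, `exit_status`, and the `update_one` callback are hypothetical names. In Racket the per-feed `try`/`except` would presumably be a `with-handlers` form around the work for one feed.

```python
import sys


def update_all(feed_urls, update_one):
    """Update every feed in turn, continuing past failures.

    `update_one` is a hypothetical callback that fetches one feed and
    appends its new items to the mailbox, raising an exception on error.
    Returns a list of (url, exception) pairs for the feeds that failed.
    """
    failures = []
    for url in feed_urls:
        try:
            update_one(url)
        except Exception as exc:
            # One bad feed or server must not stop the remaining feeds.
            print(f"{url} ... FAILED: {exc}", file=sys.stderr)
            failures.append((url, exc))
    return failures


def exit_status(failures):
    """Return 1 if any feed failed, else 0, for use as the process exit code."""
    return 1 if failures else 0
```

A caller would end with something like `sys.exit(exit_status(update_all(urls, update_one)))`, so an hourly job can ignore the status while a monitoring script can act on a non-zero value.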

P.S. I don't actually use feeds2gmail myself anymore. I'm happy to fix a bug if I can do so quickly. I'd also welcome pull requests if you can fix anything.

dontdieych commented 9 years ago

It just exited rather than continuing with the next feed. I agree that it should continue.