hubgit / archiveteam-reader-warc-extract

Extract feeds from Archive Team Google Reader WARC files
http://www.macropus.org/ia-feed/
3 stars 0 forks source link

Deployed version errors on JSON request #1

Open gwern opened 9 years ago

gwern commented 9 years ago

I checked out my blog's archived RSS feeds, and they are listed: http://www.macropus.org/ia-feed/?prefix=http%3A%2F%2Fgwern.net

Actually trying to get any seems to yield errors:

http://www.macropus.org/ia-feed/?feed=http%3A%2F%2Fgwern.net%2Fatom.xml%3Fc%3DCKWHwbOKoLMC%26r%3Dn%26n%3D1000%26hl%3Den%26likes%3Dtrue%26comments%3Dtrue%26client%3DArchiveTeam

Warning: gzdecode(): data error in /home/alfeaton/macropus.org/ia-feed/handler.php on line 37

Warning: Cannot modify header information - headers already sent by (output started at /home/alfeaton/macropus.org/ia-feed/handler.php:37) in /home/alfeaton/macropus.org/ia-feed/handler.php on line 44
ivan commented 9 years ago

:+1: would be great if this worked because this is currently the only way to dig through the Google Reader archive :-)

hubgit commented 8 years ago

Unfortunately the archive.org CDX server no longer returns the filename in the search results, so I can't currently see a way to get a URL to download the file.

https://github.com/internetarchive/wayback/tree/master/wayback-cdx-server#access-control