HaveF / feedparser

Automatically exported from code.google.com/p/feedparser
Other
0 stars 0 forks source link

handle deflate'd content that has no header nor checksum #344

Closed GoogleCodeExporter closed 9 years ago

GoogleCodeExporter commented 9 years ago
What steps will reproduce the problem?
1. Parse this fee: d=feedparser.parse('http://www.theverge.com/rss/index.xml')

What is the expected output? What do you see instead?

feedparser should parse the entries of this valide Atom feed

What version of the product are you using? On what operating system?
5.1.1

Please provide any additional information below.

Original issue reported on code.google.com by Shawn.Ab...@gmail.com on 14 Apr 2012 at 7:40

GoogleCodeExporter commented 9 years ago
This is happening because the HTTP headers claim that the feed is compressed 
using deflate, but is not (or the Python standard library is automatically 
decompressing the data). feedparser simply needs to ignore the error and 
continue parsing as if it successfully decompressed the data, so I'll change 
the code so that it can attempt to continue parsing.

Original comment by kurtmckee on 17 Apr 2012 at 4:17

GoogleCodeExporter commented 9 years ago
This issue was closed by revision r698.

Original comment by kurtmckee on 19 Apr 2012 at 3:53

GoogleCodeExporter commented 9 years ago
My initial inspection must have been a fluke because the server actually was 
returning deflate'd content. However, I discovered that it was raw compressed 
data: no header, no checksum.

http://www.zlib.net/zlib_faq.html#faq39

Thanks for reporting this issue!

Original comment by kurtmckee on 19 Apr 2012 at 3:56

GoogleCodeExporter commented 9 years ago
Issue 348 has been merged into this issue.

Original comment by kurtmckee on 24 Apr 2012 at 4:29

GoogleCodeExporter commented 9 years ago
Issue 348 has been merged into this issue.

Original comment by kurtmckee on 1 Dec 2012 at 3:05