libo26 / feedparser

Automatically exported from code.google.com/p/feedparser
Other
0 stars 0 forks source link

unresolvable external doctype causes crash #297

Closed GoogleCodeExporter closed 9 years ago

GoogleCodeExporter commented 9 years ago
What steps will reproduce the problem?
1. Parse http://thehill.com/index.php or contents of attached html
2.
3.

What is the expected output? What do you see instead?
Expect: a feed object returned (though lacking any useful feed fields since 
there are none)
Instead: Process aborts prior to parse() return

What version of the product are you using? On what operating system?
product version: 5.0.1 release: 1.fc15 on Fedora 15 (Linux)

Please provide any additional information below.
html is, granted, not an rss or other type of feed, but feedparser gracefully 
degrades in other cases where it is asked to parse html.

Original issue reported on code.google.com by p.a.row...@gmail.com on 27 Jun 2011 at 11:57

Attachments:

GoogleCodeExporter commented 9 years ago
Fixed in r559.

This wasn't happening because the source document was HTML, but rather because 
the SAX parser was referencing an unresolvable URL in the DOCTYPE.

Original comment by kurtmckee on 15 Aug 2011 at 5:21