jacob1044 / pubsubhubbub

Automatically exported from code.google.com/p/pubsubhubbub
Other
1 stars 0 forks source link

Ensure hub code properly preserves CDATA XML sections #73

Closed GoogleCodeExporter closed 9 years ago

GoogleCodeExporter commented 9 years ago
Not sure if it works properly right now

Original issue reported on code.google.com by bslatkin on 22 Sep 2009 at 4:36

GoogleCodeExporter commented 9 years ago
Definitely looking broken:

Could not get entries for content of 82850 bytes in format "atom" for topic 
u'http://feeds.feedburner.com/HomeEnergyAlternativesGeothermalEnergy':
Traceback (most recent call last):
  File "/base/data/home/apps/pubsubhubbub/memory-queue.342978944075695396/main.py", line 2469, in parse_feed
    feed_record.topic, format, content)
  File "/base/data/home/apps/pubsubhubbub/memory-queue.342978944075695396/main.py", line 2317, in find_feed_updates
    header_footer, entries_map = filter_feed(feed_content, format)
  File "/base/data/home/apps/pubsubhubbub/memory-queue.342978944075695396/feed_diff.py", line 245, in filter
    parser.parse(data_stream)
  File "/base/python_runtime/python_dist/lib/python2.5/xml/sax/expatreader.py", line 107, in parse
    xmlreader.IncrementalParser.parse(self, source)
  File "/base/python_runtime/python_dist/lib/python2.5/xml/sax/xmlreader.py", line 125, in parse
    self.close()
  File "/base/python_runtime/python_dist/lib/python2.5/xml/sax/expatreader.py", line 217, in close
    self.feed("", isFinal = 1)
  File "/base/python_runtime/python_dist/lib/python2.5/xml/sax/expatreader.py", line 211, in feed
    self._err_handler.fatalError(exc)
  File "/base/python_runtime/python_dist/lib/python2.5/xml/sax/handler.py", line 38, in fatalError
    raise exception
SAXParseException: <unknown>:916:0: unclosed CDATA section

Original comment by bslatkin on 28 Jun 2010 at 6:04

GoogleCodeExporter commented 9 years ago
This is also addressed by r392, which will pass these poorly formatted feeds 
straight through without a parsing step.

Original comment by bslatkin on 6 Nov 2010 at 1:19