alan-turing-institute / ReadabiliPy

A simple HTML content extractor in Python. Can be run as a wrapper for Mozilla's Readability.js package or in pure-python mode.
MIT License

Fix breitbart issue #78

Closed. jemrobinson closed this pull request 5 years ago.

jemrobinson commented 5 years ago

Identified the bug: it is triggered by unclosed tags (malformed HTML).

Closes #77
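
For readers unfamiliar with this class of problem, here is a minimal sketch of how unclosed tags can be detected in a page before it is passed to an extractor. It is not the change made in this pull request and does not use ReadabiliPy's API: `UnclosedTagFinder` and `find_unclosed_tags` are hypothetical names, and the sketch relies only on the standard-library `html.parser` module. It also treats every non-void element as needing an explicit closing tag, which is stricter than the HTML specification (for example, `</p>` is optional in valid HTML).

```python
from html.parser import HTMLParser

# Void elements never take a closing tag, so they are not tracked.
VOID_TAGS = {
    "area", "base", "br", "col", "embed", "hr", "img",
    "input", "link", "meta", "source", "track", "wbr",
}


class UnclosedTagFinder(HTMLParser):
    """Hypothetical helper: records tags that are opened but never closed."""

    def __init__(self):
        super().__init__()
        self.open_tags = []  # stack of currently open tags
        self.unclosed = []   # tags skipped over by a later closing tag

    def handle_starttag(self, tag, attrs):
        if tag not in VOID_TAGS:
            self.open_tags.append(tag)

    def handle_endtag(self, tag):
        if tag not in self.open_tags:
            return  # stray closing tag with no matching opener: ignore it
        # Pop back to the matching opener; anything popped on the way
        # was opened inside this element and never closed.
        while self.open_tags:
            top = self.open_tags.pop()
            if top == tag:
                break
            self.unclosed.append(top)


def find_unclosed_tags(html):
    """Return the tags left unclosed in `html`, innermost first."""
    finder = UnclosedTagFinder()
    finder.feed(html)
    finder.close()
    # Tags still on the stack at end of input were never closed either.
    return finder.unclosed + finder.open_tags[::-1]


if __name__ == "__main__":
    print(find_unclosed_tags("<div><p>Hello<span>world</div>"))
```

A check like this could be used to log or pre-filter suspicious input; a lenient parser that repairs the tree is usually the more robust option when the page still has to be processed.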

coveralls commented 5 years ago

Coverage Status

Coverage remained the same at 100.0% when pulling bf75278a9865479df7f680bf114be07f60c3acd0 on 77-breitbart-crash into 736470d4dabcc3cc5e4064526a13db00edf0663c on master.
