Closed Mmdixon closed 2 years ago
Thank you for the bug report! It was something very specific for extra zeroes at the start of an HTML entity name. Now fixed in f963df7f824c2dbcc6bc159e1dba57b082b062fe.
@earwig is the fix available on the pip version too? I also recently faced this issue. Thanks to @Mmdixon for the timely report!!
Thank you for the bug report! It was something very specific for extra zeroes at the start of an HTML entity name. Now fixed in f963df7.
Hello, in pip version mwparserfromhell==0.6.4 problem still exists, have faced it just now.
Thanks so much for your patience. 0.6.5 is now released on PyPI with this fix.
The code:
The problematic text I ran into was on this page: https://es.wikipedia.org/?curid=5006152. Note the characters
000nbsp
that came from the sentence:The error:
The expectation was for no exception to be thrown and if the text is problematic to be ignored or parsed differently.