jwilk-archive / ocrodjvu

OCR for DjVu
GNU General Public License v2.0

html5lib/_ihatexml.py:265: DataLossWarning: Coercing non-XML name #44

Open jwilk opened 2 years ago

jwilk commented 2 years ago

I see this when testing with warnings enabled:

$ PYTHONWARNINGS=a make test
python -c 'import nose; nose.main()' --verbose
...
tests.hocr2djvused.test.test_from_file('alice_tesseract3.02+charboxes', 1, '--html5') ... /usr/lib/python2.7/dist-packages/html5lib/_ihatexml.py:265: DataLossWarning: Coercing non-XML name
  warnings.warn("Coercing non-XML name", DataLossWarning)
ok
...
tests.hocr2djvused.test.test_from_file('alice_tesseract3.02+charboxes', 2, '--html5') ... /usr/lib/python2.7/dist-packages/html5lib/_ihatexml.py:265: DataLossWarning: Coercing non-XML name
  warnings.warn("Coercing non-XML name", DataLossWarning)
ok
...
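
One possible way to narrow down which input triggers the warning is to escalate just that category to an exception, so the offending call surfaces with a traceback. The sketch below is only an illustration using the standard `warnings` module. The `DataLossWarning` class here is a local stand-in for the html5lib one, and `coerce_name`, `find_offender`, and the sample names are hypothetical, not code from html5lib or ocrodjvu.

```python
import warnings


class DataLossWarning(UserWarning):
    """Local stand-in for html5lib's warning class, so the sketch runs without html5lib."""


def coerce_name(name):
    # Hypothetical helper imitating a warning site; not html5lib's actual logic.
    if ":" in name:
        warnings.warn("Coercing non-XML name", DataLossWarning)
        return name.replace(":", "_")
    return name


def find_offender(names):
    """Return the first name that triggers DataLossWarning, or None if none does."""
    with warnings.catch_warnings():
        # Turn only this category into an exception; other warnings keep their filters.
        warnings.simplefilter("error", DataLossWarning)
        for name in names:
            try:
                coerce_name(name)
            except DataLossWarning:
                return name
    return None


if __name__ == "__main__":
    # Made-up sample input, for illustration only.
    print(find_offender(["div", "made:up", "span"]))
```

The same escalation can presumably be applied to a whole test run through the warning filter syntax of `PYTHONWARNINGS`, for example with an `error` action instead of `a`, at the cost of failing on every other warning as well.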