yuanxu-li / html-table-extractor

extract data from html table
MIT License
84 stars 23 forks

table from Wikipedia that doesn't parse #23

Closed jowens closed 4 months ago

jowens commented 4 months ago

While investigating pandas's HTML table ingest, we wanted to compare against html-table-extractor, but the table in question didn't parse in html-table-extractor. Please see: https://github.com/pandas-dev/pandas/issues/58461#issuecomment-2081527264

Macports py312-beautifulsoup4 @4.12.2_0+html5lib+lxml

$ pip list | grep html-table-extractor
html-table-extractor          1.4.1

jowens commented 4 months ago

This was totally my mistake; I was missing a <. Doh!
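
For anyone hitting something similar, here is a minimal sketch of how a single missing < can make a cell vanish from parsed output. It uses only Python's standard-library html.parser, not html-table-extractor or pandas, so it shows the general failure mode rather than either library's exact behavior. The sample markup, the CellCollector class, and the extract_rows helper are hypothetical and are not taken from the table in the linked issue.

```python
from html.parser import HTMLParser


class CellCollector(HTMLParser):
    """Collect the text of each <td>/<th> cell, grouped by <tr> row."""

    def __init__(self):
        super().__init__()
        self.rows = []
        self._in_cell = False

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            # A new row starts: open an empty list of cells.
            self.rows.append([])
        elif tag in ("td", "th") and self.rows:
            # A new cell starts: open an empty string in the current row.
            self.rows[-1].append("")
            self._in_cell = True

    def handle_endtag(self, tag):
        if tag in ("td", "th"):
            self._in_cell = False

    def handle_data(self, data):
        # Text is only kept while we are inside a cell.
        if self._in_cell:
            self.rows[-1][-1] += data


def extract_rows(html):
    """Return the table cells in `html` as a list of rows."""
    parser = CellCollector()
    parser.feed(html)
    parser.close()
    return parser.rows


# Hypothetical markup: `broken` is missing the "<" before the second "td>".
good = "<table><tr><td>1</td><td>2</td></tr></table>"
broken = "<table><tr><td>1</td>td>2</td></tr></table>"

print(extract_rows(good))    # [['1', '2']]
print(extract_rows(broken))  # [['1']]
```

In the broken case the parser sees "td>2" as plain text outside any cell, so the second cell is dropped without an error. That is why a typo like this can look like a parser bug at first.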