manuc352 / PDL_1920_groupe-7

0 stars 9 forks source link

Understand causes of different number of tables extracted per URL #23

Open manuc352 opened 4 years ago

manuc352 commented 4 years ago

For some URL, the number of tables is not the same. --> Understand why and resolve it

manuc352 commented 4 years ago

For example, with url https://en.wikipedia.org/wiki/Comparison_of_browser_engines_(HTML_support)#Non-standard_items , wikitext extractor doesn't extract the first and the second table. We didn't find why it is the case.

manuc352 commented 4 years ago

As you asked during the presentation, here you will find some URLs which don't product the same number of tables for the both extractors :