tatuylonen / wiktextract

Wiktionary dump file parser and multilingual data extractor
Other
799 stars 82 forks source link

[en] don't extract "span" tag in example source "dd" tags #822

Closed xxyzz closed 2 weeks ago

xxyzz commented 2 weeks ago

Similar to zh commit 4468de7, there is a small difference in expanded HTML nodes: "trad." and "simp." links after example text are inside an <i> tag, zh edition doesn't have it.

Test was added in the zh edition commit, I also tested some pages locally.