metafacture / metafacture-core

Core package of the Metafacture tool suite for metadata processing.
https://metafacture.org
Apache License 2.0
71 stars 34 forks source link

Emit combined text for mixed content in HtmlDecoder #477

Closed fsteeg closed 1 year ago

fsteeg commented 1 year ago

(Came up processing an HTML source in OERSI.)

E.g. for <p>This is the <strong>full</strong> text</p>, emit This is the full text.

Was only emitting the children's text (full) before (emitting text only if children were empty).