mtmse / pipeline

Super-project that aggregates all Pipeline-related code, provides a common tracker for Pipeline-related issues, and holds the Pipeline website.
http://daisy.github.io/pipeline

Sentence detection marks up `<sub>` elements as sentences #6

Closed: martinpub closed this issue 3 years ago

martinpub commented 3 years ago

Sample output:

<p><span id="st11-23">Varning för pandemi (H</span><sub><span id="st11-24">1</span></sub><span id="st11-25">N</span><sub><span id="st11-26">1</span></sub><span id="st11-27">,"svininfluensan").</span></p>

Desired result:

<p><span id="st11-23">Varning för pandemi (H<sub>1</sub>N<sub>1</sub>,"svininfluensan").</span></p>
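For anyone who wants to check their own output for this problem, here is a minimal sketch of a detector. It assumes the fragment is well-formed XML without namespaces and that sentences are marked with `<span>` elements as in the samples above; the function name `sub_sentence_spans` is hypothetical and not part of the Pipeline code.

```python
import xml.etree.ElementTree as ET

# The two fragments quoted in this issue.
SAMPLE_OUTPUT = (
    '<p><span id="st11-23">Varning för pandemi (H</span>'
    '<sub><span id="st11-24">1</span></sub>'
    '<span id="st11-25">N</span>'
    '<sub><span id="st11-26">1</span></sub>'
    '<span id="st11-27">,"svininfluensan").</span></p>'
)
DESIRED_RESULT = (
    '<p><span id="st11-23">Varning för pandemi (H<sub>1</sub>N<sub>1</sub>'
    ',"svininfluensan").</span></p>'
)


def sub_sentence_spans(fragment):
    """Return the ids of all <span> elements nested inside a <sub>.

    Hypothetical helper: assumes no XML namespaces and that every
    <span> in the fragment is a sentence marker.
    """
    root = ET.fromstring(fragment)
    return [
        span.get("id")
        for sub in root.iter("sub")
        for span in sub.iter("span")
    ]


if __name__ == "__main__":
    print(sub_sentence_spans(SAMPLE_OUTPUT))
    print(sub_sentence_spans(DESIRED_RESULT))
```

An empty list means no `<sub>` element was marked up as a sentence. This only detects the symptom; it does not reproduce the fix in the break-detection step.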

Comment from @bertfrees:

Good catch. I'll fix it (in modules/nlp/html-break-detection/src/main/resources/xml/library.xpl).

bertfrees commented 3 years ago

This is fixed (not deployed yet).

martinpub commented 3 years ago

Yay! Looking forward to testing it. Thanks!

martinpub commented 3 years ago

Verified fixed, thanks @bertfrees!