lfoppiano / grobid-superconductors

Grobid module for superconductor material and properties extraction
Apache License 2.0

Output sentence offsets #57

Open lfoppiano opened 1 year ago

lfoppiano commented 1 year ago

When we process paragraphs and the output is sentence-based, we need the sentence offsets to be included in the output.

With the sentence offsets we can easily map the span offsets back to the original paragraph.

E.g., the spaces between sentences are usually lost in the sentence split, so an offset computed inside a sentence no longer lines up with the paragraph text.
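
To illustrate the idea, here is a minimal Python sketch of how a client could use sentence offsets to shift span offsets back into paragraph coordinates. The field names (`offset_start`, `offset_end`, `spans`) and the helper `to_paragraph_offsets` are hypothetical and are not the module's actual output format; the example text is made up.

```python
# Hypothetical sketch: field and function names are assumptions for illustration,
# not the actual grobid-superconductors output schema.

paragraph = "MgB2 is a superconductor. Its Tc is 39 K."

# Sentence-based output, where each sentence carries its own offsets
# relative to the original paragraph (this is what the issue asks for).
sentences = [
    {
        "text": "MgB2 is a superconductor.",
        "offset_start": 0,
        "offset_end": 25,
        "spans": [{"text": "MgB2", "offset_start": 0, "offset_end": 4}],
    },
    {
        "text": "Its Tc is 39 K.",
        "offset_start": 26,  # the space at index 25 was dropped by the split
        "offset_end": 41,
        "spans": [{"text": "39 K", "offset_start": 10, "offset_end": 14}],
    },
]


def to_paragraph_offsets(sentence):
    """Shift sentence-relative span offsets by the sentence start offset."""
    shift = sentence["offset_start"]
    return [
        {
            "text": span["text"],
            "offset_start": span["offset_start"] + shift,
            "offset_end": span["offset_end"] + shift,
        }
        for span in sentence["spans"]
    ]


# Collect all spans in paragraph coordinates.
paragraph_spans = [s for sent in sentences for s in to_paragraph_offsets(sent)]

for span in paragraph_spans:
    # Each remapped span should select exactly its own text in the paragraph.
    assert paragraph[span["offset_start"]:span["offset_end"]] == span["text"]
```

Without `offset_start` on the second sentence, a client would have to guess how much whitespace was removed between sentences; summing the sentence lengths alone would place "39 K" one character too early.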