kermitt2 / grobid

A machine learning software for extracting information from scholarly documents
https://grobid.readthedocs.io
Apache License 2.0
3.59k stars 459 forks

Alternative articles flavors #1202

Open lfoppiano opened 1 day ago

lfoppiano commented 1 day ago

This PR implements two alternative segmentation flavors:

The article's body is then composed of two paragraphs:

PR #1151 was tested in this PR.

lfoppiano commented 1 day ago

@kermitt2 would it be OK to merge this into #1151 so that we have a single PR?

coveralls commented 23 hours ago

coverage: 40.572% (-0.2%) from 40.739% when pulling 2365fac8c0ec40875b2b2d9e4db779313e55131d on feature/segmentation-light into 01745c06b1cb836bdc560a08427d9530effb4148 on flavor.