kermitt2 / grobid

A machine learning software for extracting information from scholarly documents
https://grobid.readthedocs.io
Apache License 2.0

missing references #323

Closed AlainMonteil closed 2 years ago

AlainMonteil commented 6 years ago

Good afternoon, we need to extract all references from articles in Arima so that this journal can be indexed by zentralblatt math... I tried with one PDF (attached), but the first two references are missing! Maybe it's because they are at the end of the pages. arima_28-1-12.pdf I used "process all references" and "process fulltext document". Thanks for your help, Alain
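To see quickly which references a run actually returned, the TEI output can be scanned for `biblStruct` entries. The following is only a minimal sketch: the function name `list_references` and the `SAMPLE` string are hypothetical, and the sample is a shortened fragment in the shape GROBID emits, not a complete result. Elements are matched by local name, so the check works whether or not the TEI namespace is declared on a given element.

```python
import xml.etree.ElementTree as ET

# Namespace that ElementTree uses for the predefined "xml:" prefix (xml:id).
XML_NS = "{http://www.w3.org/XML/1998/namespace}"

# Hypothetical, shortened TEI fragment used only for illustration.
SAMPLE = """<listBibl xmlns="http://www.tei-c.org/ns/1.0">
  <biblStruct xml:id="b0">
    <analytic>
      <title level="a" type="main">Reconstruction of the action potential of ventricular myocardial fibres</title>
      <author><persName><forename type="first">G</forename><surname>Beeler</surname></persName></author>
      <author><persName><forename type="first">H</forename><surname>Reuter</surname></persName></author>
    </analytic>
    <monogr><title level="j">J. Physiol</title></monogr>
  </biblStruct>
</listBibl>"""


def local(tag):
    """Return the element name without its '{namespace}' prefix."""
    return tag.rsplit("}", 1)[-1]


def list_references(tei_xml):
    """Return (xml:id, surnames, main title) for each biblStruct found."""
    root = ET.fromstring(tei_xml)
    refs = []
    for elem in root.iter():
        if local(elem.tag) != "biblStruct":
            continue
        # First title flagged type="main", or None if there is none.
        title = next(
            (t.text for t in elem.iter()
             if local(t.tag) == "title" and t.get("type") == "main"),
            None,
        )
        surnames = [s.text for s in elem.iter() if local(s.tag) == "surname"]
        refs.append((elem.get(XML_NS + "id"), surnames, title))
    return refs


if __name__ == "__main__":
    for ref in list_references(SAMPLE):
        print(ref)
```

Comparing the returned list against the bibliography in the PDF makes it easy to spot entries that were dropped, such as the first two here.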

kermitt2 commented 4 years ago

I re-tested this, since the segmentation model now has more training data...

... and both references are now extracted:

                 <listBibl>
                    <biblStruct xml:id="b0">
                        <analytic>
                            <title level="a" type="main">Reconstruction of the action potential of ventricular myocardial fibres</title>
                            <author>
                                <persName
                                    xmlns="http://www.tei-c.org/ns/1.0">
                                    <forename type="first">G</forename>
                                    <forename type="middle">W</forename>
                                    <surname>Beeler</surname>
                                </persName>
                            </author>
                            <author>
                                <persName
                                    xmlns="http://www.tei-c.org/ns/1.0">
                                    <forename type="first">H</forename>
                                    <surname>Reuter</surname>
                                </persName>
                            </author>
                        </analytic>
                        <monogr>
                            <title level="j">J. Physiol</title>
                            <imprint>
                                <biblScope unit="volume">268</biblScope>
                                <biblScope unit="issue">1</biblScope>
                                <biblScope unit="page" from="177" to="210" />
                                <date type="published" when="1977" />
                            </imprint>
                        </monogr>
                    </biblStruct>
                    <biblStruct xml:id="b1">
                        <monogr>
                            <title level="m" type="main">Exponential Adams Bashforth integrators for stiff ODEs, application to cardiac electrophysiology</title>
                            <author>
                                <persName
                                    xmlns="http://www.tei-c.org/ns/1.0">
                                    <forename type="first">Y</forename>
                                    <surname>Coudiére</surname>
                                </persName>
                            </author>
                            <author>
                                <persName
                                    xmlns="http://www.tei-c.org/ns/1.0">
                                    <forename type="first">C</forename>
                                    <surname>Douanla-Lontsi</surname>
                                </persName>
                            </author>
                            <author>
                                <persName
                                    xmlns="http://www.tei-c.org/ns/1.0">
                                    <forename type="first">C</forename>
                                    <surname>Pierre</surname>
                                </persName>
                            </author>
                            <idno>no. hal-01394036</idno>
                            <imprint>
                                <date type="published" when="2017" />
                            </imprint>
                        </monogr>
                        <note type="report_type">HAL Preprint</note>
                    </biblStruct>