kermitt2 / biblio-glutton

A high performance bibliographic information service: https://biblio-glutton.readthedocs.io

When using biblio-glutton with Grobid, it looks like only titles of level `a` are consolidated #107

Open frankrod opened 1 week ago

frankrod commented 1 week ago

Hi,

I'm currently using biblio-glutton with Grobid to consolidate references. However, I've noticed that references with title level `m` are not consolidated. Is that intended, and if so, why? Thanks

kermitt2 commented 1 week ago

Hi @frankrod

It's an interesting observation. I think you're correct: currently only title level `a` and title level `j` (journal title) are used for consolidation. When a reference with only title level `m` is submitted, it will not be consolidated. More generally, for the moment, the title level `m` field is not used for consolidation at all.
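To make the behaviour described above concrete, here is a minimal sketch that checks which title levels a Grobid TEI `biblStruct` carries. It is an illustration only, not the biblio-glutton or Grobid code: the function names, the `CONSOLIDATED_LEVELS` set, and the two sample references are hypothetical, and the set simply mirrors the statement that only levels `a` and `j` are used.

```python
import xml.etree.ElementTree as ET

# TEI namespace used in Grobid's XML output.
TEI_NS = {"tei": "http://www.tei-c.org/ns/1.0"}

# Assumption taken from the comment above: only these levels feed consolidation.
CONSOLIDATED_LEVELS = {"a", "j"}


def title_levels(bibl_struct_xml: str) -> dict:
    """Map each title level found in a biblStruct to its (first) title text."""
    root = ET.fromstring(bibl_struct_xml)
    levels = {}
    for title in root.iterfind(".//tei:title", TEI_NS):
        level = title.get("level")
        text = "".join(title.itertext()).strip()
        if level and text:
            levels.setdefault(level, text)
    return levels


def has_consolidation_fields(bibl_struct_xml: str) -> bool:
    """True if the reference carries at least one title level used for matching."""
    return bool(CONSOLIDATED_LEVELS.intersection(title_levels(bibl_struct_xml)))


# Hypothetical sample references: a journal article and a book.
ARTICLE = (
    '<biblStruct xmlns="http://www.tei-c.org/ns/1.0">'
    '<analytic><title level="a" type="main">An article title</title></analytic>'
    '<monogr><title level="j">A journal</title></monogr>'
    "</biblStruct>"
)
BOOK = (
    '<biblStruct xmlns="http://www.tei-c.org/ns/1.0">'
    '<monogr><title level="m" type="main">A book title</title></monogr>'
    "</biblStruct>"
)

print(has_consolidation_fields(ARTICLE))
print(has_consolidation_fields(BOOK))
```

Under these assumptions, the book reference has only a level `m` title, so nothing in it would be used for matching, which is consistent with what was observed in the issue.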

I think the reason is that we focused on consolidation against CrossRef, which mainly contains articles (journal articles) and chapters (chapters from serials), and CrossRef does not have many DOIs at the monograph level.

Future plans involve a machine-learning-based pairwise comparison that would use all available fields. Another objective is to cover more bibliographic databases without DOIs; for example, consolidating against ISBN would be a way to match references to monographs/books.
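As a rough idea of what matching on ISBN would need first, the sketch below normalizes ISBN-10 and ISBN-13 strings to a single ISBN-13 key so that two references to the same book compare equal. This is not an existing biblio-glutton feature; `normalize_isbn` is a hypothetical helper, and it only applies the standard ISBN-13 check-digit rule (alternating weights 1 and 3).

```python
import re
from typing import Optional


def normalize_isbn(raw: str) -> Optional[str]:
    """Return the 13-digit form of an ISBN-10 or ISBN-13 string, or None if malformed.

    The ISBN-10 check digit itself is not verified in this sketch; an ISBN-13
    input is rejected when its check digit does not match.
    """
    chars = re.sub(r"[\s-]", "", raw).upper()
    if re.fullmatch(r"\d{9}[\dX]", chars):
        # ISBN-10: drop the old check digit and add the 978 prefix.
        core = "978" + chars[:9]
    elif re.fullmatch(r"\d{13}", chars):
        core = chars[:12]
    else:
        return None

    # ISBN-13 check digit: weights alternate 1, 3, 1, 3, ... over the first 12 digits.
    total = sum(int(d) * (1 if i % 2 == 0 else 3) for i, d in enumerate(core))
    check = str((10 - total % 10) % 10)

    if len(chars) == 13 and chars[-1] != check:
        return None
    return core + check


# Both spellings of the same ISBN normalize to the same key.
print(normalize_isbn("0-306-40615-2"))
print(normalize_isbn("978-0-306-40615-7"))
```

A normalized key like this could then be looked up in a bibliographic database that indexes books by ISBN, in the same way DOIs are used for articles today.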