Closed hassanakbar4 closed 3 years ago
@{"email"=>"julian.reschke@gmx.de", "name"=>nil, "username"=>nil} commented
I agree that a tool would be nice to have, but I disagree that it should be part of xml2rfc.
@{"email"=>"tony@att.com", "name"=>nil, "username"=>nil} commented
I think a better method is to create a bibxml tool to generate the references.
Doing a dig on http://dx.doi.org/${doi} with the header "Accept: application/citeproc+json" appears to generate a nicely formatted JSON reference object.
According to Carsten, the data may not be as regular as we want, but it's a good start.
My thoughts are to have http://xml2rfc.ietf.org/public/rfc/bibxml-doi/reference-DOI.${doi}.xml do the dig and conversion (keeping a local cache so multiple requests don't overwhelm dx.doi.org).
@{"email"=>"johnl@iecc.com", "name"=>nil, "username"=>nil} commented
It's not totally trivial, because different registration agencies support different formats, but it shouldn't be hard to get a 95% result. It is my impression that most of the publishers we're likely to reference such as ACM, IEEE, and the Elsevier journals, all use Crossref.
@{"email"=>"brian.e.carpenter@gmail.com", "name"=>nil, "username"=>nil} commented
it shouldn't be hard to get a 95% result Which isn't good enough for automation. Since a human check will always be needed, this definitely needs to be separated from xml2rfc itself. Linking it to bibxml seems by far the best approach, given the likely users are mainly in the IRTF space.
@{"email"=>"johnl@iecc.com", "name"=>nil, "username"=>nil} commented
it shouldn't be hard to get a 95% result Which isn't good enough for automation.
I don't see why not. If xml2rfc can't resolve the DOI reference to a usable chunk of bibxml, it fails like any other broken reference. It seems unlikely that a reference that works once would stop working, so I don't see how this would be a problem in practice.
@{"email"=>"brian.e.carpenter@gmail.com", "name"=>nil, "username"=>nil} commented
If xml2rfc can't resolve the DOI reference to a usable chunk of bibxml, it fails like any other broken reference. Sure. But my concern is if it resolves to a syntactically correct but actually wrong DOI reference.
@{"email"=>"tony@att.com", "name"=>nil, "username"=>nil} commented
I really doubt that 1 in 20 of the DOI citations returned by a lookup on dx.doi.org or api.crossref.org are returning information for the wrong document. I'd be surprised to find any incorrect documents returned from the tools.
However, I wouldn't be surprised to find information missing from some of the entries. And I /think/ that's the 95% that John was referring to. John, would you care to elaborate more on your 95% estimate?
@{"email"=>"johnl@iecc.com", "name"=>nil, "username"=>nil} commented
The 95% is for DOIs issued by Crossref or by other DOI agencies who provide bibliographic info in the same formats that Crossref does so the same code can parse it. Since publishers upload the DOI and the bibliographic info at the same time, it's hard to think of a plausible way that they would be wrong very often. The other 5%, if that much, would be where the info is there but the publisher doesn't return it in a form we can parse, and in that case the code knows that it's failed. The page at crosscite lists a bunch of formats supported by Crossref, DataCite (used mostly for research datasets), and mEDRA (used by some European publishers), and most of those formats are supported by all three. I expect it'll be closer to 99% than 95%.
@{"email"=>"johnl@iecc.com", "name"=>nil, "username"=>nil} changed _comment0 which not transferred by tractive
component_Version 2 cli
resolution_overtaken by events
type_enhancement
| by lars@netapp.comBecause RFCs now have DOI numbers, the reference sections in RFCs are now supposed to include DOIs for all references for which they have been assigned (e.g., citations of academic papers).
This currently places quite an extra load on the RFC Editor, which has to manually track those DOIs down and insert them into the XML source. This is esp. time consuming for IRTF documents, which tend to cite many academic papers.
The good news is that the metadata for publications with DOIs is readily available in a number of formats; see http://crosscite.org/cn/
For example:
So it is hopefully relatively easy to extend xml2rfc to transform that XML into something it can use to automatically populate citation entries with DOIs.
Issue migrated from trac:326 at 2021-10-20 18:23:55 +0500