ourresearch / citeas-api

Get the scholarly citation for any research product: software, preprint, paper, or dataset
https://citeas.org
MIT License
67 stars 4 forks source link

Identify DOI near request text for https://protege.stanford.edu/about.php #24

Open jameshowison opened 6 years ago

jameshowison commented 6 years ago

https://protege.stanford.edu/about.php has this text:

A WORD ABOUT CITING We would be grateful if scientific publications resulting from projects that make use of Protégé would include the following sentence in the acknowledgments section:

“This work was conducted using the Protégé resource, which is supported by grant GM10331601 from the National Institute of General Medical Sciences of the United States National Institutes of Health.”

Please use the following reference citation for Protégé:

Musen, M.A. The Protégé project: A look back and a look forward. AI Matters. Association of Computing Machinery Specific Interest Group in Artificial Intelligence, 1(4), June 2015. DOI: 10.1145/2557001.25757003.

There is also a link to PMC in the title of that. I'm not sure if it's reasonable to try to infer that this DOI is the requested citation.

hpiwowar commented 6 years ago

Getting the "include the following sentence" is going to be really hard. I think that is out of scope for us for now.

We are actually getting the DOI from the page, but alas they have a typo in their DOI. It should be 10.1145/2757001.2757003 not 10.1145/2557001.25757003 ... the DOI they have doesn't resolve, so although we find it we don't report it because we don't get any metadata back from it and we therefore assume it isn't actually a DOI. Maybe we should report the error differently?

I did notice we display the unicode badly in the title right now for http://citeas.org/cite/https://protege.stanford.edu/about.php . Created another issue https://github.com/Impactstory/citeas-webapp/issues/8 to address that.