To evaluate the effectiveness of scythe, let's develop a list of known citation and references that scythe should be able to find. For each of these, let's ensure that:
[x] the scythe function discovers the citation/reference in its full text search
[x] the citation/reference is properly extracted
[ ] the citation/reference is properly registered with the dataset DOI
[ ] the citation/reference shows up in EventData
[ ] the citation/reference shows up in the DataONE metrics service
[ ] the citation/reference shows up on the dataset landing page
I started a test file for these, but here are two examples that should be found from PLOS, and that currently do not show up on our dataset landing pages:
Beard et al. 2019. Migratory goose arrival time plays a larger role in influencing forage quality than advancing springs in an Arctic coastal wetland. https://doi.org/10.1371/journal.pone.0213037 (cites Arctic Data Center: https://doi.org/10.18739/A22274 in both the Data Availability statement and Acknowledgements (but not the reference list)
I'm moving this to milestone 1.0, I'd like to get 0.9.0 out soon, with the focus on finding citations and the work to register citations in the DataONE metrics service will come later
To evaluate the effectiveness of scythe, let's develop a list of known citation and references that scythe should be able to find. For each of these, let's ensure that:
I started a test file for these, but here are two examples that should be found from PLOS, and that currently do not show up on our dataset landing pages:
Add other examples to the test file.