softwaresaved / habeas-corpus

A corpus of research software used in COVID-19 research.
MIT License
5 stars 4 forks source link

Create a process for how we QA the CORD19_software_mentions data #13

Open npch opened 3 years ago

npch commented 3 years ago

E.g.

  1. Search for software title in the original CORD19_software_mentions.csv
  2. Get links for papers that mention it
  3. Open full text and search for the software
  4. Quality Code it: e.g.
    • software with repo
    • software but no repo
    • typo but is software
    • part of a larger software package
    • not software
    • typo but not software