The mapping between paper text/citations and software projects will probably be difficult. Here are a few ways I see this going down:
Direct citation of software with a proper DOI. This is our best case.
Citation/footnote of software via URL. This is probably more common (depending on field), but also not hard to catch.
Citation of a paper related to a project. Some software authors prefer this method, since it maps directly onto their h-index. Downside: these look just like ordinary paper citations. We'll probably need a preliminary web crawl of projects in this category before analyzing paper text.
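
To make the three cases concrete, here is a rough Python sketch of how a single reference string might be sorted into them. Everything in it is an assumption for illustration: the regular expressions are simplified, `classify_reference` is a hypothetical helper, and `PROJECT_PAPERS` stands in for whatever table the preliminary crawl would produce (the DOI and project name in it are made up). Note that a bare DOI match does not by itself tell us whether the DOI points at software or at a paper; that would still need a metadata lookup.

```python
import re

# Simplified, assumed patterns; real reference strings are messier.
DOI_RE = re.compile(r"\b10\.\d{4,9}/[^\s\"<>]+", re.IGNORECASE)
URL_RE = re.compile(r"https?://[^\s\"<>]+", re.IGNORECASE)

# Hypothetical result of the preliminary crawl: paper DOI -> project name.
PROJECT_PAPERS = {
    "10.1234/example.paper": "example-project",
}


def classify_reference(ref, project_papers=PROJECT_PAPERS):
    """Return a (category, value) pair for one reference string."""
    doi_match = DOI_RE.search(ref)
    if doi_match:
        # Drop trailing punctuation and normalize case before lookup.
        doi = doi_match.group(0).rstrip(".,;)").lower()
        if doi in project_papers:
            # Case 3: a paper we already know belongs to a project.
            return ("project_paper", project_papers[doi])
        # Case 1 candidate: some DOI, not yet confirmed to be software.
        return ("doi", doi)
    url_match = URL_RE.search(ref)
    if url_match:
        # Case 2: software referenced by URL.
        return ("url", url_match.group(0).rstrip(".,;)"))
    # No DOI or URL: an ordinary citation as far as we can tell.
    return ("unknown", None)
```

The DOI check runs before the URL check so that `https://doi.org/...` links are treated as DOIs rather than plain URLs. Paper citations that carry no DOI at all would fall through to `unknown` here, so they would still need title or author matching against the crawl results.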
What else can go wrong? Have previous studies already run into or solved these issues?