petermr / CEVOpen

Contentmining of Open phytochemical literature for medicinal activities
26 stars 19 forks source link

size of open / closed corpus #13

Open petermr opened 4 years ago

petermr commented 4 years ago

To get some idea of size: all:

getpapers -q "essential oil" -n -a
info: Searching using eupmc API
info: Running in no-execute mode, so nothing will be downloaded
info: Found 143714 results
warn: This version of getpapers wasn't built with this version of the EuPMC api in mind
warn: getpapers EuPMCVersion: 5.3.2 vs. 6.1 reported by api

open:

getpapers -q "essential oil" -n
info: Searching using eupmc API
info: Running in no-execute mode, so nothing will be downloaded
info: Found 66694 open access results
warn: This version of getpapers wasn't built with this version of the EuPMC api in mind
warn: getpapers EuPMCVersion: 5.3.2 vs. 6.1 reported by api

so about 45% of papers are open (and presumably with XML)