OmnesRes / prepub

Production code for PrePubMed
http://www.prepubmed.org/
MIT License
17 stars 6 forks source link

Nature Precedings #10

Open OmnesRes opened 7 years ago

OmnesRes commented 7 years ago

It has come to my attention that the method that I used to scrape Nature Precedings doesn't appear to have obtained every document labeled as "manuscript".

When you go to advanced search and search for manuscript, http://precedings.nature.com/search/advanced?abstract=&author=&document_type=Manuscript, there are 2282 preprints.

But if you click the link called "Manuscripts", http://precedings.nature.com/documents/type/manuscript/revisions, there are only 2021 preprints.

And if you browse by subject, such as http://precedings.nature.com/subjects/bioinformatics, there are only around 1900 preprints.

Unfortunately I browsed by subject, so I seem to be missing some documents.

I don't understand the nature of this clusterfuck, but people don't really care about preprints posted 10 years ago so I probably won't take the time to add them to the production server.

However, when I make the graph for September preprints I will update the Nature Precedings statistics.