science-collective / scoping-review

A scoping review of open collaboration within scientific research
2 stars 2 forks source link

Complete functions for extracting from MedRxiv #42

Closed lwjohnst86 closed 11 months ago

lwjohnst86 commented 1 year ago

Write functions that will extract from the source and download to the repository. To finish this issue, we don't need to download and save the data from the source, we only need the code.

Code should be placed in: R/medrxiv-search.R

MarioGuCBMR commented 1 year ago

Medrxiv code is giving me headaches. I cannot replicate the searches of the website with the code. I tried running the code with just "open collaboration" as search term, but only obtained one result.

However, when I search in medrxiv I found 7000~ results: https://www.medrxiv.org/search/open%252Bcollaboration%20jcode%3Amedrxiv%20limit_from%3A2017-01-01%20limit_to%3A2023-04-21%20numresults%3A10%20sort%3Arelevance-rank%20format_result%3Astandard

The reason is that mx_search has a totally independent way of running the searches. I might need a bit of help with this!

MarioGuCBMR commented 1 year ago

Update: last week they added this get started page that might be useful: https://docs.ropensci.org/medrxivr/articles/medrxivr.html

MarioGuCBMR commented 1 year ago

The issue @danielibsen found with mx_snapshot is still there, but using the parent function locally (mx_api_content), seems to solve this. It takes a while though

MarioGuCBMR commented 1 year ago

Additionally, I have finally understood how the terms work here. You need to generate a list of vectors in which: search = list(c("open"), #each element of the list is equal to AND c("research", "science")) #each element of vector is equal to OR.

Thus the search the list above is equal to this search: open AND (research or science).

The next thing to solve is how to figure having open in title and the rest of the terms in all. So far I think you cannot do this with the package, but can be easily done with an in-house function.

lwjohnst86 commented 11 months ago

This was finished a while ago.