neuroquery / pubget

Collecting papers from PubMed Central and extracting text, metadata and stereotactic coordinates.
https://neuroquery.github.io/pubget/
MIT License
20 stars 12 forks source link

Add NeuroVault extraction #25

Closed adelavega closed 1 year ago

adelavega commented 1 year ago

Might be helpful to automate the NV link extraction based on regex pattern matching on links

jeromedockes commented 1 year ago

why not -- do you mean like having separate PMCID x neurovault collection ID and PMCID x neurovault image ID files besides the links.csv?

adelavega commented 1 year ago

indeed

jeromedockes commented 1 year ago

@adelavega can you confirm that all image links will look like .*neurovault.org/images/(\d+).* and all collections like .*neurovault.org/collections/(\d+).*, where the captured group is the ID?

adelavega commented 1 year ago

Two changes to that:

jeromedockes commented 1 year ago

thanks!