Bioconductor / ExperimentHub

Client to access ExperimentHub resources
https://bioconductor.org/packages/ExperimentHub
9 stars 13 forks source link

helper function to prioritize link over EH resource location #25

Closed LiNk-NY closed 2 years ago

LiNk-NY commented 2 years ago

Similar to Levi's suggestion, perhaps we can provide some mechanism to download from alternative remote resource locations. In a scenario where: Link1 = cloud-based / institution URL location && Link2 = EH location

We'd want to download from link1 because the submitter is already hosting the resource.

Perhaps we could benefit by providing functionality to try to download from Link1 first and if it fails use Link2.

Do we have MD5 sums for EH resources? This would ensure that we are getting the same file from all locations. Providers should also have MD5 sums posted somewhere as well.

lshep commented 2 years ago

No one has to use Bioconductor resources. You can already host on alternative servers and still be visible in the hubs. I've said this multiple times now.
If someone is already hosting then we don't want to have to host the data again for them and have that cost. We prefer to use theirs. The only restriction is it has to be public and stable (institution server, aws bucket, zenodo) and not GitHub or personal server

lshep commented 2 years ago

We have a spot in the database for mdsum.... I'll see if it's actually used it not. If not we should.