distribits / distribits-2024-hackathon

1 stars 1 forks source link

Onedata as a special remote for git-annex / DataLad? #6

Open lopiola opened 8 months ago

lopiola commented 8 months ago

I imagine a nice synergy between git-annex / DataLad and Onedata serving as a special remote (third-party data infrastructure) for them. I'm thinking especially about scenarios of collaborative data sharing across organizations. Imagine a couple of federated organizations with their heterogeneous storage systems, hosting datasets for research purposes. Collaborating users may use DataLad in this setup, but every user has to configure access to all different storage systems holding the data (picture 1). Onedata can virtualize access to all the sites in a single namespace and serve as a special remote, this way everyone would configure just one set of credentials and be able to share the data freely across the federation (picture 2).

no-onedata


yes-onedata

I'll be happy to hear your thoughts and feedback.