DataONEorg / rdataone

R package for reading and writing data at DataONE data repositories
http://doi.org/10.5063/F1M61H5X
36 stars 19 forks source link

Workflow improvements #287

Open gothub opened 2 years ago

gothub commented 2 years ago

Update the rdataone and dependent packages to enhance ease-of-use and to reduce the amount of user effort and to reduce the amount of effort required to achieve a desired outcome.

amoeba commented 2 years ago

Getting the data for a given Solr result is a great example of a workflow we might target with this work. See this Discussion, specifically the answer, how much explanation it takes, and how the answer is still incomplete despite its length.

mbjones commented 2 years ago

Agreed. That said, in the case of the "download all objects for a package" case, that is what getPackage was designed for, which should significantly shorten the steps needed (e.g., add the resourceMap field to the SOLR query, and then use it to call /packages/bagit-1.0.0/{resource_map_pid} on the MN. Two majro barriers with this: 1) can't be called on the CN (I think), and 2) as an optional API, only a few repositories implement this API (namely Metacat and GMN nodes).