IQSS / dataverse-pm

Project management issue tracker for the Dataverse Project. Note: Related links and documents may not be public.
https://dataverse.org

GREI 5: HDV Task - Large Data Support #176

Open cmbz opened 5 months ago

cmbz commented 5 months ago

Overview

"Support the sharing of very large datasets (>TBs) by integrating the metadata in the repository with the data in the research computing storage" (Source: NIH OTA)

Tasks

Issues

Pending

Resources

cmbz commented 5 months ago

Status: January 2024

Several Globus improvements and bug fixes were made to support large data deposits and integration with research computing services such as the Northeast Storage Exchange (NESE).

Completed

cmbz commented 4 months ago

Status: February 2024

A containerized Dataverse was deployed on Mass Open Cloud (MOC), using Northeast Storage Exchange (NESE) compute and resources, to demonstrate how Dataverse can support computing on large Dataverse datasets stored on NESE tape resources. A demo of the proof of concept was presented at the Mass Open Cloud Alliance Conference on 2024/02/28. Work to fully operationalize the strategy has begun under Epic: Operationalize Large Data and Compute Infrastructure.

Completed

cmbz commented 4 months ago

Status: April 2024

Large Data Support Working Group

Large Data Support Pilot

cmbz commented 2 months ago

Status: May 2024

cmbz commented 2 months ago

Status: June 2024