AnushaPB / ucb_data_storage

Unofficial overview of data storage options provided by UC Berkeley
https://anushapb.github.io/ucb_data_storage/
0 stars 0 forks source link

UC Berkeley Data Storage

This unofficial guide is intended to provide a brief overview of data storage options provided by UC Berkeley. The information included in this guide is up to date as of 6/1/2024, however storage options are subject to change. Please consult with the Berkeley Research Data Management Program for the most up to date information and support.


Table of Contents


Box

*SPAs are accounts with CalNet IDs that can be shared by multiple users. They allow for persistence in data access as they are not tied to individuals.


Google Drive

MyDrive (bdrive)

Shared Drives


Wasabi

This is a relatively inexpensive (e.g., cheaper than google) and easy to use option for data storage. It is user friendly cloud storage and offers a simple GUI option to access and drop files from wasabi onto your local drive.


Cloud Options

  1. Amazon Web Services (AWS)
  2. Google Cloud Platform (GCP)
  3. Microsoft Azure

Active Archive Object Storage (AAOS)


Savio Condo Storage


Enterprise Storage

*This storage option is probably too expensive for most individuals and labs and is used more for business processes on campus, but it is included here since it is a Berkeley storage offering


CalShare

Intended for departmental collaboration, departmental intranet websites, and low-volume P4 data storage (not for individual use).


Additional notes

Privacy and security: most of the Berkeley data storage options have a Data Security Level of P3. For P4 data storage options check out CalShare, Secure Research Data and Compute, and Active Archive Object Storage (AAOS).

Note about non-berkeley affiliated storage options: there are other options for storage (such as Dropbox, OneDrive, and iCloud) that UC Berkeley does not have an existing contract with. Using these services comes with greater risks because UC Berkeley contracts have more security and liability measures in place to protect you and your data. Here is basic information about pricing for some common non-affiliated services:

Unofficial Advice: I created this guide to serve as a resource for UC Berkeley graduate students, post-docs, staff, and PIs who are responsible for generating and maintaining research data. For those looking for the short answer to “what do I do with my data?”, based on the resources described here I think the cheapest and easiest option for labs or research groups is to pay for storage via Wasabi or Google Shared Drives. For individuals who need to store their data independently, Box/Google Drive work if you need less than 2 TB and Wasabi seems to be the easiest and cheapest option if you have more than that. There are definitely more cost effective options described above that may take a bit more work to initially set-up. - Anusha

Sources

Berkeley Research Data and Management: Data Storage & Backup

Berkeley bConnected: Alternative Storage Options

Contributors

Anusha Bishop (Creator, maintainer)

Rick Jaffe