WormBase / caltech-curation-services

Modular curation services for Caltech WormBase data
1 stars 0 forks source link

Backup /usr/caltech_curation_files #39

Closed valearna closed 9 months ago

valearna commented 1 year ago

Set up duplicity with Raymond to back up curator files and scripts (keep symlinks)

Also need a place for files that shouldn't get backed up.

We need to talk to Wen to figure out what to back up from her files

valearna commented 1 year ago

From 05/08 meeting with Raymond:

Restic seems a great option and it has plenty of config parameters. We can use it to backup data to s3

Backup policy:

Avoid encryption if we can - so that we can access the files even if we lose the keys

Examples:

restic -r s3:s3.amazonaws.com/daily_restic_backup init restic -r s3:s3.amazonaws.com/monthly_restic_backup init

restic -r s3:s3.amazonaws.com/daily_restic_backup /usr/caltech_curation_files --exclude-file=excludes.txt restic -r s3:s3.amazonaws.com/monthly_restic_backup /usr/caltech_curation_files --exclude-file=excludes.txt

restic forget -r s3:s3.amazonaws.com/daily_restic_backup --keep-last 30 --prune restic forget -r s3:s3.amazonaws.com/monthly_restic_backup --keep-last 24 --prune

Exclusion list:

/usr/caltech_curation_files/kimberly/uniprot/uniprot_paper_data/files/

Check with Wen about simplemine files

Restic has options to migrate from one repo to another (for example in case we need to move from s3 to a physical server in the future)

TODO:

valearna commented 1 year ago

Add documentation in form of a README file on github