sheynkman-lab / Long-Read-Proteogenomics

A workflow for enhanced protein isoform detection through integration of long-read RNA-seq and mass spectrometry-based proteomics.
MIT License
38 stars 16 forks source link

This creates the accounting for running from Zenodo dealing with the two folders mass_spec and star_genome #143

Closed adeslatt closed 3 years ago

adeslatt commented 3 years ago

PR checklist

REASON In the original amazon S3 bucket there were two folders: star_genome and mass_spec. Using Zenodo as a location instead meant Nextflow would need to understand how to deal with a tar.gz, that is a tar'd and zip'd contents of that folder similarly named. Respectively, the folders became star_genome.tar.gz and mass_spec.tar.gz.

SOLUTION

tar'd and gzipped star_genome_dir and mass_spec, as well as all the fasta files that were large and the RData file.

Updated the conf/test.config file for both the GitHub Actions that do the continuous integration and continuous testing functions, as well as just for testing to make sure the files work.

There is now an accompanying DOI repository for our use.