PominovaMS / denovo_benchmarks


Add reference proteomes and script for programmatic download #28

Open BioGeek opened 2 weeks ago

BioGeek commented 2 weeks ago

Reproducing the benchmark locally requires the exact reference proteomes listed in dataset_tags.tsv, which are not currently available in the repository.

Please consider the following options to provide these files:

  1. Git Large File Storage (Git LFS): Add the reference proteomes to the repository itself, tracked with Git LFS.
  2. External Download Option: Alternatively, provide an external download link for each proteome, along with instructions for placing the files in PROTEOMES_DIR.
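
For the external download option, a helper along the following lines could fetch each proteome into PROTEOMES_DIR. This is only a minimal sketch under several assumptions: that dataset_tags.tsv has a tab-separated header row with a column named `proteome` holding FASTA file names (the column name is a guess), and that the maintainers supply a `sources` mapping from file name to download URL. The function names and the mapping are hypothetical and do not exist in the repository.

```python
import csv
import urllib.request
from pathlib import Path


def read_proteome_names(tags_path, column="proteome"):
    """Return unique, non-empty values of `column` from a TSV file, in file order.

    Assumption: the file has a header row and a column called `column`.
    """
    names = []
    with open(tags_path, newline="") as handle:
        for row in csv.DictReader(handle, delimiter="\t"):
            value = (row.get(column) or "").strip()
            if value and value not in names:
                names.append(value)
    return names


def download_proteomes(tags_path, sources, proteomes_dir,
                       fetch=urllib.request.urlretrieve):
    """Download every proteome named in the tags file into `proteomes_dir`.

    `sources` is a hypothetical {file_name: url} mapping supplied by the
    maintainers. Files that already exist are skipped. Returns the list of
    file names that have no URL in `sources`, so gaps are visible.
    """
    out_dir = Path(proteomes_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    missing = []
    for name in read_proteome_names(tags_path):
        dest = out_dir / name
        if dest.exists():
            continue  # already downloaded on an earlier run
        url = sources.get(name)
        if url is None:
            missing.append(name)
            continue
        fetch(url, str(dest))
    return missing
```

A caller would pass the path to dataset_tags.tsv, the URL mapping, and the value of PROTEOMES_DIR (for example read from an environment variable), then check the returned list for proteomes that still lack a source. The `fetch` parameter is there so the download step can be swapped out or stubbed in tests.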

Additionally, please provide a script that automates the retrieval of reference proteomes. The script should:

This would allow users to: