nathandunn / apollo-performance

A set of scripts for loading Apollo databases and evaluating their performance
BSD 3-Clause "New" or "Revised" License

prepare and load AGR examples locally #11

Closed nathandunn closed 4 years ago

nathandunn commented 4 years ago

For yeast, Ensembl works well:

unrelated, but where can I get your original FASTA and GFF3 files? Are you just grabbing most of them through here? https://uswest.ensembl.org/Saccharomyces_cerevisiae/Info/Index

from @scottcain

For yeast, fly and worm, I got the fastas from the individual MODs' download pages. For zebrafish, mouse, rat and human, I got the fastas from RefSeq (making sure to get the right assemblies!). For the GFFs, I got the URLs from https://fms.alliancegenome.org/api/datafile/by/GFF/ (the url is the "s3path" value appended to http://download.alliancegenome.org/)
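The GFF URL rule described above (the "s3path" value appended to http://download.alliancegenome.org/) can be sketched as a small shell helper. This is only an illustration: the function name `alliance_gff_url` and the variable `ALLIANCE_DOWNLOAD_BASE` are made up here, and it assumes you have already copied an "s3path" value out of the API response by hand.

```shell
# Base URL as given in the comment above.
ALLIANCE_DOWNLOAD_BASE="http://download.alliancegenome.org/"

# alliance_gff_url S3PATH
# Hypothetical helper: prints the download URL for one "s3path" value.
alliance_gff_url() {
    # Drop a single leading slash, if present, so the result has no "//" after the host.
    s3path="${1#/}"
    printf '%s%s\n' "$ALLIANCE_DOWNLOAD_BASE" "$s3path"
}
```

With an "s3path" value stored in a shell variable, the file could then be fetched with something like `wget "$(alliance_gff_url "$s3path")"`.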

For each organism I need to (a) create a correct JBrowse directory, (b) provide a valid GFF3.

nathandunn commented 4 years ago

wget from here: https://s3.console.aws.amazon.com/s3/buckets/apollo-jbrowse-data/?region=us-east-1

nathandunn commented 4 years ago

wget ftp://ftp.flybase.net/genomes/Drosophila_melanogaster/dmel_r6.33_FB2020_02/fasta/dmel-all-chromosome-r6.33.fasta.gz