Open StevenCannon-USDA opened 7 months ago
While the files are in preparation, I have put them here:
/usr/local/www/data/annex/
@adf-ncgr - I'll add to the Glycine pan-genes next.
@StevenCannon-USDA I see the annotations in the annex but not the genomes; if you can point me to those as well when convenient I can run the BUSCO against them. But I'll start on the annotations.
@adf-ncgr Oops. The genomes are now in the expected place.
@StevenCannon-USDA the BUSCO, gfa and gff updates (adding AHRD) are all now in the annex locations; let me know when you've moved them into datastore proper and updated the datastore-metadata repo and then we can commence some of the downstream tasks.
Main steps for adding new genome and annotation collections
Genus/species/collection names:
What are the collection types and names? Example:
Glycine/max/annotations/Lee.gnm3.ann1.ZYY3
Glycine/max/genomes/Lee.gnm3.VG1C
Glycine/max/genomes/Wm82.gnm5.NRKG
Glycine/max/annotations/Wm82.gnm5.ann1.J7HW
[x] Add collection(s) to the Data Store
[x] Validate the README(s)
[ ] Update about_this_collection.yml
[x] Calculate AHRD functional annotations
[x] Calculate gene family assignments (.gfa)
[ ] Add to pan-gene set
[ ] Load relevant mine
[x] Add BLAST targets
[x] Incorporate into GCV
[ ] Update the jekyll collections listing
[ ] Update browser configs
[x] run BUSCO