Closed suryasaha closed 2 years ago
Completed: Aethina_tumida Amyelois_transitella
Completed: Bactrocera_cucurbitae Bactrocera_dorsalis Bactrocera_oleae Bemisia_tabaci Bombus_impatiens Bombus_terrestris functional_annotation_list.xlsx
Completed: Cephus_cinctus Chelonus_insularis Contarinia_nasturtii functional_annotation_list.xlsx
Completed: Diachasma_alloeum Diaphorina_citri Drosophila_biarmipes Drosophila_bipectinata Drosophila_ficusphila Drosophila_rhopaloa Drosophila_takahashii Dufourea_novaeangliae functional_annotation_list-3.xlsx
Completed: Eufriesea_mexicana Fopius_arisanus Galleria_mellonella Habropoda_laboriosa Leptinotarsa_decemlineata functional_annotation_list-3.xlsx
Neodiprion_lecontei, GCF_001263575.1: the RefSeq assembly has been "suppressed".
Completed: Nylanderia_fulva Nicrophorus_vespilloides Microplitis_demolitor Megachile_rotundata
Completed: Odontomachus_brunneus Osmia_lignaria Varroa_jacobsoni functional_annotation_list-3.xlsx
This is the first batch of non-RefSeq species I have done. I based the file and folder names on the names used in the file paths above. Completed: Tigriopus_californicus Trichogramma_pretiosum Pachypsylla_venusta Parasteatoda_tepidariorum functional_annotation_list-3.xlsx
Completed: Onthophagus_taurus Orussus_abietinus Melipona_quadrifasciata Medauroidea_extradentata Mayetiola_destructor Manduca_sexta functional_annotation_list-3.xlsx
Completed: Locusta_migratoria Lasioglossum_albipes Laodelphax_striatellus Ladona_fulva Hyalella_azteca Holacanthella_duospinosa Heliothis_virescens Halyomorpha_halys Gerris_buenoi Frankliniella_occidentalis functional_annotation_list-3.xlsx
Completed: Anoplophora_glabripennis Blattella_germanica Catajapyx_aquilonaris Centruroides_sculpturatus Clitarchus_hookeri Ephemera_danica Euglossa_dilemma functional_annotation_list-3.xlsx
Completed: Neodiprion_lecontei functional_annotation_list-3.xlsx
Species to process (species, assembly accession number, protein FASTA location). If the protein FASTA location is 'refseq': 1. do these species first, 2. use NCBI datasets to get the FASTA file.
Datasets URL: https://www.ncbi.nlm.nih.gov/datasets/docs/v1/quickstarts/command-line-tools/ Example command:
./datasets download genome accession GCF_000002195.4 --filename test.zip --exclude-gff3 --exclude-seq --exclude-genomic-cds --exclude-rna
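For working through several RefSeq accessions, the example command could be wrapped in a small helper. This is only a sketch: the function name `datasets_cmd` and the `<accession>.zip` output name are assumptions for illustration, and the flags are copied unchanged from the example command above. It prints the commands as a dry run instead of executing them.

```shell
# Hypothetical helper: print the datasets command for one accession (dry run).
# Flags are copied from the example command above; the <accession>.zip
# output name is an assumption, not something the list prescribes.
datasets_cmd() {
    accession="$1"
    printf './datasets download genome accession %s --filename %s.zip --exclude-gff3 --exclude-seq --exclude-genomic-cds --exclude-rna\n' "$accession" "$accession"
}

# Print one command per accession; add more accessions from the spreadsheet here.
for acc in GCF_000002195.4; do
    datasets_cmd "$acc"
done
```

Piping the printed lines to `sh` would run the downloads, assuming the `datasets` binary sits in the current directory as in the example.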
The list below as a spreadsheet: functional_annotation_list.xlsx