NAL-i5K / general_issues

for issues and discussions not tied to a specific repository
2 stars 0 forks source link

Generate KOBAS v3.0.3_0 Pathways for all i5k organisms by Sept 2022 #151

Closed suryasaha closed 2 years ago

suryasaha commented 4 years ago

Species to process (Species, assembly accession number, protein fasta location). If protein location fasta is 'refseq', 1. do first, 2. use NCBI datasets to get fasta file

Datasets URL: https://www.ncbi.nlm.nih.gov/datasets/docs/v1/quickstarts/command-line-tools/ Example command: ./datasets download genome accession GCF_000002195.4 --filename test.zip --exclude-gff3  --exclude-seq --exclude-genomic-cds --exclude-rna The list below as a spreadsheet: functional_annotation_list.xlsx

amcooksey commented 2 years ago

Completed Aethina tumida Amyelois transitella

functional_annotation_list.xlsx

amcooksey commented 2 years ago

Completed: Bactrocera cucurbitae Bactrocera dorsalis Bactrocera_oleae Bemisia_tabaci Bombus_impatiens Bombus_terrestris functional_annotation_list.xlsx

amcooksey commented 2 years ago

Completed: Cephus_cinctus Chelonus_insularis Contarinia_nasturtii functional_annotation_list.xlsx

amcooksey commented 2 years ago

Completed: Diachasma_alloeum Diaphorina_citri Drosophila_biarmipes Drosophila_bipectinata Drosophila_ficusphila Drosophila_rhopaloa Drosophila_takahashii Dufourea_novaeangliae functional_annotation_list-3.xlsx

amcooksey commented 2 years ago

Completed: Eufriesea_mexicana Fopius_arisanus Galleria_mellonella Habropoda_laboriosa Leptinotarsa_decemlineata functional_annotation_list-3.xlsx

amcooksey commented 2 years ago

Neodiprion_lecontei,GCF_001263575.1, refseq assembly has been "suppressed"

amcooksey commented 2 years ago

Completed: Nylanderia_fulva Nicrophorus_vespilloides Microplitis_demolitor Megachile_rotundata

functional_annotation_list-3.xlsx

amcooksey commented 2 years ago

Completed: Odontomachus_brunneus Osmia_lignaria Varroa_jacobsoni functional_annotation_list-3.xlsx

amcooksey commented 2 years ago

This is the first batch of non-refseq I have done. I based the file and folder names on the names used in the file paths above. Completed: Tigriopus_californicus Trichogramma_pretiosum Pachypsylla_venusta Parasteatoda_tepidariorum functional_annotation_list-3.xlsx

amcooksey commented 2 years ago

Completed: Onthophagus_taurus Orussus_abietinus Melipona_quadrifasciata Medauroidea_extradentata Mayetiola_destructor Manduca_sexta functional_annotation_list-3.xlsx

amcooksey commented 2 years ago

Completed: Locusta_migratoria Lasioglossum_albipes Laodelphax_striatellus Ladona_fulva Hyalella_azteca Holacanthella_duospinosa Heliothis_virescens Halyomorpha_halys Gerris_buenoi Frankliniella_occidentalis functional_annotation_list-3.xlsx

amcooksey commented 2 years ago

Completed: Anoplophora_glabripennis Blattella_germanica Catajapyx_aquilonaris Centruroides_sculpturatus Clitarchus_hookeri Ephemera_danica Euglossa_dilemma functional_annotation_list-3.xlsx

amcooksey commented 2 years ago

Completed: Neodiprion lecontei functional_annotation_list-3.xlsx