ababaian / serratus

Ultra-deep search for novel viruses
http://serratus.io
GNU General Public License v3.0

all new nido/cov assemblies #247

Open rchikhi opened 3 years ago

rchikhi commented 3 years ago

List of the 990 accessions where there's possibly a new CoV/nido RdRp according to sra_species_table.tsv: s3://serratus-rayan/pro_new_cov_nido-assembly/all_new_cov_nido.sra

900 could be assembled, here are the:

An immediate take-away is that most of the checkv_filtered assemblies are empty. So I recommend not using them and instead taking gene_clusters.fasta or, to be even more conservative, the whole scaffolds.fasta.
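A minimal Python sketch of that fallback, assuming each accession's assembly outputs have been downloaded into their own local directory. The directory layout and the helper names `count_fasta_records` and `pick_assembly` are hypothetical and not part of the Serratus pipeline; only the file names gene_clusters.fasta and scaffolds.fasta come from the comment above. It counts FASTA header lines and returns the first file that is not empty:

```python
from pathlib import Path


def count_fasta_records(path):
    """Count header lines ('>') in a FASTA file; a missing file counts as 0."""
    p = Path(path)
    if not p.is_file():
        return 0
    with p.open() as handle:
        return sum(1 for line in handle if line.startswith(">"))


def pick_assembly(accession_dir,
                  names=("gene_clusters.fasta", "scaffolds.fasta")):
    """Return the first non-empty FASTA among `names`, or None if all are empty.

    `accession_dir` is an assumed local directory holding one accession's
    assembly outputs; the order of `names` encodes the preference.
    """
    for name in names:
        candidate = Path(accession_dir) / name
        if count_fasta_records(candidate) > 0:
            return candidate
    return None
```

Swapping the order of `names` would give the more conservative behaviour (always prefer scaffolds.fasta).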

rchikhi commented 3 years ago

all contigs having a motifator hit: s3://serratus-rayan/pro_new_cov_nido-assembly/all_new_cov_nido.scaffolds_motifator.whole_contigs_hits.fasta

rchikhi commented 3 years ago

hmmsearch results versus Pfam-A:

s3://serratus-rayan/pro_new_cov_nido-assembly/all_new_cov_nido.scaffolds_motifator.whole_contigs_hits.fasta.transeq.faa.* (run with hmmsearch -A [.sto] --tblout [.tbl] --domtblout [.domtbl] -o [.hmmsearch_stdout] Pfam-A.hmm [contigs.fa])
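For anyone who wants to skim the --tblout file, here is a rough Python sketch that keeps the best-scoring Pfam family per translated sequence. It relies on the standard hmmsearch tabular layout (whitespace-separated, `#` comment lines, target sequence in column 1, profile name and accession in columns 3 and 4, full-sequence E-value and score in columns 5 and 6). The function names and the `max_evalue=1e-5` cutoff are assumptions for illustration, not something used in this thread:

```python
def parse_tblout(lines):
    """Yield one dict per hit from hmmsearch --tblout lines."""
    for line in lines:
        if not line.strip() or line.startswith("#"):
            continue  # skip blank lines and header/footer comments
        # 18 fixed columns, then a free-text description that may contain spaces
        fields = line.split(None, 18)
        if len(fields) < 18:
            continue  # malformed or truncated row
        yield {
            "target": fields[0],      # translated contig (sequence) name
            "family": fields[2],      # Pfam profile name
            "family_acc": fields[3],  # Pfam accession
            "evalue": float(fields[4]),  # full-sequence E-value
            "score": float(fields[5]),   # full-sequence bit score
        }


def best_hit_per_target(lines, max_evalue=1e-5):
    """Map each target to its highest-scoring hit with E-value <= max_evalue."""
    best = {}
    for hit in parse_tblout(lines):
        if hit["evalue"] > max_evalue:
            continue
        current = best.get(hit["target"])
        if current is None or hit["score"] > current["score"]:
            best[hit["target"]] = hit
    return best
```

Usage would be along the lines of `best_hit_per_target(open(path))`, where `path` is a local copy of the .tbl file.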