trainrun / Euplotes_script

0 stars 0 forks source link

Genome decontamination #1

Open stephen-14 opened 6 months ago

stephen-14 commented 6 months ago

Hello everyone, I'm trying to follow the pipeline to eliminate the foreign genomes in the de novo assembled genome. I have a questions regarded to search against the database by DIAMOND. I got the result like a screenshot. What is the strategy to know and select the exogenous genomes? we based on the identify similarity or we will eliminate all the non-ciliated genomes? or another parameters? Thank you! Screenshot from 2023-12-22 15-49-29

trainrun commented 6 months ago

The software Taxonkit can change accession number (the second column) into Gi number and judge whether the number is under ciliates taxon. But you may need to write a script to do that. Good luck!