trainrun / Euplotes_script

1 stars 0 forks source link

Genome decontamination #1

Open stephen-14 opened 11 months ago

stephen-14 commented 11 months ago

Hello everyone, I'm trying to follow the pipeline to eliminate the foreign genomes in the de novo assembled genome. I have a questions regarded to search against the database by DIAMOND. I got the result like a screenshot. What is the strategy to know and select the exogenous genomes? we based on the identify similarity or we will eliminate all the non-ciliated genomes? or another parameters? Thank you! Screenshot from 2023-12-22 15-49-29

trainrun commented 11 months ago

The software Taxonkit can change accession number (the second column) into Gi number and judge whether the number is under ciliates taxon. But you may need to write a script to do that. Good luck!