GDKO / AvP

Automatic evaluation of HGTs
GNU General Public License v3.0
22 stars 2 forks source link

I don't want to exclude any species. #15

Closed maruiqi0710 closed 8 months ago

maruiqi0710 commented 10 months ago

My target species is Saccharomyces cerevisiae, and I want to detect any horizontal gene transfer from other species, including other species in the Saccharomyces genus. I set groups.yaml to

Ingroup:
4932: S.cerevisiae

How should I fill in the EGP field? I don't want to exclude any species.

GDKO commented 9 months ago

Hi @maruiqi0710,

If you want to detect HGTs in S.cerevisiae I would recommend

---
Ingroup:
 4930: Saccharomyces
EGP:
 4932: S.cerevisiae

This will identify HGTs that are present only in S.cerevisiae and not in other Saccharomyces species.

The problem by allowing HGTs from closely related species is that we cannot distinguish between vertical and horizontal transfers.

maruiqi0710 commented 9 months ago

I am a little confused now. Maybe my description wasn't accurate enough. My studied organism is S.cerevisiae, and I want to know HGT from other species to S.cerevisiae, including from other Saccharomyces species. If I set "Ingroup: 4930: Saccharomyces, EGP: 4932: S.cerevisiae", will I only detect HGTs from Non-Saccharomyces species to S.cerevisiae?

Thanks a lot.

GDKO commented 9 months ago

Let me explain with an example

Suppose your species of interest is S.cerevisiae with taxonomy

phylum:Ascomycota;class:Saccharomycetes;order:Saccharomycetales;family:Saccharomycetaceae;genus:Saccharomyces

It is impossible to distinguish vertical from horizontal transmission in closely related species with AvP. If you set ingroup to S. cerevisiae then apart from novel genes in this specific species everything else will be HGT according to the algorithm.

AvP is not suited for this type of analysis. If you want to find HGTs from other Saccharomyces species, I would recommend running a gene-tree vs species-tree type of analysis.