MrOlm / inStrain

Bioinformatics program inStrain
MIT License

genes with allele frequency differences #160

Open haihao999 opened 9 months ago

haihao999 commented 9 months ago

Hi Matt, I have a simple question: if I want to find genes with allele frequency differences, do I compare the con_freq of a given locus, or the average con_freq across all loci in the gene? Thanks!

MrOlm commented 9 months ago

Hi @haihao999 -

This is a good question, but more complex than you'd initially think. There are many, many different ways to compare allele frequency differences, and both of the methods you mention are sometimes used. It really depends on the question you want to answer.

I will also mention that you might want to compare the "ref_freq" instead. "con_freq" is the frequency of the consensus base (the most common base), so the consensus base can change from sample to sample. "ref_freq" is based on the reference genome, though, so it will be reporting the frequency of the same base in each sample.
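To make the two options from the question concrete, here is a minimal sketch (not an inStrain feature) that computes the per-locus difference in "ref_freq" between two samples and then summarizes it per gene. It assumes each sample's SNV table has been read into a list of dicts (for example with `csv.DictReader(f, delimiter="\t")`) and has `scaffold`, `position`, `gene`, and `ref_freq` columns; the column names, the helper function names, and the toy values are all assumptions for illustration. Only positions reported in both samples are compared, since a position missing from one table cannot safely be assumed to match the reference.

```python
from collections import defaultdict
from statistics import mean


def per_site_ref_freq_diff(snvs_a, snvs_b):
    """Per-locus ref_freq difference (sample B minus sample A).

    Rows are dicts with string values, as csv.DictReader would produce.
    Column names are assumptions; adjust them to match your tables.
    """
    freq_b = {
        (row["scaffold"], int(row["position"])): float(row["ref_freq"])
        for row in snvs_b
    }
    diffs = []
    for row in snvs_a:
        key = (row["scaffold"], int(row["position"]))
        if key in freq_b:  # keep only positions present in both samples
            diffs.append({
                "scaffold": key[0],
                "position": key[1],
                "gene": row["gene"],
                "diff": freq_b[key] - float(row["ref_freq"]),
            })
    return diffs


def summarize_by_gene(diffs):
    """Mean and max absolute ref_freq difference per gene."""
    by_gene = defaultdict(list)
    for d in diffs:
        by_gene[d["gene"]].append(abs(d["diff"]))
    return {
        gene: {
            "mean_abs_diff": mean(values),
            "max_abs_diff": max(values),
            "n_sites": len(values),
        }
        for gene, values in by_gene.items()
    }


# Toy data (made up for illustration only).
sample_a = [
    {"scaffold": "s1", "position": "10", "gene": "g1", "ref_freq": "0.9"},
    {"scaffold": "s1", "position": "20", "gene": "g1", "ref_freq": "0.5"},
    {"scaffold": "s1", "position": "30", "gene": "g2", "ref_freq": "1.0"},
]
sample_b = [
    {"scaffold": "s1", "position": "10", "gene": "g1", "ref_freq": "0.1"},
    {"scaffold": "s1", "position": "20", "gene": "g1", "ref_freq": "0.5"},
    {"scaffold": "s1", "position": "30", "gene": "g2", "ref_freq": "0.8"},
]

diffs = per_site_ref_freq_diff(sample_a, sample_b)
summary = summarize_by_gene(diffs)
for gene, stats in summary.items():
    print(gene, stats)
```

In the toy data, gene `g1` has one locus with a large shift and one with none, so its per-gene mean is much smaller than its per-locus maximum. That is the trade-off between the two approaches: averaging over a gene can hide a single strongly shifted site, while per-locus comparison keeps it visible.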

Best, Matt

haihao999 commented 9 months ago

Hi Matt, thank you very much for your tools and advice. I aim to find the genes that are adapted to both ammonium concentrations. I will try it out, and hopefully it will be useful! Best, yanpeng