VerisimilitudeX / DNAnalyzer

Revolutionizing DNA analysis and making it accessible to all through innovative AI-powered analysis and interpretive tools.
Other
130 stars 56 forks source link

Explain the role of G-C content accordingly to organism enviroment, evolutionary status and characteristics #405

Closed i2kmt closed 10 months ago

i2kmt commented 1 year ago

After jumping on to @LimesKey 's issue here and chatting with him, I suggested using the G-C content information to describe the selected organisms. The role of G-C content in DNA varies and can be: 1) Keeping DNA stable (if the organism lives in a high-temperature environment) 2) Let transcription factors bind on and regulate the genome expression of a specific region. 3) Eucaryotic organisms have G-C-rich regions on their DNA. These regions contain many genes, which relates to the previous point.

I will try to see if there is anything else we could relate the G-C content with. What do you think? @VerisimilitudeX

LimesKey commented 1 year ago

The roles of the G-C content in DNA don't seem that useful, would the average G-C content of the entire organism be useful at all? Or would I need the GC content of individual genes?

i2kmt commented 1 year ago

Besides keeping the DNA stable in high-temperature environments, I don't think that the average G-C content should show something more. I think individual genes should be better. I propose to choose some orthologous genes among the organisms and compare their G-C content. That way we could see how close or far the organisms are according to an evolutionary path. This is the simplest thing that comes to my mind for this subject. We can dig even deeper but I don't think it's necessary.

i2kmt commented 1 year ago

You can check this paper: Hongli Du, Haofu Hu, Yuhuan Meng, Weihao Zheng, Fei Ling, Jufang Wang, Xiquan Zhang, Qinghua Nie, Xiaoning Wang, The correlation coefficient of GC content of the genome-wide genes is positively correlated with animal evolutionary relationships, FEBS Letters, Volume 584, Issue 18, 2010, Pages 3990-3994, ISSN 0014-5793

LimesKey commented 1 year ago

So then the most difficult and challanging part of this would be to seperate the entire genome into individual genes for analysis?

i2kmt commented 1 year ago

Yes, but I don't think we should separate it, we should just find orthologous genes among these organisms. I can start working on it and let you know if it's doable

LimesKey commented 1 year ago

But your comparing a few genes to an entire genome? Little confused but your free to work on it. Feel free to use the average gc content files that I created.

i2kmt commented 1 year ago

I believe there are A LOT of orthologous genes. Most of them will be related to basic cellular or tissue functions. So I'm going to focus on how the basic genes have changed through time

VerisimilitudeX commented 1 year ago

Alright @i2kmt, sounds good! Just checking in - how's your research on orthologous genes going? Once you figure out whether we can use it in DNAnalyzer or not, let me know how we could implement it in code. Thanks!

VerisimilitudeX commented 1 year ago

Also I saw the work that you're doing for AcubeSAT and it's super cool to see!

i2kmt commented 1 year ago

Heyy now it's exam period in my uni so I haven't worked on it. I will be able to do so in 2 weeks. Thank you for appreciating AcubeSAT :)

VerisimilitudeX commented 1 year ago

Heyy now it's exam period in my uni so I haven't worked on it. I will be able to do so in 2 weeks. Thank you for appreciating AcubeSAT :)

No worries, good luck!

github-actions[bot] commented 10 months ago

Stale issue message