First of all, the paragraph is well-written and clearly engaging. If I had one more thing to add, I would want to highlight the importance of multiomics studies. Why is it better to corroborate findings in multiple lines of evidence? Are they any statistical challenges that we need to overcome?
It is important to consider the temporal axis (four years) regarding the methodology. Perhaps, another implicit axis would be the immune landscape.
What would be the best strategy to call differential expression genes and pathways? Do you expect to see the same patterns across different data modalities? How would you integrate multi-modal signals? Is it a multi-input-output regression model? Do you want to extract some values to summarize the multimodality (e.g., Multi-Omics Factor Analysis)?
Nice to meet you all! I will be one of your project mentors for this group project. Here are some of my thoughts on your initial proposal:
Interesting proposal and choice of dataset!
In your final proposal, outline the specific objectives/aims you will study and describe the methodology for your proposed analysis.
Emphasis on any hypothesis you might have from the literature for the differential gene expression between healthy and diabetic/prediabetic cases in your final proposal
"predicting insulin resistance is possible in a prediabetes state" - how do you propose to do this analysis? Are you planning on using machine learning algorithms or any other methods? How would you transition from the DEG analysis to this? Adding more details on these would be really helpful in your final proposal
First of all, the paragraph is well-written and clearly engaging. If I had one more thing to add, I would want to highlight the importance of multiomics studies. Why is it better to corroborate findings in multiple lines of evidence? Are they any statistical challenges that we need to overcome?
It is important to consider the temporal axis (four years) regarding the methodology. Perhaps, another implicit axis would be the immune landscape.
What would be the best strategy to call differential expression genes and pathways? Do you expect to see the same patterns across different data modalities? How would you integrate multi-modal signals? Is it a multi-input-output regression model? Do you want to extract some values to summarize the multimodality (e.g., Multi-Omics Factor Analysis)?