Vitek-Lab / MSstatsBioNet

https://vitek-lab.github.io/MSstatsPPI/
Other
0 stars 0 forks source link

[Pruthvi] Implement ID Conversion & Protein Function Query Web Service #15

Open tonywu1999 opened 5 days ago

tonywu1999 commented 5 days ago

Context

MSstats output generally contains uniprot IDs or uniprot mnemonic IDs, but not HGNC IDs. In order to query INDRA properly, HGNC IDs are ideal. Fortunately, there are INDRA python client libraries that enable conversion from uniprot to HGNC IDs. Furthermore, there is also a lack of annotation on the protein functions of each protein, e.g. is the protein a kinase? INDRA's python client libraries also enable queries on whether a protein is a kinase, TF, etc.

Since MSstatsBioNet is implemented in R, we need a way to integrate these libraries somehow in R. The best solution at a higher level has been determined to place these libraries behind a web service. Then, MSstatsBioNet will call this web service for ID conversion and protein function queries.

See this python notebook for reference.

Subtasks

Acceptance Criteria

We should be able to perform ID conversions and protein function queries from MSstatsBioNet.

tonywu1999 commented 5 days ago

After discussing with Klas/Ben, we came with the final conclusion: