ersilia-os / ersilia

The Ersilia Model Hub, a repository of AI/ML models for infectious and neglected disease research.
https://ersilia.io
GNU General Public License v3.0
211 stars 149 forks source link

✍️ Contribution period: Tshifhumulo Mapasa #834

Closed Tshifhumulo10 closed 11 months ago

Tshifhumulo10 commented 1 year ago

Week 1 - Get to know the community

Week 2 - Install and run an ML model

Week 3 - Propose new models

Week 4 - Prepare your final application

DhanshreeA commented 1 year ago

Hi @Tshifhumulo10 can you confirm if you were able to run Ersilia with the simplest model and obtain the desired output as mentioned in the instructions?

Tshifhumulo10 commented 1 year ago

Hi @Tshifhumulo10 can you confirm if you were able to run Ersilia with the simplest model and obtain the desired output as mentioned in the instructions?

Hi @DhanshreeA thank you for checking up. I was able to run Ersilia with the simplest model and obtained output as mentioned in the instructions. I was able to do so after reading the instruction given by @HellenNamulinda, https://github.com/ersilia-os/ersilia/issues/820#issuecomment-1744307345

Tshifhumulo10 commented 1 year ago

Retrospection

  1. joined the Slack community, and I was so intrigued by the collaboration, and I am happy to be part of the community.
  2. Opened the issue, and I was very motivated when I saw that other interns had already completed this week to-do list.
  3. installed the Ersilia model hub and tested the simplest model and below are the steps that I took:
Tshifhumulo10 commented 1 year ago

Motivation statement to work at Ersilia

I am drawn to the Ersilia project for several compelling reasons. First and foremost, I believe in the transformative power of data. In the words of Clive Humby, "Data is the new oil," and I've witnessed firsthand how harnessing data can drive innovation and impact lives positively.

With a background in Biochemistry and Microbiology, I ventured into the world of data science through intensive coursework. As I navigated through various projects and opportunities, I was pleasantly surprised to stumble upon Ersilia. What immediately struck me was the project's harmonious blend of cutting-edge technology and domain expertise.

Having spent time working in a laboratory, I understand the tangible difference Ersilia's projects make in the scientific community and beyond. I am passionate about being part of a team that bridges the gap between data scientists and experimental researchers, making AI/ML expertise accessible to scientists worldwide. This aligns perfectly with my values of transparency, knowledge sharing, and collaboration.

Ersilia's dedication to advancing technology in machine learning, cloud computing, and data privacy promises to keep me at the forefront of technological progress—a prospect that excites me greatly. Furthermore, the project's unique fusion of chemistry, molecular biology, and computational pharmacology, combined with a deep commitment to global health, presents a compelling challenge and an opportunity to create meaningful change.

Lastly, the focus on open-source initiatives aimed at generating low-cost drugs is a testament to Ersilia's commitment to making a positive societal impact. I am honored and eager to contribute my skills and enthusiasm to this noble cause. I am deeply motivated to join the Ersilia project because it embodies the values and interests of innovation, collaboration, technology, and making a tangible difference in the world through data-driven solutions. I eagerly anticipate the opportunity to contribute to this remarkable endeavor.

DhanshreeA commented 1 year ago

Hi @Tshifhumulo10 thank you for the updates. Please record this as your first contribution on the Outreachy website, and proceed with the tasks from week 2.

Tshifhumulo10 commented 1 year ago

Hi @Tshifhumulo10 thank you for the updates. Please record this as your first contribution on the Outreachy website, and proceed with the tasks from week 2.

Hi @DhanshreeA, I will do just that.

DhanshreeA commented 1 year ago

Hey @Tshifhumulo10 all good? Do you need any help?

Tshifhumulo10 commented 1 year ago

Hey @Tshifhumulo10 all good? Do you need any help?

Hey @DhanshreeA, thank you for checking up. I have been trying to use "NCATS Rat Liver Microsomal Stability" but it took forever to load, So I have switched to STOUT.

Tshifhumulo10 commented 1 year ago

Week 2

  1. Select a model from the suggested list

From the suggested list, I was drawn to the “NCats Rat Liver Microsomal Stability” because I have always been fascinated about how drugs operate. However, it was taking forever to install, so I switched to STOUT. Manually translating the SMILES to their IUPAC names can be challenging, thus STOUT is pivotal for such endeavor. I have chosen STOUT model because I understand the impact that these models can make for chemists and researcher in saving their time, reducing errors and increasing the credibility of their work since these models have an accuracy of 90%.

  1. Install the model in your system
Tshifhumulo10 commented 1 year ago
  1. Run predictions for the EML
import pandas as pd
from STOUT import translate_forward, translate_reverse
df=pd.read_csv('eml_canonical (3).csv')
name_smiles = list(df['smiles'].head(10))
IUPAC_nm=[]
for i in name_smiles:
    IUPAC_name = translate_forward(i)
    IUPAC_nm.append(IUPAC_name)
    DF=pd.DataFrame({"SMILES": name_smiles,"Predicted IUPAC": IUPAC_nm})
for i in range(0, 10):
    #print("IUPAC name of " + DF.loc[ i, "SMILES"] + " is: " + DF.loc[i, "Predicted IUPAC"])

-OUTPUT Screenshot 2023-10-13 133959

OUTPUT: Screenshot 2023-10-13 140519

Tshifhumulo10 commented 1 year ago
  1. Compare results with the Ersilia Model Hub implementation!

Screenshot 2023-10-15 134626

1.ersilia -v api run -i 'Nc1nc(NC2CC2)c3ncn([C@@H]4C[C@H](CO)C=C4)c3n1'

Output: Screenshot 2023-10-15 135038

  1. ersilia -v api run -i 'C[C@]12CC[C@H](O)CC1=CC[C@@H]3[C@@H]2CC[C@@]4(C)[C@H]3CC=C4c5cccnc5'

Output: Screenshot 2023-10-15 135525

  1. ersilia -v api run -i 'CC(=O)Nc1sc(nn1)[S](N)(=O)=O' Output Screenshot 2023-10-15 135802

  2. ersilia -v api run -i 'CC(O)=O' Output Screenshot 2023-10-15 135953

  3. ersilia -v api run -i 'CC(=O)N[C@@H](CS)C(O)=O' Output Screenshot 2023-10-15 140203

DhanshreeA commented 1 year ago

Hi @Tshifhumulo10 thank you for the updates so far. Could you comment on the comparison between results obtained using original implementation of STOUT vs the Ersilia implementation? For example, do you notice any differences if any?

You can also move to week 3 tasks afterwards.

GemmaTuron commented 11 months ago

Hello,

Thanks for your work during the Outreachy contribution period, we hope you enjoyed it! We will now close this issue while we work on the selection of interns. Thanks again!