ersilia-os / ersilia

The Ersilia Model Hub, a repository of AI/ML models for infectious and neglected disease research.
https://ersilia.io
GNU General Public License v3.0
189 stars 123 forks source link

✍️ Contribution period: <Nwuguru Chidiebere> #1013

Closed Nwuguru-Chidiebere-Sullivan closed 3 months ago

Nwuguru-Chidiebere-Sullivan commented 4 months ago

Week 1 - Get to know the community

Week 2 - Get Familiar with Machine Learning for Chemistry

Week 3 - Validate a Model in the Wild

Week 4 - Prepare your final application

Nwuguru-Chidiebere-Sullivan commented 4 months ago

Screenshot 2024-03-08 041656

Installing docker and connecting it to my terminal gave me a whole lot of difficulties; I spent two nights on it. It was this difficult because I was initially trying to run systemctl command in WSL environment. How I resolved this and was able to run docker from my Ubuntu terminal was:

This way, I was able to run 'hello-world' with docker from my Ubuntu terminal and it went successfully.

Nwuguru-Chidiebere-Sullivan commented 4 months ago

Screenshot 2024-03-08 044701

Attached is the proof to show that Ersilia was recognised by my CLI and was able to run a simple model as instructed.

Nwuguru-Chidiebere-Sullivan commented 4 months ago
fetching from dockerhub

Attached is the proof showing that I was able to pull a model image, test the mode I fetched from DockerHub, and was able to generate the Morgan Fingerprints for a molecule.

Nwuguru-Chidiebere-Sullivan commented 4 months ago

Motivation Statement

It was in my last Molecular Biology class as a final-year medical student, while we were being taught Bioinformatics by a lecturer who I didn't quite like and barely attended his classes, especially because students seldom make good grades in his courses, that I first heard about data analysis. I still remember the lecturer's exact words as he asked, "Who in this class knows about data analysis and how it applies to healthcare?" None of us in the class knew what it was, and it wasn't even a surprise because, in the part of Nigeria where I lived and schooled, the presence of tech industries can only be imagined, even at this point. After the class, I went home determined to research data analysis, especially since the lecturer did say a few things about it and how it applies to healthcare. During my research, I came across data science and even loved it the most, especially how it applies to healthcare. Ever since then, the trajectory of my career as a young medical professional changed; I no longer wanted to practice directly from the hospital, I wanted to explore data science and proffer a more robust solution to many healthcare challenges by leveraging the power of actionable insights that can be uncovered from clinical data. This quest led me to a rigorous self-study that enabled me to acquire data analytical and machine learning skills.

Although I have been able to learn and acquire analytical skills such as Python, SQL, Excel, Tableau, ML, pandas, seaborn, matplotlib, scikit-learn, Git, MongoDB, Jupyter Notebook, Google Colab, etc, the journey has never been easy and I still lack substantial work experience, especially for someone who came from a non-tech background and currently resides in a technologically disadvantaged area; I needed an enabling environment that would allow me to implement my skills, collaborate with others, and further learn from more experienced data professionals. So late last year, I came across the Outreachy internship and how it supports technologically disadvantaged individuals in gaining substantial experience through their internship; getting into the Outreachy internship became my big dream because that was the way I could get to finally advance my tech journey, especially as someone who wishes to integrate his medical expertise with data science and machine learning to make a meaningful impact in the healthcare industry.

On getting into the Outreachy contribution phase, I came across Ersilia and the project resonated so much with me, especially since it is geared towards a course that would help in combating infectious diseases in most underdeveloped places, just like where I come from. As someone whose long-term goal is to gather enough experience and become a healthcare data scientist to reckon with, a project geared towards ready-to-use AI/ML models for biomedical research was just everything I needed at this stage. Working with Ersilia will not only help advance my career but will inculcate a culture of collaboration at a high level and provide the enabling environment where I can put into practice the skills I have learned so far, thus, helping to shape my future as a healthcare data scientist, while in return, I will play a part in advancing the course of Ersilia project through my meaningful contributions, especially as it concerns improving drug discovery process and the general healthcare system through Artificial Intelligence and Machine Learning. I am confident that with my medical background and technical skills, I will make a meaningful contribution to this project. Finally, I also believe that this internship if given to me will help equip me with ways to deploy machine-learning approaches to solving so many health challenges confronting Sub-Saharan Africa, Nigeria, particularly the place I come from where adequate healthcare is a luxury.

DhanshreeA commented 4 months ago

@Nwuguru-Chidiebere-Sullivan Quick question, if you have access to an Ubuntu machine why not work on that? WSL has been more painful to work with in our experience as well.

Nwuguru-Chidiebere-Sullivan commented 4 months ago

@DhanshreeA Yeah, I am currently working directly from an Ubuntu machine. I just don't know for what reason my computer was finding it difficult to integrate docker into my Ubuntu terminal until I had done it from WSL, but now it's working well in my Ubuntu terminal.

GemmaTuron commented 3 months ago

Hi @Nwuguru-Chidiebere-Sullivan

You have ticked off W2 tasks but we have not seen updates here. Please can you detail the tasks and link to your repository so we can provide feedback?

Nwuguru-Chidiebere-Sullivan commented 3 months ago

Hi @GemmaTuron, @DhanshreeA, I had a hard time concluding the tasks for Week 2 and 3 but I'm glad I persisted, it allowed me to learn a lot through the difficulties.

Here is the link to my GitHub Repo containing my detailed works on the week 2 tasks. I shall also detail my encounters, struggles, and findings from the week 2 tasks in the section below after I am done implementing any recommendation(s) from you. Please, I look forward to your feedback so I can be able to move on to week 3 tasks or move straight to my final application, especially since time is almost running against me. Thank you!

GemmaTuron commented 3 months ago

Hi @Nwuguru-Chidiebere-Sullivan,

Good work, I don't have any specific feedback, just a clarification. task 3 actually refers to week 3 task where you are validating the model externally? Please clarify this and then move onto preparing your final application

Nwuguru-Chidiebere-Sullivan commented 3 months ago

Hi @Nwuguru-Chidiebere-Sullivan,

Good work, I don't have any specific feedback, just a clarification. task 3 actually refers to week 3 task where you are validating the model externally? Please clarify this and then move on to preparing your final application

Thank you so much @GemmaTuron and @DhanshreeA for the feedback. I have indicated the task for week 3 as you instructed in my notebook and ReadMe.