Week 1
DAY 1 (5th March, 2024)
DAY 2 (6th March, 2024)
I went through the documentation on Ersilia Model Hub installation.
I started the installation process and encountered an error at prerequisite 5, the Isaura data lake installation. The error can be viewed here: isaura.txt
While debugging, I decided to set up the Ersilia environment first using this code:
# create a conda environment
conda create -n ersilia python=3.7
# activate the environment
conda activate ersilia
Afterwards, I ran the Isaura data lake installation again and it was successful. I then realized that the earlier error occurred because building the h5py wheel requires a Python installation in the active environment. That is why the Ersilia environment had to be set up before installing Isaura.
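The ordering problem described above can be caught with a small pre-flight check before installing Isaura. This is only a sketch: the helper name `require_cmd` is hypothetical and is not part of Ersilia or Isaura. It assumes a POSIX shell and simply confirms that a command resolves on the PATH of the active environment.

```shell
# Hypothetical helper (not part of Ersilia or Isaura): fail early if a
# command is not available in the currently active environment.
require_cmd() {
    if command -v "$1" >/dev/null 2>&1; then
        echo "found: $1"
    else
        echo "missing: $1" >&2
        return 1
    fi
}

# Usage, after `conda activate ersilia`:
#   require_cmd python && require_cmd pip
```

If either check fails, the environment is probably not activated yet, which matches the situation that produced the h5py error.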
I proceeded with the other steps outlined, which were successful. To confirm that Ersilia was installed and the CLI was working, I ran the following code:
# see ersilia CLI options
ersilia --help
# see ersilia's model catalog
ersilia catalog
Output: output1.log, catalog_output.log. This output shows that Ersilia was installed successfully and the CLI is working.
Now that Ersilia is recognized in Ubuntu, I tested the eos3b5e model by fetching it, serving it, and calculating the molecular weight as required in the task. I got the following outputs: fetch.log, serve.log, model_output.log
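The fetch, serve, and run sequence described above can be sketched as a single shell function. The function name `run_model_once` is hypothetical, and the subcommands are an assumption based on the Ersilia CLI as used elsewhere in this log (`ersilia serve`, `ersilia api run -i`). The CLI has changed between releases, so `ersilia --help` is the place to confirm them.

```shell
# Sketch of the fetch -> serve -> run -> close sequence for one model.
# Assumes the `ersilia` CLI is installed and its conda environment is active.
run_model_once() {
    model_id="$1"   # e.g. eos3b5e
    smiles="$2"     # e.g. "CCCC"
    ersilia fetch "$model_id" &&
    ersilia serve "$model_id" &&
    ersilia api run -i "$smiles" &&
    ersilia close
}

# Usage:
#   run_model_once eos3b5e "CCCC"
```

Chaining with `&&` stops the sequence at the first failing step, so a failed fetch does not lead to a confusing error from serve or run.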
DAY 3 (7th March, 2024)
sudo apt update
sudo apt install apt-transport-https ca-certificates curl software-properties-common
apt-cache policy docker-ce
sudo systemctl start docker
sudo systemctl enable docker
docker --version
I ran this to check the version of Docker installed and got the output below:
(base) ajoke@DESKTOP-KTJU3QV:~$ docker --version
Docker version 25.0.4, build 1a576c5
TESTING OF MODEL eos4wt0
I pulled the eos4wt0 model from the Ersilia Model Hub and got this output:
(base) ajoke@DESKTOP-KTJU3QV:~$ sudo docker pull ersiliaos/eos4wt0:latest
latest: Pulling from ersiliaos/eos4wt0
8b91b88d5577: Already exists
824416e23423: Already exists
bbe2c2981082: Already exists
7b6b68d15a5c: Already exists
71f8f4db541d: Already exists
4f4fb700ef54: Pull complete
b29b0c06109d: Already exists
ddc20b6d4ab1: Pull complete
bb4587482098: Pull complete
28489519aef7: Pull complete
35554e140baa: Pull complete
Digest: sha256:9738b7353c56e9d26373edd73e6ff299166322b9cbd1513ff3ed85133d038e90
Status: Downloaded newer image for ersiliaos/eos4wt0:latest
docker.io/ersiliaos/eos4wt0:latest
- Then I ran `sudo docker ps`
**Output:**
CONTAINER ID   IMAGE                      COMMAND                  CREATED       STATUS       PORTS                   NAMES
b458fce09d65   ersiliaos/eos4wt0:latest   "sh /root/docker-ent…"   3 hours ago   Up 3 hours   0.0.0.0:37183->80/tcp   eos4wt0_d75e
- I tested the eos4wt0 model that was fetched by running the code below:
ersilia serve eos4wt0
ersilia -v api run -i "CCCC"
**Output:** [model.log](https://github.com/ersilia-os/ersilia/files/14551529/model.log), which is the same output as in the GitBook.
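The `docker ps` check above can be narrowed to the model image with a filter. The sketch below is illustrative only: the function name `is_model_container_up` is hypothetical, and it calls `docker` directly, whereas the commands in this log were run with `sudo`.

```shell
# Sketch: print the names of running containers started from an image,
# and return non-zero if there are none.
# Assumes Docker is installed; prefix with sudo if your user needs it.
is_model_container_up() {
    names=$(docker ps --filter "ancestor=$1" --format '{{.Names}}')
    [ -n "$names" ] && echo "$names"
}

# Usage:
#   is_model_container_up ersiliaos/eos4wt0:latest
```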
**DAY 4 (8th of March 2024)**
**MOTIVATION STATEMENT**
I'm Ajoke Yusuf, a Data Scientist, Machine Learning enthusiast, and SDG 3 advocate. I'm a hardworking, resourceful, goal-oriented individual with strong analytical and problem-solving skills and an unending quest for knowledge. I pride myself on being a fast learner who has honed strong skills in problem-solving and research.
After my experience contributing to Ersilia during the contribution stage in October 2023, I decided to apply again, hoping the Ersilia project would be there, because I had a wonderful learning experience with an amazing community. When I received the Outreachy email and checked the project list, I scrolled down to the letter "E" to look for Ersilia and was excited to see the Ersilia project.
I chose the Ersilia project mainly because its aim and mission align with my goal and career objective as an impact maker and an SDG 3 advocate. My interest in drug discovery was ignited by my personal experience with cerebral malaria, which almost took my life, and by the death of a friend from sickle cell disease.
As an Engineering graduate living in Nigeria, I find the increasing mortality rate from infectious diseases in Nigeria and sub-Saharan Africa alarming, which is why I developed an interest in the biomedical field. According to UNICEF (United Nations International Children's Emergency Fund), **_infectious diseases are the major cause of mortality in children aged ≤ 5 years_**, as cited from this [article](https://data.unicef.org/topic/child-survival/under-five-mortality/). Research from the NIH National Library of Medicine and NCBI (National Center for Biotechnology Information) states that _**"The infrastructure and level of support for surveillance, research, and training on emerging infectious diseases in Africa are extremely limited"**_, as cited from this [article](https://www.ncbi.nlm.nih.gov/books/NBK99567/#:~:text=At%20a%20time%20when%20increasing,%2C%20yellow%20fever%2C%20and%20trypanosomiasis).
With my knowledge of Python for Data Science, some knowledge of Machine Learning, and strong analytical and research skills, I believe that contributing to this project will help me gain knowledge and technical skills that will help advance and improve health research in Nigeria, Africa, and eventually the world.
**If accepted for the 3-month internship**, I'll commit myself to bringing suggestions, carrying out research, and collaborating with the Ersilia team while learning and honing skills in Artificial Intelligence and Machine Learning. This internship will strengthen my research and problem-solving skills, which will be useful in the long run for advancing technology in the health sector and making a sustainable impact on health research in Nigeria. I firmly believe that the availability and accessibility of scientific tools and data-driven insights are necessary for solving prevalent health challenges. As a young lady living in Nigeria, an underdeveloped and low-income country, I have experienced the challenges of accessing tools for prevalent infectious diseases in my community and in Nigeria at large.
**After the internship**, I plan to use the skills gained to improve and sustain health research tools, help solve prevalent health issues in Nigeria, and reduce the mortality rate caused by infectious diseases, thereby building sustainable research skills that will leave a long-lasting impact on the health sector in my community, Nigeria, sub-Saharan Africa, and eventually the world.
WEEK 2
TASK 1
Model selected - eos2ta5
Repository created - here
@DhanshreeA @GemmaTuron Please, I would appreciate any feedback.
TASK 2: MODEL REPRODUCIBILITY
A detailed explanation can be found in the repository above.
IMPLEMENTATION OF THE AUTHORS' MODEL
I took the following steps to implement the authors' source code using the Ubuntu terminal:
I already had the conda dependencies installed.
Set up the cardiotox environment with conda
# create a conda environment
conda create -n cardiotox python=3.7.7
# activate the environment
conda activate cardiotox
Installing PyBioMed
cd cardiotox
cd PyBioMed
python setup.py install
Return to the cardiotox directory
cd ..
Installing the package versions the authors used
pip install tensorflow==2.3.1
pip install sklearn==0.0
pip install mordred==1.2.0
pip install pybel==0.14.10
pip install keras==2.4.3
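Because reproducibility depends on these exact versions, it can help to confirm what actually got installed before running the test. The sketch below is an assumption-labeled illustration, not part of the authors' repository: `check_pin` is a hypothetical helper that reads the `Version:` line from `pip show` and compares it with the expected pin.

```shell
# Sketch: compare an installed package version with the pinned one.
# Assumes `pip` belongs to the active cardiotox environment.
check_pin() {
    pkg="$1"
    want="$2"
    have=$(pip show "$pkg" 2>/dev/null | sed -n 's/^Version: //p')
    if [ "$have" = "$want" ]; then
        echo "ok: $pkg==$want"
    else
        echo "mismatch: $pkg wanted $want, found ${have:-none}" >&2
        return 1
    fi
}

# Usage, inside the cardiotox environment:
#   check_pin tensorflow 2.3.1
#   check_pin mordred 1.2.0
#   check_pin keras 2.4.3
```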
Testing the model
python test.py
OUTPUT:
RESULT COMPARISON OF THE CARDIOTOX & eos2ta5 MODEL
Week 1 - Get to know the community
Week 2 - Get Familiar with Machine Learning for Chemistry
Week 3 - Validate a Model in the Wild
Week 4 - Prepare your final application