Closed GemmaTuron closed 1 year ago
In this example, following the tasks of Wednesday 21st June:
I have created all the templates for the interns and spent a couple of hours revising the GitHub Project and updating the tasks. I have been working on identifying the bug in the GitHub Actions, solved in this issue. I have set a meeting with @miquelduranfrigola to discuss in detail his comments on the Model testing discussion, but I haven't been able to start writing yet; I aim to do so by the end of the week. Looking forward to the next WCAIR lesson!
Pro tip: adding the links to the issues and discussions you mention will be very helpful!
Wednesday 21st June: Tasks
Model eos31ve, whose changes were merged, was tested, and feedback shows it is working, although its arm64 build failed while the amd64 build succeeded.
Docker images are quite big (4 GB+), and my limited internet connection hindered the testing.
Thursday 22nd June: Tasks
eos92sw: https://github.com/ersilia-os/eos92sw/issues/3
eos9tyg using CLI and Docker: https://github.com/ersilia-os/eos9tyg/issues/4
eos7pw8 using CLI and Docker: https://github.com/ersilia-os/eos7pw8/issues/9#issuecomment-1601551005
eos526j using Docker: https://github.com/ersilia-os/eos526j/issues/4
I had slow internet issues and could not fetch the models pending testing. But I'm working around my network and will have these tested by the end of the weekend.
Friday 23rd June: Tasks
eos526j using Docker: https://github.com/ersilia-os/eos526j/issues/4
eos9tyg using CLI and Docker when given output files: https://github.com/ersilia-os/eos9tyg/issues/4
eos2r5a using CLI when given output files: https://github.com/ersilia-os/eos2r5a/issues/6
eos7pw8 using CLI and Docker: https://github.com/ersilia-os/eos7pw8/issues/9
I'm working to resolve the pending tasks by the end of the weekend.
I tested models eos9tyg and eos2r5a, which were initially not allowing me to pass an output file. Both worked with output files.
Models eos526j and eos7pw8 work well using Google Colab, but they gave null outputs when using CLI and Docker.
For model eos65rt, I was able to resolve the package dependency conflicts and created a PR.
Monday 26th June: Tasks
eos2re5: https://github.com/ersilia-os/eos2re5/issues/4
Testing model eos2re5 using Colab failed because it required shells to be activated. The sudo commands also required root access when using the CLI, and even after providing the password, fetching the model took too long.
The first PR I made for refactoring model eos65rt didn't capture all the commits for updating the API, so it was failing when tested using ersilia run. I have now pushed the changes for updating the API to run.
Currently, ersilia crashes when given invalid SMILES (wrong inputs). Instead of continuing to predict for the valid ones, execution halts because the exception isn't handled correctly.
And since most models don't have a way of running predictions only for valid SMILES, this has to be handled by ersilia so that only valid SMILES are passed at inference time. For the wrong inputs, the prediction value should be a message such as "Invalid smile".
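The proposal above could be sketched roughly as follows. This is only an illustration, not ersilia's actual code: `looks_like_smiles`, `predict_valid_only` and `toy_model` are hypothetical names, and the validity check is deliberately crude (a real one would parse the string with a cheminformatics toolkit).

```python
from typing import Callable, List, Sequence, Union

INVALID_MESSAGE = "Invalid smile"  # placeholder text taken from the proposal


def looks_like_smiles(smiles: str) -> bool:
    """Hypothetical, crude check: non-empty, no whitespace, balanced brackets."""
    if not smiles or any(ch.isspace() for ch in smiles):
        return False
    closing = {")": "(", "]": "["}
    stack: List[str] = []
    for ch in smiles:
        if ch in "([":
            stack.append(ch)
        elif ch in closing:
            if not stack or stack.pop() != closing[ch]:
                return False
    return not stack


def predict_valid_only(
    inputs: Sequence[str],
    predict: Callable[[List[str]], List[float]],
    is_valid: Callable[[str], bool] = looks_like_smiles,
) -> List[Union[float, str]]:
    """Send only valid inputs to the model and keep the original input order."""
    flags = [is_valid(s) for s in inputs]
    valid = [s for s, ok in zip(inputs, flags) if ok]
    predictions = iter(predict(valid)) if valid else iter([])
    # Invalid inputs get the message instead of halting the whole run.
    return [next(predictions) if ok else INVALID_MESSAGE for ok in flags]


def toy_model(batch: List[str]) -> List[float]:
    """Stand-in for a real model: returns the string length as a fake score."""
    return [float(len(s)) for s in batch]


results = predict_valid_only(["CCO", "C1=CC(", "c1ccccc1"], toy_model)
print(results)
```

The point of the design is that the model never sees the malformed string, so one bad row cannot stop predictions for the rest of the file.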
I set up a VM to test proposed changes without affecting my normal ersilia environment.
@GemmaTuron, the pending issue I'm working on is the presentation and solution to
Tuesday 27th June: Tasks
eos2gth: https://github.com/ersilia-os/eos2gth/issues/10
eos24jm: https://github.com/ersilia-os/eos24jm/issues/4
eos7pw8: https://github.com/ersilia-os/eos7pw8/issues/9#issuecomment-1608928079
eos97yu: https://github.com/ersilia-os/eos97yu/issues/7
I updated model eos2gth and pushed the changes. There were some errors when running workflows, but these were corrected by Miquel because they were originating from ersilia. On checking model eos24jm, it was up to date because it was incorporated recently. Its images were also available on dockerhub. So, I just tested it and it was functional. I tested model eos97yu using Colab, CLI and Docker. It was working well. For model eos7pw8, changes were made but they were not yet reflected on dockerhub. I will test it again.
The proposal for dealing with wrong inputs was accepted, and this feature will be incorporated in ersilia, with Miquel leading the coding session on Wednesday, June 28 at 6pm CET.
Wednesday 28th June: Tasks
eos6hy3: https://github.com/ersilia-os/eos6hy3/issues/5
eos2hbd: https://github.com/ersilia-os/eos2hbd/issues/7
@HellenNamulinda I've added a second model in case you finish all the tasks!
Thursday 29th June: Tasks
eos3sa2: https://github.com/ersilia-os/eos3sa2/issues/1
eos4zfy: https://github.com/ersilia-os/eos4zfy/issues/4
eos7pw8: https://github.com/ersilia-os/eos7pw8/issues/9#issuecomment-1611784598
Hi @HellenNamulinda !
See my comment on model eos3sa2. I am afraid it is a work-in-progress model that was never finished, I am sorry for this!
Hello @GemmaTuron, Oh yes, I have read the comment. When we are done with cleaning the other models, I will be glad to investigate the model and if possible re-incorporate it.
I will first continue to the next model eos4avb.
Also, when I convert the suggested task to an issue, it unassigns me, for example on model eos6hy3. I hope you will be able to handle that.
Hi @HellenNamulinda I've fixed the task to issue thing !
Hello Gemma, Thank you! Due to power issues today, I'm handling the tasks late. But I will go through all my pending tasks before next week.
Hi @GemmaTuron, Monday 3rd July: Tasks
eos4avb: https://github.com/ersilia-os/eos4avb/issues/7. Made a PR: https://github.com/ersilia-os/eos4avb/pull/10
eos7asg: https://github.com/ersilia-os/eos7asg/issues/6. Created a PR: https://github.com/ersilia-os/eos7asg/pull/7
eos3cf4: https://github.com/ersilia-os/eos3cf4/issues/3
For model eos4avb, it involved changing rdkit to be installed using pip instead of conda so that the build for arm64 can be successful.
For model eos7asg, it requires installing java-jre using conda. But this needs a very low version of conda for the installation to work. So, commands to first downgrade conda, and to update it again after installing the packages, were added.
Hi @GemmaTuron, Tuesday 4th July: Tasks
eos7asg testing: https://github.com/ersilia-os/eos7asg/issues/6, https://github.com/ersilia-os/eos7asg/issues/5; created a PR: https://github.com/ersilia-os/eos7asg/pull/8
eos2thm: https://github.com/ersilia-os/eos2thm/issues/6
Hi @HellenNamulinda !
Good, I've assigned you a new model in case you finish debugging eos7asg.
Sure, I will work on it.
Hi @HellenNamulinda
I have assigned you a new model in case you are done with eos2thm, but don't worry if you don't get to it today.
Wednesday 5th July: Tasks
eos2thm: https://github.com/ersilia-os/eos2thm/issues/6
eos8a4x: https://github.com/ersilia-os/eos8a4x/issues/6
eos3zur: https://github.com/ersilia-os/eos3zur/issues/5
eos2rd8: https://github.com/ersilia-os/eos2rd8/issues/1
I spent time comparing rdkit versions for model eos2thm. I set up an environment with the packages specified in the original repo. This forced me to first downgrade conda in order to install rdkit 2019 with 200 descriptors (which messes up other packages). I identified the 8 descriptors that were added in the versions with 208 descriptors and are missing in 2019.03.
Comparing the model results, it turned out that all rdkit versions with 200+ descriptors gave the same output. I have yet to find out why :open_mouth:
@HellenNamulinda
How are you on the tasks today? I think you still have a couple of models for refactoring?
Hi @GemmaTuron, Yes, I have to finish these two models for refactoring, plus testing the one pending testing after changes by Emma. I'm done with eos8a4x but am finalizing the local testing. For model eos2thm, with the comment I just added, I just need your go-ahead.
Hello @GemmaTuron
eos2thm: https://github.com/ersilia-os/eos2thm/issues/6
eos8a4x: https://github.com/ersilia-os/eos8a4x/issues/6; created a PR, https://github.com/ersilia-os/eos8a4x/pull/7
eos7qga: https://github.com/ersilia-os/eos7qga/issues/4
eos3cf4: https://github.com/ersilia-os/eos3cf4/issues/3
The original model code for eos2thm had a file molfeaturizer.py where the 200 descriptors were explicitly specified. That's why using rdkit versions with 208 descriptors doesn't change the model output.
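To illustrate why an explicit descriptor list makes the output independent of descriptors added in later versions, here is a minimal sketch. It does not reproduce molfeaturizer.py: the three descriptor names stand in for the 200 listed there, `NewDescriptorA`/`NewDescriptorB` stand in for the 8 added ones, and all values are made up.

```python
from typing import Dict, List

# Stand-in for the explicit list in the model code (the real one has 200 names).
TRAINED_DESCRIPTORS: List[str] = ["MolWt", "MolLogP", "TPSA"]


def featurize(all_descriptors: Dict[str, float],
              names: List[str] = TRAINED_DESCRIPTORS) -> List[float]:
    """Pick only the explicitly listed descriptors, in a fixed order."""
    return [all_descriptors[name] for name in names]


def added_descriptors(old: Dict[str, float], new: Dict[str, float]) -> List[str]:
    """Names present in the newer version but missing in the older one."""
    return sorted(set(new) - set(old))


# Made-up values: what an older and a newer library version might compute.
older = {"MolWt": 1.0, "MolLogP": 2.0, "TPSA": 3.0}
newer = dict(older, NewDescriptorA=4.0, NewDescriptorB=5.0)

print(added_descriptors(older, newer))
print(featurize(older) == featurize(newer))
```

Because the feature vector is built by name, any extra descriptors in the newer version are simply never read, so the model input (and hence its output) stays the same as long as the listed descriptors themselves are computed identically.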
Hi @HellenNamulinda !
I think we can safely merge the PR on eos2thm. I've assigned you two new models!
Hi @GemmaTuron
eos8a4x: https://github.com/ersilia-os/eos8a4x/issues/6; made a PR, https://github.com/ersilia-os/eos8a4x/pull/8
eos2thm: https://github.com/ersilia-os/eos2thm/issues/6; created a PR, https://github.com/ersilia-os/eos2thm/pull/7
eos4u6p: https://github.com/ersilia-os/eos4u6p/issues/1
eos7a04: https://github.com/ersilia-os/eos7a04/issues/6
Hi @GemmaTuron
eos4u6p: https://github.com/ersilia-os/eos4u6p/issues/1; created a PR, https://github.com/ersilia-os/eos4u6p/pull/2
eos7a04: https://github.com/ersilia-os/eos7a04/issues/6
eos3ae6 using Colab, CLI and Docker: https://github.com/ersilia-os/eos3ae6/issues/6
eos1amr using Colab, CLI and Docker: https://github.com/ersilia-os/eos1amr/issues/4#issuecomment-1629822965
Both models tested (eos3ae6 and eos1amr) work well using Colab and Docker. Model eos1amr, despite working well with string inputs on the CLI, was raising TypeError: object of type 'float' has no len() for file outputs. Model eos3ae6 works well using all three.
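For context, this exact TypeError is what Python raises when `len()` is called on a float, which commonly happens when a missing cell has been read in as NaN. Whether that is the actual cause in eos1amr is an assumption here; the sketch below only reproduces the message and shows one possible guard, with `safe_len` being a hypothetical helper.

```python
import math
from typing import Any


def safe_len(value: Any) -> int:
    """Return len(value), treating missing values (None or float NaN) as length 0."""
    if value is None or (isinstance(value, float) and math.isnan(value)):
        return 0
    return len(value)


# A column where one entry is missing and has become a float NaN.
row = ["CCO", float("nan")]

try:
    lengths = [len(v) for v in row]
except TypeError as err:
    message = str(err)  # the same text as in the report above

print(message)
print([safe_len(v) for v in row])
```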
Done with refactoring model eos4u6p and created a PR.
For model eos7a04, I'm still fixing package version conflicts.
Hi @GemmaTuron
eos7a04: https://github.com/ersilia-os/eos7a04/issues/6, created a PR; https://github.com/ersilia-os/eos7a04/pull/7
eos6m4j using Colab, CLI and Docker: https://github.com/ersilia-os/eos6m4j/issues/3#issuecomment-1631652077
eos24ci using Colab, CLI and Docker: https://github.com/ersilia-os/eos24ci/issues/8#issuecomment-1631658191
The two models tested (eos6m4j and eos24ci) work well using Colab, CLI and Docker. However, model eos6m4j returns null for some SMILES on the CLI (both for a single string and when part of a file), yet it works well using Colab and Docker.
Thanks for the update @HellenNamulinda
Is there still the issue in eos6m4j where some molecules cannot be predicted? Also, please confirm that eos7w6n works now. I'll assign you new models meanwhile.
Hi @GemmaTuron
eos7w6n using Colab, CLI and Docker: https://github.com/ersilia-os/eos7w6n/issues/7#issuecomment-1633091343
eos2re5: https://github.com/ersilia-os/eos2re5/issues/4#issuecomment-1632869397
Model eos7w6n works well using Colab, CLI and Docker. However, for model eos2re5, the first value in the output (its column name: smiles) is null, probably because it is a string while the output type is Float.
Hi @GemmaTuron
eos1amr using CLI for file outputs: https://github.com/ersilia-os/eos1amr/issues/4#issuecomment-1632477537
eos5jz9: https://github.com/ersilia-os/eos5jz9/issues/2
eos9f6t: https://github.com/ersilia-os/eos9f6t/issues/1
eos238c: https://github.com/ersilia-os/eos238c/issues/1
@HellenNamulinda
Great, I will not be assigning new models since you have a few open already, good work.
Hello @GemmaTuron, Sorry for the late report. I thought I had added the Friday tasks already.
eos4cxk: https://github.com/ersilia-os/eos4cxk/issues/4
eos238c: https://github.com/ersilia-os/eos238c/issues/1, created a PR; https://github.com/ersilia-os/eos238c/pull/2
Hi @GemmaTuron
eos2lm8 using Colab, CLI and Docker: https://github.com/ersilia-os/eos2lm8/issues/4
eos9f6t: https://github.com/ersilia-os/eos9f6t/issues/1, created a PR; https://github.com/ersilia-os/eos9f6t/pull/2
eos43at: https://github.com/ersilia-os/eos43at/issues/1
eos69p9: https://github.com/ersilia-os/eos69p9/issues/2, created a PR; https://github.com/ersilia-os/eos69p9/pull/3
The model eos2lm8 makes predictions well using Colab, CLI and Docker.
The only concern is that output values for the same SMILES are not consistent across different runs (on the same platform) or across the three platforms.
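A check like this one could be used to flag which inputs give different values between runs. It is only a sketch: `inconsistent_inputs` is a hypothetical helper, the tolerances are arbitrary, and the numbers in the example are made up rather than taken from eos2lm8.

```python
import math
from typing import Dict, List, Sequence


def inconsistent_inputs(runs: Sequence[Dict[str, float]],
                        rel_tol: float = 1e-6,
                        abs_tol: float = 1e-9) -> List[str]:
    """Return the inputs whose predictions differ from the first run beyond the tolerance."""
    reference = runs[0]
    flagged: List[str] = []
    for smiles, expected in reference.items():
        for other in runs[1:]:
            if not math.isclose(other[smiles], expected,
                                rel_tol=rel_tol, abs_tol=abs_tol):
                flagged.append(smiles)
                break  # one mismatch is enough to flag this input
    return flagged


# Made-up outputs of two runs on the same two inputs.
run_a = {"CCO": 0.5, "CCN": 0.25}
run_b = {"CCO": 0.5, "CCN": 0.31}

print(inconsistent_inputs([run_a, run_b]))
```

Comparing with a tolerance rather than `==` avoids flagging tiny floating-point differences between platforms as inconsistencies.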
For model eos9f6t, the major error was related to the incompatibility of tensorboardX and protobuf. BentoML depends on protobuf<3.19,>=3.8.0, but installing chemprop==1.3.0 pulled in tensorboardx 2.6.1 because no compatible version was specified, which caused the dependency error: tensorboardx 2.6.1 requires protobuf>=4.22.3, but you have protobuf 3.18.3 which is incompatible.
So, I had to specify tensorboardX==2.0 because it is compatible with chemprop 1.3 and protobuf 3.18.3.
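The conflict can be checked by hand with a small comparison of the version numbers quoted above. This is a simplified sketch: `parse` and `satisfies` are hypothetical helpers that only handle plain dotted numeric versions, whereas real installers use a full version parser.

```python
from typing import Optional, Tuple


def parse(version: str) -> Tuple[int, ...]:
    """Turn '3.18.3' into (3, 18, 3); plain numeric versions only."""
    return tuple(int(part) for part in version.split("."))


def satisfies(version: str, lower: str,
              upper_exclusive: Optional[str] = None) -> bool:
    """True if lower <= version, and version < upper_exclusive when given."""
    v = parse(version)
    if v < parse(lower):
        return False
    if upper_exclusive is not None and v >= parse(upper_exclusive):
        return False
    return True


installed_protobuf = "3.18.3"

# BentoML's requirement as quoted above: protobuf<3.19,>=3.8.0
bentoml_ok = satisfies(installed_protobuf, lower="3.8.0", upper_exclusive="3.19")

# tensorboardx 2.6.1's requirement as quoted in the error: protobuf>=4.22.3
tensorboardx_261_ok = satisfies(installed_protobuf, lower="4.22.3")

print(bentoml_ok, tensorboardx_261_ok)
```

No protobuf version can satisfy both ranges at once, which is why the fix was to pin an older tensorboardX rather than to change protobuf.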
The only issue with the model is that, while run.sh returns consistent values, its output values for the same SMILES when served in ersilia are not consistent across different runs.
Model eos69p9 works well locally, and its output values are consistent.
Model eos43at uses rdkit 2019.3.3, which has 200 descriptors, and rdkit 2020+ versions don't have the same number. I'm still working on the best way to install it (it is outdated for new conda versions and also not available on PyPI) while maintaining the model output. I'm testing an approach that downloads the files using wget instead of downgrading conda.
eos43at: https://github.com/ersilia-os/eos43at/issues/1
eos9f6t, the inconsistent output: https://github.com/ersilia-os/eos9f6t/issues/1#issuecomment-1639645436, created a PR; https://github.com/ersilia-os/eos9f6t/pull/4
eos43at: https://github.com/ersilia-os/eos43at/issues/1#issuecomment-1643039571, submitted a PR, https://github.com/ersilia-os/eos43at/pull/3
eos7pw8, fetching from github/S3: https://github.com/ersilia-os/eos7pw8/issues/9#issuecomment-1640662381
eos6fza: https://github.com/ersilia-os/eos6fza/issues/1
When testing eos7pw8, the error persisted when fetching from S3/github and from repo_path (locally).
ERROR: The certificate of ‘anaconda.org’ has expired.
#8 ERROR: process "/bin/sh -c wget https://anaconda.org/LICH/syba/1.0.2.alpha/download/noarch/syba-1.0.2.alpha-py_0.tar.bz2" did not complete successfully: exit code: 5
I will try to explore more to come up with a solution.
Hi @GemmaTuron,
eos7pw8, fetching from github/S3: https://github.com/ersilia-os/eos7pw8/issues/9#issuecomment-1645482014, created a PR; https://github.com/ersilia-os/eos7pw8/pull/20
eos43at: Model test on PR failing, https://github.com/ersilia-os/eos43at/pull/3#issuecomment-1645579146, and yet the model works well locally
eos6fza: https://github.com/ersilia-os/eos6fza/issues/1
I spent some time resolving ERROR: The certificate of ‘anaconda.org’ has expired. with model eos7pw8 when using the CLI to fetch from S3, github and repo_path (locally). The model was using Mode: docker when installing packages; I configured the default to conda. All details are explained in the comment. Another issue with the model was TypeError: object of type 'float' has no len() for csv file outputs. More explanation on how it was resolved here
Also, to extract assays from the ChEMBL database that will be used to test the EnsembleTabPFN package, I successfully set up the chembl_ml_tools package, including a postgres database server containing the ChEMBL database. I installed the latest ChEMBL database.
I will now explore model eos43at more by testing it on Codespaces, plus refactoring model eos6fza.
@HellenNamulinda
Thanks, I won't assign new tasks so you can focus on the current ones!
eos43at: Model test on PR failing, https://github.com/ersilia-os/eos43at/pull/3#issuecomment-1645579146, and yet the model works well locally
eos6fza: https://github.com/ersilia-os/eos6fza/issues/1
eos77w8: https://github.com/ersilia-os/eos77w8/issues/6
eos7jlv: https://github.com/ersilia-os/eos7jlv/issues/1
eos6fza: https://github.com/ersilia-os/eos6fza/issues/1
eos77w8: https://github.com/ersilia-os/eos77w8/issues/6
@HellenNamulinda
I think you still have quite a few models pending to be refactored, which is probably my fault for assigning too many at once. Since you are also busy with the ChEMBL data, would you tell me which models you have not started to work on, so I might re-assign them and free up some of your tasks?
Hi @GemmaTuron, It's my fault for not updating the issues for some days now. Apologies. Let me ensure I complete all the pending tasks by the week's end.
eos9f6t: https://github.com/ersilia-os/eos9f6t/issues/1#issuecomment-1660067794; made a PR, https://github.com/ersilia-os/eos9f6t/pull/5
eos7jlv: https://github.com/ersilia-os/eos7jlv/issues/1#issuecomment-1660206051; created a PR, https://github.com/ersilia-os/eos7jlv/pull/3
eos77w8: https://github.com/ersilia-os/eos77w8/issues/6#issuecomment-1661382362
Model Testing:
eos2re5: https://github.com/ersilia-os/eos2re5/issues/4#issuecomment-1659699877
eos1amr: https://github.com/ersilia-os/eos1amr/issues/4#issuecomment-1659649943
eos1bba: https://github.com/ersilia-os/eos1bba/issues/21
eos4q1a: https://github.com/ersilia-os/eos4q1a/issues/10
Model Refactoring:
eos43at: https://github.com/ersilia-os/eos43at/issues/1#issuecomment-1659714552
eos935d: https://github.com/ersilia-os/eos935d/issues/6
[x] Explore and analyze the categorized ChEMBL data
Model Testing:
eos4b8j: https://github.com/ersilia-os/eos4b8j/issues/2
Model Refactoring:
eos935d: https://github.com/ersilia-os/eos935d/issues/6
@HellenNamulinda
Are you working on any other model aside from the ChEMBL data?
@GemmaTuron, There is no other model.
Perfect, let's use today's meeting to focus on the ChEMBL data then
eos6fza: https://github.com/ersilia-os/eos6fza/issues/3#issuecomment-1670706385
Hi @HellenNamulinda
In addition to the model testing and working on model eos96ia to help Riley, please:
Summary
Hello,
This is a public issue for a virtual daily stand-up. We will use this to briefly share the tasks of the day and the challenges and advances made, so that we can ensure smooth support from the Ersilia mentors and alignment between daily tasks and overall internship goals.
Scope
Initiative 🐋
Objective(s)
Internship goals:
Team
Timeline
Before starting your work, line up a few tasks with a short description. This should not take long. For example, it could be something like: Wednesday 21st June
Documentation
No response