ersilia-os / eos7a45

GNU General Public License v3.0
0 stars 3 forks source link

New model ready for testing! #12

Closed github-actions[bot] closed 1 year ago

github-actions[bot] commented 1 year ago

This model is ready for testing. If you are assigned to this issue, please try it out using the CLI, Google Colab and DockerHub and let us know if it works!

samuelmaina commented 1 year ago

@GemmaTuron , The model is being fetched successfully in both colab and CLI but the fetch time was very long in CLI. colab_eos7a45_output.log eos7a45_fetch_log.log

However, some some input smiles have outputs while others cause errors(status of 500). The failing smiles are forcing ersilia to repeat the prediction and this repeats for a long time as can be seen in this log files. This happens in CLI and Colab predictions. colab_eos7a45_prediction.log eos7a45_predict.log

GemmaTuron commented 1 year ago

I see, that is helpful, thanks @samuelmaina ! The model is in Docker as well, can you try it out? @emmakodes can you also try this?

Thanks!

samuelmaina commented 1 year ago

@GemmaTuron , the model is quite large and requires a lot of internet bundles. if the internet fails,the download starts a new. I am looking for a stable internet source and will test it. I am researching ways to reduce the amd64 build.

GemmaTuron commented 1 year ago

Hi @samuelmaina !

ok, let me know if you succeed otherwise we'll find a workaround

samuelmaina commented 1 year ago

@GemmaTuron, There are some smiles in which the model is throwing errors, @HellenNamulinda , can you look at this logs?

@GemmaTuron , The model is being fetched successfully in both colab and CLI but the fetch time was very long in CLI. colab_eos7a45_output.log eos7a45_fetch_log.log

However, some some input smiles have outputs while others cause errors(status of 500). The failing smiles are forcing ersilia to repeat the prediction and this repeats for a long time as can be seen in this log files. This happens in CLI and Colab predictions. colab_eos7a45_prediction.log eos7a45_predict.log

GemmaTuron commented 1 year ago

But the model runs fully without crashing? what is the output for the smiles that do not work?

GemmaTuron commented 1 year ago

please check with a 10 smiles file that contains one wrong

samuelmaina commented 1 year ago

But the model runs fully without crashing? what is the output for the smiles that do not work?

@GemmaTuron , The model did not finish predicting . I waited very long for the prediction to finish but it would show this log

23:38:29 | ERROR    | Status Code: 500
23:38:29 | WARNING  | Batch prediction didn't seem to work. Doing predictions one by one...

after every 15 minutes ( as can be seen in the prediction logs). It kept on trying the one by one processing with the same logs after every "Doing predictions one by one" iteration , so I though it was something like an infinite loop with ersilia.

HellenNamulinda commented 1 year ago

@samuelmaina and @GemmaTuron , Yes, it is true that the model is slow at making predictions. And I did say that the model size itself is very big. Probably why it doesn't accept batch size of 100 which is specified by ersilia. also, for an inference that fails on first try, it is repeated. Usually for the eml dataset(442 records), the model takes approx 1hr to make predictions.

So, this is the more reason to document some of these model statistics just like @miquelduranfrigola suggested.

HellenNamulinda commented 1 year ago

But the model runs fully without crashing? what is the output for the smiles that do not work?

@GemmaTuron , The model did not finish predicting . I waited very long for the prediction to finish but it would show this log

23:38:29 | ERROR    | Status Code: 500
23:38:29 | WARNING  | Batch prediction didn't seem to work. Doing predictions one by one...

after every 15 minutes ( as can be seen in the prediction logs). It kept on trying the one by one processing with the same logs after every "Doing predictions one by one" iteration , so I though it was something like an infinite loop with ersilia.

@samuelmaina, If you were using the eml dataset, remember it has 442 records, which means five batches. So, Batch prediction didn't seem to work. will happen five times. But it will do predictions one by one and complete all. just that it will take long. appx 1hr.

Try with a file with fewer molecules(like 10) as suggested by @GemmaTuron. Just like I said here, 5 records took appx 3minutes.

samuelmaina commented 1 year ago

Thanks very much @HellenNamulinda for the detailed explantion. I didn't know ersilia does prediction of 100 per batch. I also thought a status of 500 meant something went wrong with the model.

GemmaTuron commented 1 year ago

Hi @samuelmaina

Remember you can also look for the specific sentence in the Ersilia code and read more about it. In this case, if you use VSCode to search for "Batch prediction didn't..." you'll see the functions that manage this.

emmakodes commented 1 year ago

I have been trying to run this model all-day

CLI

DockerHub

emmakodes commented 1 year ago

For google colab, my code is still running, once its done I will post the result immediately

emmakodes commented 1 year ago

Google Colab

samuelmaina commented 1 year ago

I was able to pull the Docker image and make predictions using the eml_canonical data eos7a45_doc_predict.csv The log are : eos7a45_docker.log

I will rerun the CLI and Colab.