Open lucaskbobadilla opened 10 months ago
Ok, I restarted the kernel and it took care of the error. But now I am stuck in The Keras model (Deepswarm was very fast):
#################################################################################################
####################### RUNNING BINARY CLASSIFICATION ##################
#################################################################################################
#################################################################################################
############################## RUNNING DEEPSWARM ###########################
#################################################################################################
Conducting architecture search now...
Testing scrambled control now...
Fitting final model now...
#################################################################################################
############################## RUNNING AUTOKERAS ###########################
#################################################################################################
Conducting architecture search now...
Any ideas?
Hi Lucas, AutoKeras can be pretty slow, compared to DeepSwarm, depending on the resources available to you. You can see the time differences we found in our Figure S2.
Can I ask what your max_runtime_minutes is set to? I sometimes raise it to 180 and let AutoKeras run overnight. Furthermore, is your dataset really large? For huge datasets (e.g., >100K sequences), it might make sense to find an optimal architecture with a subset of the dataset and then train on the identified model architecture with all sequences. Lastly, you may need to play with the -shm-size
flag as in our installation guide (bullet point under step 5 in option 1).
That's only with the exemple 01 Jupyter notebook. That is why I think it is weird it is running so slow. My dataset will have around 70,000 sequences. I am running it in as a pod in Kubernetes with more enough resources.
Also, any specific reason to use Tensorflow 1.13? I am trying to rebuild an image with CUDA but the tensorflow 1.13 o ou works with CUDA 10 which does not have an NVIDIA image available anymore. Any ideas how to make bioatomated to be able to use GPU support?
Hi Lucas, AutoKeras should probably finish up in a few hours with that size dataset. Could you let me know if you are facing the same problem with TPOT, or is it just AutoKeras?
We used Tensorflow 1 because we started the project back in 2019, and it made the most sense at the time for the goal of balancing package dependencies. If you want to try to create a Dockerfile that works for Tensorflow 2 we'd love that and would definitely encourage a PR!
Hello,
I am trying to use a Docker image of Bioautomated to train a model. The Keras models is giving me an error as described below:
I think is some error with TensorFlow. Any ideas how to solve it?
Also, I updated the dockerfile to
FROM nvidia/cuda:11.7.1-cudnn8-devel-ubuntu22.04
instead ofFROM ubuntu:18.04
to be able to use CUDA GPU.Thanks!