Closed deniseduma closed 3 years ago
HI @deniseduma ,
we need to update the documentation slightly (@konstin). You can build the image from the Docker file you find in the root. This is usually something like docker built -t .
when you run the commant from the root of the repo. @konstin you might to expand on this one.
Alternatively, we started providing auto-built docker images, but we saw that the usage was rather low, so we now limited production to web-server docker images: https://github.com/orgs/bioembeddings/packages
We could add a general "bio-embeddings" image there so that it's easier for users like you to download (or maybe simply hijack the worker to do that?).
Anyway, let me have a quick discussion with @konstin and get back to you with easier instructions @deniseduma :)
Hi Christian,
Thank you very much for getting back to me!
I'm not familiar with Docker actually and I'd much rather prefer to use the pip installation which unfortunately doesn't work for me either!
I went back to trying to install bio-embeddings on the cluster I'm using (I'm in Munich as well, I work at Helmholtz Zentrum ) but I'm having some annoying dependencies problems, as in bio-embeddings downgrades the installation of PyTorch from 1.7.1 to 1.5.1 and it then complains it's too old for torchvision and also tries to upgrade NumPy system-wise which obviously fails. So I'm currently stuck...
Ok, I've managed to install the package with pip finally!
I won't need Docker after all but thanks for getting back to me, really appreciated!
Denise
Me again!
Sorry about this, but now I have another issue, I managed to install the package on a slum cluster and submitted the job to one of the GPU machines in the cluster but now I get the following error:
Traceback (most recent call last):
File "embed_seqs.py", line 8, in
Any idea how to fix this?
Thanks, Denise
Sorry for all the trouble with the missing docker image, I've totally missed this when making the 0.1.5 release.
I've now published ghcr.io/bioembeddings/bio_embeddings:v0.1.6
which can be used like this:
docker run --rm --gpus all \
-v "$(pwd)/examples/docker":/mnt \
-v bio_embeddings_weights_cache:/root/.cache/bio_embeddings \
-u $(id -u ${USER}):$(id -g ${USER}) \
ghcr.io/bioembeddings/bio_embeddings:v0.1.6 /mnt/config.yml
urllib.error.URLError: <urlopen error [Errno 101] Network is unreachable>
This sounds like the machine has no internet, thought I'm not sure we they'd block cluster nodes from making network requests.
Thanks for updating the docker image although I guess, I'll happily pass on using it! :p I'm not a big fan of Docker I guess! :p
Regarding, [Errno 101] Network is unreachable, the cluster admins never got back to me, but yes, it seems the cluster nodes don't have Internet access which is weird! Luckily the login nodes do, so I used those to download the weights!
Thanks, Denise
Hi,
I'm trying to use docker to run bio-embeddings and you say "We provide a docker image at rostlab/bio_embeddings"
Where is the docker image because I can't find it?
Also, in the command
docker run --rm --gpus all \ -v "$(pwd)/examples/docker":/mnt \ -u $(id -u ${USER}):$(id -g ${USER}) \ rostlab/bio_embeddings /mnt/config.yml
there is $pwd, where should I run the docker command from? Can you please provide a bit more details about how to run this?
Thanks, Denise