Closed pskvins closed 2 years ago
Hey, I'm unsure 🤔 a corrupted zip download? You can download the ZIP manually from here: http://data.bioembeddings.com/public/embeddings/embedding_models/bert/
Then unzip it using whatever system software you have.
last, in the config you just need to add a parameter model_directory
in the prottrans_t5_bfd_embeddings
stage: https://github.com/sacdallago/bio_embeddings/blob/develop/examples/parameters_blueprint.yml#L95
Let me know if this works
P.S.: why ProtBert? It's not the best performing model! ProtT5 is: https://github.com/agemagician/ProtTrans/blob/master/README.md#-comparison-to-other-protein-language-models-plms
Hi, I am trying to get embeddings of proteins, but I'm encountering an error as following
I installed bio_embeddings by
pip install bio-embeddings[all]
and ranbio_embeddings --overwrite embed.yml
. The content of the embed.yml is as following `global: sequences_file: /home/sukhwan/cluster_idr/sample.fasta prefix: sample_result simple_remapping: Trueprottrans_t5_bfd_embeddings: type: embed protocol: prottrans_bert_bfd reduce: True`
I tried again after removing the cache file of the bio_embeddings, but got the same error message. Do you know what can be the cause of this problem?