About prot_t5_xl_half_uniref50-enc on HuggingFace

mircare commented 2 years ago

I'd like to let you know that the instructions at https://huggingface.co/Rostlab/prot_t5_xl_half_uniref50-enc doesn't seem to work. I get the same error in 2 different python environment, even installing just PyTorch and the transformers library.

This is the code I run :

from transformers import T5Tokenizer, T5EncoderModel
import torch
import re

tokenizer = T5Tokenizer.from_pretrained('Rostlab/prot_t5_xl_half_uniref50-enc', do_lower_case=False)

model = T5EncoderModel.from_pretrained("Rostlab/prot_t5_xl_half_uniref50-enc", torch_dtype=torch.float16)

sequences_Example = ["A E T C Z A O","S K T Z P"]

seqs = [re.sub(r"[UZOB]", "X", sequence) for sequence in sequences_Example]

ids = tokenizer.batch_encode_plus(seqs, add_special_tokens=True, padding="longest")

input_ids = torch.tensor(ids['input_ids'])
attention_mask = torch.tensor(ids['attention_mask'])

with torch.no_grad():
    embedding_rpr = model(input_ids=input_ids,attention_mask=attention_mask)
emb_0 = embedding_repr.last_hidden_state[0,:6]
emb_1 = embedding_repr.last_hidden_state[1,:4]

This is the error I get:

Traceback (most recent call last):
  File "/home/torrisim/notebooks/test.py", line 19, in <module>
    embedding_rpr = model(input_ids=input_ids,attention_mask=attention_mask)
  File "/home/torrisim/miniconda/envs/test/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1110, in _call_impl
    return forward_call(*input, **kwargs)
  File "/home/torrisim/miniconda/envs/test/lib/python3.10/site-packages/transformers/models/t5/modeling_t5.py", line 1838, in forward
    encoder_outputs = self.encoder(
  File "/home/torrisim/miniconda/envs/test/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1110, in _call_impl
    return forward_call(*input, **kwargs)
  File "/home/torrisim/miniconda/envs/test/lib/python3.10/site-packages/transformers/models/t5/modeling_t5.py", line 1033, in forward
    layer_outputs = layer_module(
  File "/home/torrisim/miniconda/envs/test/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1110, in _call_impl
    return forward_call(*input, **kwargs)
  File "/home/torrisim/miniconda/envs/test/lib/python3.10/site-packages/transformers/models/t5/modeling_t5.py", line 716, in forward
    hidden_states = self.layer[-1](hidden_states)
  File "/home/torrisim/miniconda/envs/test/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1110, in _call_impl
    return forward_call(*input, **kwargs)
  File "/home/torrisim/miniconda/envs/test/lib/python3.10/site-packages/transformers/models/t5/modeling_t5.py", line 326, in forward
    forwarded_states = self.DenseReluDense(forwarded_states)
  File "/home/torrisim/miniconda/envs/test/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1110, in _call_impl
    return forward_call(*input, **kwargs)
  File "/home/torrisim/miniconda/envs/test/lib/python3.10/site-packages/transformers/models/t5/modeling_t5.py", line 289, in forward
    hidden_states = self.act(hidden_states)
  File "/home/torrisim/miniconda/envs/test/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1110, in _call_impl
    return forward_call(*input, **kwargs)
  File "/home/torrisim/miniconda/envs/test/lib/python3.10/site-packages/torch/nn/modules/activation.py", line 98, in forward
    return F.relu(input, inplace=self.inplace)
  File "/home/torrisim/miniconda/envs/test/lib/python3.10/site-packages/torch/nn/functional.py", line 1442, in relu
    result = torch.relu(input)
RuntimeError: "clamp_min_cpu" not implemented for 'Half'

I am unsure whether this is the most appropriate place, please let me know otherwise. Please also note that I assign the output of re to 'seqs' instead of 'sequences_Example' (as in the example on HuggingFace).

mheinzinger commented 2 years ago

Absolutely the right place to ask/comment, don't worry. The error you see is only related to running a half-precision model on CPU. This is currently not supported (yet?). If you execute the same code on google colab with a GPU, you should not see any error. You could "fix" the error by casting the model to full-precision (model=model.float()), or (recommended!) you run it on GPU.

We will adjust our repo accordingly to make this point more clear. Thanks for the heads-up!

Best, Michael

mircare commented 2 years ago

Thank you for the fast reply @mheinzinger. Indeed, moving model and input to the GPU solved it. Here is the updated code (there was a typo in line 21):

from transformers import T5Tokenizer, T5EncoderModel
import torch
import re

device = "cuda:0" if torch.cuda.is_available() else "cpu"

tokenizer = T5Tokenizer.from_pretrained('Rostlab/prot_t5_xl_half_uniref50-enc', do_lower_case=False)

model = T5EncoderModel.from_pretrained("Rostlab/prot_t5_xl_half_uniref50-enc", torch_dtype=torch.float16).to(device)

sequences_Example = ["A E T C Z A O","S K T Z P"]

seqs = [re.sub(r"[UZOB]", "X", sequence) for sequence in sequences_Example]

ids = tokenizer.batch_encode_plus(seqs, add_special_tokens=True, padding="longest")

input_ids = torch.tensor(ids['input_ids']).to(device)
attention_mask = torch.tensor(ids['attention_mask']).to(device)

with torch.no_grad():
    embedding_repr = model(input_ids=input_ids,attention_mask=attention_mask)
emb_0 = embedding_repr.last_hidden_state[0,:6]
emb_1 = embedding_repr.last_hidden_state[1,:4]

Let me add that I am a big fan of this project. Keep it up!

agemagician / ProtTrans

About prot_t5_xl_half_uniref50-enc on HuggingFace #81