Closed aiXander closed 3 years ago
Confirmed, running
python extract.py esm1_t6_43M_UR50S examples/some_proteins.fasta examples/representations/ --repr_layers 6 --include mean
works perfectly fine!
Hi Xander, thanks for calling that out. Did you try reducing eg --toks_per_batch 1022
? Most likely the issue is activations in the forward pass, even in no_grad
mode
(1022, allows +2 for bos/eos on longest sequence)
Thank you for such a quick reply!
python extract.py esm1b_t33_650M_UR50S examples/some_proteins.fasta examples/representations/ --repr_layers 33 --include mean --toks_per_batch 1022
did indeed work!
When trying to run
python extract.py esm1_t34_670M_UR50S examples/P62593.fasta examples/P62593_reprs/ --repr_layers 34 --include mean
I get:I'm guessing that the Colab GPU (a T4 with 15Gb of mem in my case) is unable to pull the entire model into memory? Anybody else running into this?