Open reelmath opened 2 years ago
Hey @reelmath, thanks for opening an issue. It seems you and @echarlaix managed to find the source of the problem.
We unfortunately don't have a lot of bandwidth to dive into fixing that code, so I'll add an `onnx` tag and a `Good second issue` tag so that experienced users know that this is an issue that could be fixed. If you'd like to try your hand at it, please go ahead!
Hi, I would like to work on this if it has not been assigned to anyone, but it could take some time, if that is OK?
Hey @yhl48, this would be great indeed :-)
Hello @reelmath, I was trying to reproduce your error with my setup as follows:
but I ran into the same errors as you.
It looks like the pretrained model is not available anymore?
Upon running the following line
model = ORTModelForSeq2SeqLM.from_pretrained("longt5-tglobal-base", from_transformers=True)
The following error was raised
OSError: longt5-tglobal-base is not a local folder and is not a valid model identifier listed on 'https://huggingface.co/models'
If this is a private repository, make sure to pass a token having permission to this repo with `use_auth_token` or log in with `huggingface-cli login` and pass `use_auth_token=True`.
@yhl48 I think you need to use the `google/long-t5-tglobal-base` name.
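For reference, the call from the comment above with the full Hub identifier might look like the sketch below. It assumes `optimum` is installed with ONNX Runtime support; the `from_transformers=True` argument is copied from the original call (newer `optimum` releases may expect `export=True` instead), and the helper name `load_longt5_onnx` is hypothetical.

```python
# Full Hub identifier suggested in the thread (organisation prefix included).
MODEL_ID = "google/long-t5-tglobal-base"


def load_longt5_onnx(model_id: str = MODEL_ID):
    """Hypothetical helper: export the checkpoint to ONNX and load it."""
    # Imported lazily so this snippet can be read and imported without optimum.
    from optimum.onnxruntime import ORTModelForSeq2SeqLM

    # from_transformers=True mirrors the call quoted in this thread.
    return ORTModelForSeq2SeqLM.from_pretrained(model_id, from_transformers=True)
```

Calling `load_longt5_onnx()` downloads the checkpoint and runs the ONNX export, so it needs network access and can take a while.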
Thanks @stancld!
Has this issue been resolved? I can no longer replicate the error.
System Info
- `optimum` version: 1.2.3 (installed via GitHub installation)
- `transformers` version: 4.20.1
Who can help?
@stancld @echarlaix @LysandreJik
Reproduction
LongT5 with TGlobal Attention isn't able to run sequences longer than `global_block_size * 2`. This is because, during model tracing, `num_globals > 0` is converted to `False`. I originally posted the error in Optimum (https://github.com/huggingface/optimum/issues/285), but @echarlaix asked me to open an issue here because this error concerns the ONNX export.
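To illustrate the mechanism, here is a toy, pure-Python sketch of how a trace can freeze a condition like `num_globals > 0`. It is not LongT5's actual code and does not reproduce the exact `global_block_size * 2` threshold; the `trace` function is a hypothetical stand-in for a real tracer, and the block size of 16 and the example lengths are assumptions chosen for illustration.

```python
def num_globals(seq_len: int, global_block_size: int) -> int:
    """Number of global blocks for a given sequence length."""
    return seq_len // global_block_size


def trace(example_seq_len: int, global_block_size: int):
    """Toy stand-in for tracing: evaluate the branch once, then freeze it."""
    # Evaluated with the example input only; the result is a plain Python bool.
    has_globals = num_globals(example_seq_len, global_block_size) > 0

    def traced(seq_len: int) -> int:
        # The comparison is NOT re-evaluated here: the recorded branch is replayed.
        if has_globals:
            return num_globals(seq_len, global_block_size)
        return 0

    return traced


# Trace with a short example input (assumed values), then run a longer one.
traced = trace(example_seq_len=8, global_block_size=16)
eager_long = num_globals(64, 16)   # recomputed for the real input
traced_long = traced(64)           # replays the branch frozen at trace time
```

Here the eager computation sees four global blocks for a length-64 input, while the traced version still behaves as if there were none, because the branch was decided on the short example input.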
Code to reproduce is below:
Expected behavior
The model should run on very long sequences with the default global block size without error.