Open caffeinetoomuch opened 2 years ago
It seems this does not happen with google/long-t5-tglobal-large
. Furthermore, I was actually able to load the model as ORTModelForSeq2SeqLM
by exporting the XL checkpoint myself and using from_transformers=False
. I think some decoder external files are overwritten when it is exporting decoder with past. So, in my case, when I was exporting, I used the separate folders for decoder and decoder with past.
System Info
Who can help?
@lewtun, @michaelbenayoun
Information
Tasks
examples
folder (such as GLUE/SQuAD, ...)Reproduction
Above code snippet causes the following exception:
Expected behavior
ONNX checkpoints of
encoder
,decoder
anddecoder with past
being generated!