Open hust-kevin opened 2 years ago
I was getting a similar error when trying to load google/long-t5-tglobal-xl using ORTModelForSeq2SeqLM. In my experience, these errors show up with larger models.
@hust-kevin did you ever succeed with this?
@nickchomey Do you have the same issue? I can try and have a look shortly.
@fxmarty I actually haven't tried yet. I was just browsing around for info related to using HF Optimum to convert NLLB to ONNX, and then optimize and quantize it. Then I'd like to compare its performance with CTranslate2 (NLLB-200 with CTranslate2) and hopefully eliminate a dependency.
I assume you have everything already set up to do an easy check, so it would be much appreciated if you could do so!
Feature request
Add support for converting NLLB to ONNX format.
Motivation
I want to convert NLLB to ONNX. I tried using ORTModelForSeq2SeqLM, but I get an error.
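For reference, the kind of call being attempted can be sketched roughly as follows. This is a minimal sketch, not the exact code from this report: the checkpoint name facebook/nllb-200-distilled-600M, the output directory, and the helper name export_nllb_to_onnx are assumptions chosen for illustration, and the export may fail in the way described above depending on the installed Optimum version. Older Optimum releases used from_transformers=True instead of export=True.

```python
def export_nllb_to_onnx(model_id="facebook/nllb-200-distilled-600M",
                        output_dir="nllb_onnx"):
    """Try to export an NLLB checkpoint to ONNX via Optimum (hypothetical helper).

    The default model_id and output_dir are illustrative assumptions,
    not values taken from the original report.
    """
    # Imported lazily so that defining this helper does not require
    # optimum[onnxruntime] or transformers to be installed.
    from optimum.onnxruntime import ORTModelForSeq2SeqLM
    from transformers import AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)

    # export=True asks Optimum to convert the PyTorch checkpoint to ONNX
    # while loading; this is the step that is reported to fail.
    model = ORTModelForSeq2SeqLM.from_pretrained(model_id, export=True)

    # Save the exported ONNX files and the tokenizer side by side so the
    # directory can be reloaded later without re-exporting.
    model.save_pretrained(output_dir)
    tokenizer.save_pretrained(output_dir)
    return model, tokenizer


if __name__ == "__main__":
    # Downloads the checkpoint and runs the export; this can take a while
    # and needs enough memory for the chosen model.
    export_nllb_to_onnx()
```

If the export succeeds, the saved directory should be loadable again with ORTModelForSeq2SeqLM.from_pretrained(output_dir), without passing export=True.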
Your contribution
None