Closed: daniil-lyakhov closed this issue 1 week ago
Thanks for your issue @daniil-lyakhov! We'll take a look as soon as a bit of bandwidth frees up cc @eustlb
Any update on this? I'm facing the same issue.
Regarding the cache, also cc @gante @zucchini-nlp @ArthurZucker
@daniil-lyakhov hey, I am not sure if compile is supposed to work out of the box with the ASR pipeline for Whisper. We recommend using StaticCache when compiling, and compiling with fullgraph, so that the model doesn't recompile on every forward call. Please take a look at this tutorial on Whisper + compile.
Currently I can work around this error with some changes in the model code, but as I am not very familiar with the ASR pipeline, we'll need more time to see what the correct fix is.
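For readers landing here, the recommendation above (static cache plus a fullgraph compile of the forward pass) can be sketched roughly as below. This is a minimal sketch, not the tutorial's exact code: the helper name `enable_static_cache_and_compile` and its `compile_fn` parameter are hypothetical and exist only for illustration, and it assumes a model object that exposes `generation_config` and `forward`, as Whisper models in transformers do. It does not address the ASR pipeline error itself.

```python
def enable_static_cache_and_compile(model, compile_fn=None):
    """Switch generation to a static KV cache and compile the forward pass.

    `compile_fn` is a hypothetical injection point (illustration only);
    it defaults to torch.compile when not provided.
    """
    if compile_fn is None:
        import torch  # imported lazily so the helper can be defined without torch

        compile_fn = torch.compile

    # A static cache keeps tensor shapes fixed between decoding steps,
    # which avoids a recompilation on every forward call.
    model.generation_config.cache_implementation = "static"

    # fullgraph=True asks the compiler to fail loudly on graph breaks
    # instead of silently falling back to eager execution.
    model.forward = compile_fn(
        model.forward, mode="reduce-overhead", fullgraph=True
    )
    return model


# Usage sketch (assumes torch and transformers are installed; the checkpoint
# name is only an example):
#
#   from transformers import WhisperForConditionalGeneration
#   model = WhisperForConditionalGeneration.from_pretrained("openai/whisper-tiny")
#   model = enable_static_cache_and_compile(model)
#   # The first few generate() calls are slow while compilation happens.
```

Only `model.forward` is compiled here, so `generate()` still drives decoding in eager Python and calls into the compiled forward at each step.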
@zucchini-nlp, thank you very much for your reply! That's exactly what I need 👍
Thanks for your issue @daniil-lyakhov and thanks @zucchini-nlp for pointing out some materials.
Closing for now since it seems solved. Let us know if that works for you.
System Info
transformers version: 4.46.2
Who can help?
@ylacombe, @eustlb
Reproduction
Hi there! I'm trying to use torch.compile to speed up inference of the Whisper model, but I cannot overcome the following error:
The expected output is something like:
But an error occurs:
Expected behavior
Could you please help me with that? Thank you!