[Web] `Error: [WebGPU] Kernel "[Conv] /text_encoder/encoder/layers.0/feed_forward/conv_2/Conv" failed. Error: FILTER_IN_CHANNEL should be equal to DATA_CHANNEL` #21108
Attempting to run Xenova/mms-tts-eng on WebGPU produces the following error:
Uncaught Error: [WebGPU] Kernel "[Conv] /text_encoder/encoder/layers.0/feed_forward/conv_2/Conv" failed. Error: FILTER_IN_CHANNEL should be equal to DATA_CHANNEL
Describe the issue
Attempting to run Xenova/mms-tts-eng on WebGPU produces the following error:
To reproduce
attention_mask
andinput_ids
as inputsNote that the WASM backend does not have this issue (the model runs correctly)
Urgency
Blocks vits/mms model usage in Transformers.js
ONNX Runtime Installation
Released Package
ONNX Runtime Version or Commit ID
1.18.0
Execution Provider
'webgpu' (WebGPU)