Open sasikr2 opened 2 months ago
@sasikr2 Would you mind trying again with Today's (10/15/2024) commit? There are updates under whisper/readme.md about different padding startegy. However, for offcial whisper, you can't remove 30s padding otherwise you would lose accuracy.
Okay, I will try today with updated code.
System Info
CPU Architecture: x86_64 GPU: NVIDIA A100-SXM4-40GB
TensorRT-LLM version: 0.14.0.dev2024091700
Who can help?
No response
Information
Tasks
examples
folder (such as GLUE/SQuAD, ...)Reproduction
Steps to reproduce:
Expected behavior
Expected output should be: Output: ['So basically what I observed is that word error rate are very high for Chinese language but character error rate seems to be good. Higher amplitude the WR is degrading and']
actual behavior
When passing packed audio segments, output comes to be empty. while it should matched with padded input.
additional notes
Can you check once script, the way of sending packed input. OR it is some issue in c++ binding.