Open nshmyrev opened 11 months ago
another strange thing that it is not used in streaming:
The reason is that during streaming inference, the input chunk size is fixed, whereas it is a variable in the non-streaming case.
removing is_onnx=True doesn't seem to affect the accuracy
If you change the input wave duration, i.e., use a larger value than the initial length of the positional encoding vector, you will get an error, I think.
When you export non-streaming zipformer2 model to onnx using export-onnx script, the result model causes many warnings in sherpa:
the source is this commit:
https://github.com/kakashidan/icefall/pull/2
another strange thing that it is not used in streaming:
https://github.com/k2-fsa/icefall/blob/master/egs/librispeech/ASR/zipformer/export-onnx-streaming.py#L680
compare to non-streaming
https://github.com/k2-fsa/icefall/blob/master/egs/librispeech/ASR/zipformer/export-onnx.py#L527
removing is_onnx=True doesn't seem to affect the accuracy