ictnlp / DiSeg

Source code for ACL 2023 paper "End-to-End Simultaneous Speech Translation with Differentiable Segmentation"
MIT License

fatal error about agent #2

Open Zoe0713 opened 4 months ago

Zoe0713 commented 4 months ago

I tried to reproduce your work and found that following your script exactly does not give the results reported in the paper; the gap is especially large at low latency. One thing I noticed is that when lagging_segment is 1, diseg_agent does not read the whole audio. On one sample with a total length of 14 s, the agent stops reading at around 1.3 s because its states.finish_hypo() becomes True.
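To make the suspected failure mode concrete, here is a toy sketch of it. This is not the DiSeg or SimulEval code: `ToyStates`, `run_toy_agent`, and the rule for when EOS is emitted are all hypothetical stand-ins. It only shows how a read/write loop that exits as soon as `finish_hypo()` is True will leave most of the audio unread if the model emits an end-of-sentence token too early.

```python
from dataclasses import dataclass, field

EOS = "</s>"


@dataclass
class ToyStates:
    """Hypothetical stand-in for the agent state (not the real SimulEval class)."""
    source_read_ms: int = 0
    hypothesis: list = field(default_factory=list)

    def finish_hypo(self) -> bool:
        # Toy rule (assumption): the hypothesis is finished once EOS is written.
        return bool(self.hypothesis) and self.hypothesis[-1] == EOS


def run_toy_agent(total_audio_ms: int, chunk_ms: int, eos_after_ms: int) -> int:
    """Read audio chunk by chunk and return how many ms were consumed.

    The toy "model" emits EOS as soon as eos_after_ms of audio has been read,
    which simulates a premature end-of-sentence prediction.
    """
    states = ToyStates()
    while states.source_read_ms < total_audio_ms and not states.finish_hypo():
        # READ: consume one more chunk of audio.
        states.source_read_ms = min(states.source_read_ms + chunk_ms, total_audio_ms)
        # WRITE: emit one token; EOS once the threshold has been reached.
        token = EOS if states.source_read_ms >= eos_after_ms else "tok"
        states.hypothesis.append(token)
    return states.source_read_ms


# A 14 s utterance where EOS is emitted after 1.3 s of audio.
print(run_toy_agent(total_audio_ms=14000, chunk_ms=100, eos_after_ms=1300))
```

In this toy setting the loop exits after 1300 ms of a 14000 ms input, which matches the pattern I see. Whether the real agent stops for the same reason is only my guess.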