Open ysapolovych opened 1 year ago
Yes. If a try 60 sec Audio, get 20 sec transcribing, If I send 20 seconds , get 10 sec. Transcribied. If send 10 seconds audio, get 5 seconds transcribed ?
Anything on this?
Thank you
meet the same question, someone konw this?
My issue seems very similar to https://github.com/facebookresearch/seamless_communication/issues/83 , but I am using Translator Python API + ASR task. My input is 30 seconds long, and I get about half of it transcribed:
I wonder if params
text_max_len_a
,text_max_len_b
,unit_max_len_a
, andunit_max_len_b
ofpredict
method somehow contribute to that (alas, they are undocumented). Playing with them, however, did nothing.