-
**Describe the bug**
Audio generated for the `gu-IN` locale using the voice `gu-IN-DhwaniNeural` contains about 3 seconds of silence at the end of the audio file. The same generation, performed using `gu-IN-NiranjanNe…
-
Hi,
I'm currently trying to replicate the performance of Qwen2-Audio on the AIR Bench. However, I noticed that the repository at [AIR-Bench](https://github.com/OFA-Sys/AIR-Bench/blob/main/score_cha…
-
First off -- AMAZING TTS!!!
I know I'm repeating several other issues that have been opened, but I've spent several days testing and code tweaking to try to resolve the issues I have found, and wan…
-
TPAC 2024 is planned to be held in Anaheim from 23 to 27 September 2024. These will be hybrid meetings with in-person and remote attendance.
The Media & Entertainment Interest Group plans to meet o…
-
Hi,
Thanks for the wonderful work.
I have a question regarding generating virtual rooms with customizable microphone and speaker placements, similar to Fig. 2 in the paper.
Is this feature a…
-
```
~ via 🐍 v3.11.9 (insanely-fast-whisper)
❯ pipx list
venvs are in ~/.local/pipx/venvs
apps are exposed on your $PATH at ~/.local/bin
manual pages are exposed at ~/.local/share/man
package …
-
Hi, I want to ask: what are the values `self.v_token_id = 15167`, `self.q_token_id = 16492`, `self.a_token_id = 22550`, and `self.nl_id = 13` in the tokenizer based on? Or why is the value of `v_token_id` set …
-
### System Info
- `transformers` version: 4.44.0
- Platform: Linux-5.15.0-116-generic-x86_64-with-glibc2.35
- Python version: 3.10.12
- Huggingface_hub version: 0.24.5
- Safetensors version: 0.…
-
## Description
While f3rr0-C47 tells you how to build from git, it does not tell you how to run it. I am not here to learn yet another build system in depth and after glancing at the poetry docs and tr…
-
A new request, perhaps for the next generation (version 2) of subtivals:
a video/audio player that plays media inside the program to simulate the subtitling project.