-
**Is your feature request related to a problem? Please describe.**
I want to have a Github pipeline or automation that allows me to check if there are issues with Friend.
1. General obvious user flow…
-
**Description**:
We currently have news full audio and corresponding news transcript. We would like to get the news text and audio data split into segments to train our STT and TTS model.
**Implemen…
-
python -m recognition.alignment --recognition_path "./datasets/son/recognition.json" --score_threshold=0.5
이 코드를 입력 후 list index out of range 오류가 떠서 recognition.json에서 하기 텍스트를 삭제하였습니다.
NB11653121 …
-
"In practice, to save GPU memory, we do not load all Encoders directly onto the GPU but instead load the extracted features“
Does it mean we don't need modality encoder, we already have the llama inp…
-
Hi!!!
I have the following error training the model :
```
File ".\ubm.py", line 202, in
ubm.train()
File ".\ubm.py", line 50, in train
iterations=(1, 2, 2, 4, 4, 4, 4, 8, 8, 8, 8, 8, 8)
File "…
-
Can you use https://github.com/WeidiXie/VGG-Speaker-Recognition or your re-implementation (https://github.com/taylorlu/ghostvlad-speaker) to reproduce pretrained speaker model ? If yes, how?
-
Cannot build the GUI from the dockerfile:
[WARNING]: Empty continuation line found in:
RUN useradd -m $USERNAME && echo "$USERNAME:$USERNAME" | chpasswd && usermod --shell /bin/bash $USE…
-
Hi
I am sending my wav file to api see below
RequestMessage {Method: POST, RequestUri: 'https://westus.api.cognitive.microsoft.com/spid/v1.0/identificationProfiles/034f5efa-a536-4d9b-a90f-38686…
-
# Task Name
African American Vernacular English (AAVE) Speech Recognition
## Task Objective
Mainstream speech recognition systems often perform poorly on non-standard dialects and sociolects,…
-
Steps to reproduce
------------------
Expected behaviour
------------------
I am trying to build a voice activated control system that would be integrated with Arduino
Actual behaviour
-----…