-
We are currently calling Google Cloud transcription via the speech recognition package.
This may break if the audio exceeds 1 minute:
https://cloud.google.com/speech-to-text/docs/async-recognize#s…
dger1 updated
5 years ago
-
# Task Name
Musical Style Transformation
## Task Objective
The goal of this task is to transform a given music piece from one musical genre to another, preserving the original melody and lyri…
-
By design, AdVent cannot, and probably will never work 100% error free. When it makes mistakes, usually some kind of manual action on console or on TV control is needed. To facilitate its use without …
-
* [x] app-audio (-like) waveform source
* [ ] randomize color palette button (mb slight animated color shift)
* [ ] baseline highlight
* [x] ~~download generated optimized static font (based on se…
-
My input file is 16 bit PCM wav file.
Does google has limits for number of attempts per day/hour...
Traceback (most recent call last):
File "ASR_speech.py", line 17, in
print("The audio f…
-
### Feature request
The PR https://github.com/huggingface/transformers/pull/21754 adds the PyTorch version of `WhisperForAudioClassification`. It would be great to add the TensorFlow equivalent.
###…
-
When I'm run
python examples/mms/asr/infer/mms_infer.py --model "/path/to/asr/model" --lang lang_code --audio "/path/to/audio_1.wav" "/path/to/audio_1.wav"
I got this error:
>>> preparing tmp man…
-
I used the web app for aligning.
I found that 54 % of phrases in my test set were misaligned.
Misaligned meaning
1. Cut too early from the end
2. Cut too late from the previous fragment
3. …
-
Hello sir i have a problem with the Jarvis coding. My code takes in input, recognizes input but doesn't give out voice output
import pyttsx3 #pip install pyttsx3
import speech_recognition as sr
i…
-
We currently have no specific plans for making the next revision of Tympan. But, if we were to make another revision someday, what hardware changes would we like to have?
@biomurph, @eyuan-creare,…