-
# Aim
The aim of the AI Bot project is to create a voice assistant that can understand user input, generate responses using the OpenAI API, and provide both textual and auditory feedback.
# Detail…
-
Hi, after reading the paper, I am confused about the table 3.
What is the meaning of visual acc, audio acc and combine acc?
How did you calculate the result of 67.5%, 91.8%, 95.2%?
![default](http…
-
### Description
What Does RMSEnergyExtractor Do?
Calculates RMS Energy:
RMS energy is a measure of the power of an audio signal. It is computed as the square root of the average of the squared …
-
hi
i want speech recognition using sphinx but as the accuracy of sphinx is not good at all. so need to decode the wav file but getting some error into the file....can any one please help me on this.
…
-
Hi,
This is a bug or error, maybe.
This [line](https://github.com/SlapBot/stephanie-va/blob/master/Stephanie/AudioManager/audio_recognizer.py#L50) passes the key as json file. And therefore [speec…
-
Hi!
I am trying to fine tune the TATR model with a proprietary dataset. I am currently trying to convert the dataset to the same format as FinTabNet and then using the script in this repository (s…
-
[MSR-86K: An Evolving, Multilingual Corpus with 86,300 Hours of Transcribed Audio for Speech Recognition Research](https://arxiv.org/pdf/2406.18301)
The above paper has just open-sourced a dataset fo…
-
Steps to reproduce
------------------
1. (How do you make the issue happen? Does it happen every time you try it?)
2. (Make sure to go into as much detail as needed to reproduce the issue. Postin…
-
Hi Memo
Would be interesting to have a demo of audio gesture recognition with ofxMSATensorFlow. not necessarily only speech, but any sort of audio musical and non-musical gesture (both deterministi…
ghost updated
5 years ago
-