-
@n8fr8 what are the good choices Haven made? Which frameworks are still good and maintained?
I found those:
- OpenCV: OpenCV is a library of programming functions mainly aimed at real-time computer…
-
Steps to reproduce
------------------
1. (How do you make the issue happen? Does it happen every time you try it?)
This issue happens upon running the python script containing the following code
…
-
# Task Name
Emoji-Grounded Speech Emotion Recognition
## Task Objective
The primary goal of the Emoji-Grounded Speech Emotion Recognition (EG-SER) task is to develop a system that can accurat…
-
## Introduction
We can envision and consider client-side, server-side and third-party speech recognition, synthesis and translation scenarios for a next version of the Web Speech API.
## Advanci…
-
**Description**
Provide us with a how to solution to extract periodic screen frames and audio for speech recognition in real-time from a Jitsi WebRTC call. The extracted frames and audio would be p…
-
[output.zip](https://github.com/user-attachments/files/16495935/output.zip)
[speechlog.log](https://github.com/user-attachments/files/16496103/speechlog.log)
Hi there,
zip of .wav file and log …
-
您好,论文中有提到 “Output Instruction: Lastly, we provide output instruction to further specify the task and desired format for different subtasks, and then the text output begins.”
以下这些Output Instruction在…
-
## Inspiration
So there is a gradio space [https://huggingface.co/spaces/hf-audio/whisper-large-v3](url) that uses whisper, from the hugging face api :
```python
import spaces
import torch
…
240db updated
1 month ago
-
I'm developing chatbot with Ionic.
We need a property in this plugin which automatically stops recognition after speech end like Android. And a solution for realtime text displays of recognition on…
-
[comment]: (Just paste "x" inside brackets, for example: - [x] Some positive statement)
**What problem are you facing?**
- [ ] audio isn`t recorded
- [ ] audio is recorded with artifacts
- [ …
ghost updated
1 month ago