-
I was looking at this speech. It had really strange line breaks in the middle of the text when viewed in the Swedeb interface.
{"id":"prot-1909--ak--024_026","gender":"Man","party":"S","year":1909,"s…
-
This project demonstrates a very good way to tokenize speech with different features, such as style and pitch tokens, that give downstream applications fine-grained control of the generati…
-
Hi,
I am currently running the speech-to-speech pipeline on an AWS EC2 instance (Ubuntu 20.04) with an NVIDIA A10G GPU. The pipeline works well, but I am experiencing around 1 second of latency, an…
-
The Swedeb API fails when requesting speeches with an unknown speaker.
The error is raised by the Pythonic API, which encounters NaN values in the returned speaker and party fields.
To resolve this i…
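
The proposed fix above is cut off, so the following is only a minimal sketch of one common way to guard against this kind of failure, not the resolution used in Swedeb. It assumes each speech is a plain dict with `speaker` and `party` keys; the `UNKNOWN` label and both function names are hypothetical.

```python
import math

# Hypothetical placeholder label; not taken from the Swedeb codebase.
UNKNOWN = "Unknown"


def is_missing(value):
    """Return True if value is None or a float NaN."""
    return value is None or (isinstance(value, float) and math.isnan(value))


def clean_speech_record(record, fields=("speaker", "party")):
    """Return a copy of record where missing values in `fields` become UNKNOWN.

    A field that is absent from the record is treated as missing too.
    The input dict is left unchanged.
    """
    cleaned = dict(record)
    for field in fields:
        if is_missing(cleaned.get(field)):
            cleaned[field] = UNKNOWN
    return cleaned
```

Applying a step like this before the record is validated or serialized would keep a speech with an unknown speaker from failing the whole request.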
-
Integrate [Parler-TTS](https://github.com/huggingface/parler-tts) into senselab's text-to-speech API.
-
## Goal
Set up speech instruction fine-tuning to make Ichigo better at conversation.
## Tasklist
- [ ] Check the data generation pipeline: https://github.com/collabora/WhisperSpeech
- [ ] Expe…
-
A Speech Emotion Recognition (SER) system is defined as a combination of different frameworks that analyze audio signals to identify emotions. We can use one or combine other parts to r…
-
If the user is using on-device recognition, we do not need to prompt them for speech recognition permission.
> Speech data from this app will be sent to Apple to process your requests. This wi…
-
![image](https://github.com/user-attachments/assets/fda027e3-f1c9-4bc8-b7d3-af5fee31cb97)
Section 1, Speech-Analysis, Word Frequency Tracking, Taser, Java/Kotlin, Android app, Speech Pattern Analysis…
-
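An AI bot built with this pattern tends to @-mention the wrong person in group chats. Put differently, under certain conditions the target of the @ does not switch when the person being addressed in the group chat changes. To fix this, constraints should be added where memory storage is invoked and where the message sender is identified, so that the AI does not repeatedly @ the same group member and annoy them.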
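
A minimal sketch of one such constraint, assuming the bot receives each message with a group ID and a sender ID. All names here (`IncomingMessage`, `GroupChatMemory`, `build_reply`) are hypothetical and not taken from the project. The idea is to key memory by group and sender, and to derive the @ target only from the message being answered, never from stored memory.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class IncomingMessage:
    """Hypothetical message shape: which group, who sent it, and the text."""
    group_id: str
    sender_id: str
    text: str


class GroupChatMemory:
    """Stores history per (group, sender) so users do not share one memory."""

    def __init__(self):
        self._history = {}

    def remember(self, message):
        key = (message.group_id, message.sender_id)
        self._history.setdefault(key, []).append(message.text)

    def recall(self, message):
        # Only the history of the current sender in the current group is returned.
        key = (message.group_id, message.sender_id)
        return list(self._history.get(key, []))


def build_reply(message, reply_text):
    """Mention the sender of the message being answered, not a remembered user."""
    return f"@{message.sender_id} {reply_text}"
```

Because the mention is rebuilt from the current message on every turn, a change of speaker in the group changes the @ target as well.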