-
##### Environment
* Link to playable MPD file: https://live11-dash-preprod-he2.v-o.staging.cdn.orange.com/MnM3Syr0m1M-33yU8dNNYg/1733011200/428467/506237/54/cmaf/r11_pre10_ott/dash_high.mpd
* Dash.j…
-
Give the user the ability to click an icon, speak, and have their voice transcribed to text
- [x] Create small use case example
- [ ] Update IAM permissions to allow Transcribe access
- [ ] In…
-
Hi!
I was trying to fine-tune the model on my dataset, but I couldn't work out how to structure the dataset.
I've performed all tasks mentioned in [data preparation](https://github.com/mbz…
-
### Description
We aim to evaluate the effectiveness of our transfer text function and the LLM-generated corrected transcript in improving the quality of training data. This analysis will focus on th…
-
# Description
It's not a bug, but a much-needed feature.
If reporting a bug, please fill out the following:
### Environment
- pipecat-ai version: 0.0.48
- python version: 3.11
- OS: Window…
-
`def generate_podcast_from_transcript(user_id, topic):
# Generate podcast from URL
config = {
'conversation_style': ['Engaging', 'Fast-paced', 'Enthusiastic', 'Educational'],
'rol…
-
Is it not possible to transcribe long audio files of around 3 hours? I am trying to transcribe a 3-hour audio file to Hindi, but it uses a huge amount of memory.
```
import torch
import nemo.collections.asr as …
-
### Checks
- [X] This template is only for usage issues encountered.
- [X] I have thoroughly reviewed the project documentation but couldn't find information to solve my problem.
- [X] I have searche…
-
### Steps to reproduce
case 'playyy': {
if (args.length < 1) return reply("Insira o comando, e em seguida um nome para a pesquisa!");
const { Innertube } = require('youtubei.js');
co…
-
Hugging Face hosts most models in other formats.
For example, the audio-to-text/text-to-audio model facebook/seamless-m4t-v2-large is in .safetensors format: https://huggingface.co/facebook/seamles…