-
This is a use case to explore what Scripture Burritos would be created and how they relate to each other. The purpose is to explore how SBs are modeled, looking at ontologies and relationships.
1.…
-
[ to do ]
- implement this feature—will use stand-alone STT for now
- develop logic for identifying content of "about" messages
- respond to messages
- create blue-mix version of this bug (for far…
-
The Koboldcpp app is amazing. The only issue I see is the TTS occurs after the text is finished which takes forever. Is there a way to have the TTS occur as the text is being outputted to reduce the d…
-
## Overview
I have attempted to reproduce the zeroshot classification results for ESC-50 outlined in the publication [Large-scale contrastive language-audio pretraining with feature fusion and keywor…
-
Very amazing and great repository, I have a suggestion and an inquiry when can Video/Audio/Files/Documents support will be added like in AI google studio etc.? Currently it only support images and tex…
-
## Bug report
### Describe the bug
Live Stream Video or MKV files with audio AAC codecs is out of sync.
Change : last build from 05/12/20023: mkv with aac codec from local storage (nfs) play…
-
### Description
> I would like to request the extension of the Time-Series Foundation Model to support non-tabular data types, such as RGB images, text, and sound. This would allow the model to handl…
-
Hi
Linux - ubuntu 22.04
Tested commit 8fac645 - microphone is not passing audio to talk-llama , older builds ( from a month passing microphone audio and transcribe text from audio )
I also teste…
-
Add Read-Aloud Functionality for Storybooks
Description: It would be highly beneficial to introduce a "read-aloud" feature that allows children to listen to storybooks being narrated in a natural, …
-
### Self Checks
- [X] I have searched for existing issues [search for existing issues]([https://github.com/langgenius/dify/issues](https://github.com/fishaudio/fish-speech/issues)), including closed …