-
The currently followed architecture of is still too closely bound to traditional NLU based voice interaction concepts. While it aimed at including LLM with speech, LLM with multimodality, ... it is po…
-
Hello
I am trying to make Mp4s with audio Mp3. When I run your layers_audio example, I can make GIFs and Mp4s if I set enable audio to false. If I set enable audio to true and make step3, the step3 t…
-
hi,
this is more a question then an issue -
i'm looking for a way to extract features from raw audio wav files and then use these features for different tasks such as voice recognition, voice activ…
-
I found that SampleRNN need to be run in parallel to get fast generation speed. It takes only about 500 seconds for generating 200 utterances, each with a length of 8 seconds speech. But it will be ve…
-
### Description
Currently while working with audio, sound, signals, or anything related to waveforms in general,
On uploading a run to `wandb.Table` for [audio](https://docs.wandb.ai/guides/tables…
-
## Detailed Description
Diffusion models seem to be quite useful for a lot of image generation and high detail and much easier training than for GANs as generative models, e.g. StableDiffusion. Thi…
-
### Is there an existing issue for this?
- [X] I have searched the existing issues.
### Which plugins are affected?
Other
### Which platforms are affected?
Android
### Description
…
-
Hey guys,
Great work! I wanted to ask kindly for an update on the training code and the weights if possible. Would love to recreate your work :)
Looking forward!
Best
-
It would be a game changer if Juno would get audio and MIDI support. This, together with probably other musical tools would make Juno a tool for musical education or electronic music synthesis and com…
-
[This paper](https://papers.ssrn.com/sol3/papers.cfm?abstract_id=4189457) is connected to the new minister of communications ([Sattar Hashemi](https://x.com/HashemiSattar)) in Iran: https://x.com/ircf…
irgfw updated
3 weeks ago