-
### Feature Name
Research about Stability.ai
### Feature Description
This is research on Stability.ai: its supported models, how it is used, and more.
### Motivati…
-
Please add file upload support (text, images, PDF, etc.).
-
I'm trying to implement a basic audio visualizer for mpv using an ffmpeg filter:
`[a]showwaves=mode=line:s=hd480:colors=White[v]`
The only problem is, when mpv opens an audio file it doesn't initial…
-
Hi there
First of all, amazing project!
I was wondering what the expected latency is for a short audio clip (2-5 seconds).
Is it instant? Less than a second?
Wondering if this can be used in a realtim…
-
The bouncy source is stuck in a loop of the same ~1 second of audio and doesn't escape this back-and-forth movement pattern.
[Issue - Glitched loop.m4v.zip](https://github.com/mitchmindtree/beyond_perceptio…
-
I am trying to make Godot send OSC messages to SuperCollider, but I can't make it work.
Is it possible to include an example program in a separate repo? https://reddit.com/r/godot/comments/1b2lzle…
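
For reference while debugging, here is a minimal sketch of what an OSC 1.0 message looks like on the wire. It is written in Python rather than GDScript and is only an illustration, not a Godot example. The helper names (`osc_pad`, `osc_message`, `send_osc`) and the `/godot/test` address are hypothetical. Port 57120 is the default for sclang, but verify it on your machine with `NetAddr.langPort`.

```python
import socket
import struct


def osc_pad(data: bytes) -> bytes:
    """Null-terminate and pad to a multiple of 4 bytes, as OSC strings require."""
    data += b"\x00"
    return data + b"\x00" * (-len(data) % 4)


def osc_message(address: str, *args) -> bytes:
    """Encode an OSC message with int32, float32, and string arguments."""
    type_tags = ","
    payload = b""
    for arg in args:
        if isinstance(arg, bool):
            raise TypeError("bool arguments are not handled in this sketch")
        if isinstance(arg, int):
            type_tags += "i"
            payload += struct.pack(">i", arg)  # big-endian int32
        elif isinstance(arg, float):
            type_tags += "f"
            payload += struct.pack(">f", arg)  # big-endian float32
        elif isinstance(arg, str):
            type_tags += "s"
            payload += osc_pad(arg.encode("ascii"))
        else:
            raise TypeError(f"unsupported OSC argument type: {type(arg).__name__}")
    return osc_pad(address.encode("ascii")) + osc_pad(type_tags.encode("ascii")) + payload


def send_osc(host: str, port: int, address: str, *args) -> None:
    """Send one OSC message in a single UDP datagram."""
    with socket.socket(socket.AF_INET, socket.SOCK_DGRAM) as sock:
        sock.sendto(osc_message(address, *args), (host, port))


# Example (assumes sclang is listening on its default port):
# send_osc("127.0.0.1", 57120, "/godot/test", 440.0, 1)
```

On the SuperCollider side, an `OSCdef` registered for the same address should print the incoming values. If nothing arrives, comparing the bytes the Godot sender emits against the output of `osc_message` can help isolate whether the problem is encoding or networking.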
-
Here is a list of papers using this code
1. [A Time-Frequency Generative Adversarial based method for Audio Packet Loss Concealment](https://arxiv.org/abs/2307.15611)
2. [IANS: Intelligibility-awa…
-
## Goal
Create a benchmark dataset of audio files to support the evaluation of deepfake detection tools.
## Overview
During the first quarter of launch of DAU, a trend that has emerged is the p…
-
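## In a nutshell
A study that generates audio not in the usual amplitude × time representation but in a frequency × time representation (a spectrogram). Because the two axes of a spectrogram have different meanings, naively convolving over it with a CNN is not suitable. Instead, the model encodes the time direction and the frequency direction separately (with RNNs) and generates from those encodings.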
![image](https://user-images.githubusercontent.com/544…
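
To make the frequency × time representation above concrete, here is a minimal sketch of a magnitude spectrogram computed with a short-time Fourier transform, using only the Python standard library. It is not the paper's preprocessing: the function name `stft_magnitude`, the Hann window, and the frame and hop sizes are illustrative assumptions, and the naive DFT is chosen for readability rather than speed.

```python
import cmath
import math


def stft_magnitude(signal, frame_len=64, hop=32):
    """Return a magnitude spectrogram indexed as spec[time_frame][frequency_bin]."""
    # Periodic Hann window to reduce spectral leakage at frame edges.
    window = [0.5 - 0.5 * math.cos(2 * math.pi * n / frame_len) for n in range(frame_len)]
    n_bins = frame_len // 2 + 1  # non-negative frequencies only
    spec = []
    for start in range(0, len(signal) - frame_len + 1, hop):
        frame = [signal[start + n] * window[n] for n in range(frame_len)]
        row = []
        for k in range(n_bins):
            # Naive DFT of the windowed frame at frequency bin k.
            acc = sum(
                frame[n] * cmath.exp(-2j * math.pi * k * n / frame_len)
                for n in range(frame_len)
            )
            row.append(abs(acc))
        spec.append(row)
    return spec


# A pure tone that completes 8 cycles per 64-sample frame.
tone = [math.sin(2 * math.pi * 8 * n / 64) for n in range(256)]
spec = stft_magnitude(tone)
peak_bins = [row.index(max(row)) for row in spec]
```

Each row of `spec` is one time step and each column is one frequency bin, so moving along the two axes means different things. That asymmetry is why the summary argues for treating the time and frequency directions separately.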
-
Resources:
https://www.assemblyai.com/blog/recent-developments-in-generative-ai-for-audio/
https://colab.research.google.com/github/sanchit-gandhi/notebooks/blob/main/MusicGen.ipynb