-
### Description
What Does RMSEnergyExtractor Do?
Calculates RMS Energy:
RMS energy is a measure of the power of an audio signal. It is computed as the square root of the average of the squared …
-
Currently, the `Feature Extraction` task includes both models for audio and text feature extraction (it is officially placed under the NLP modality). I think it would be nice to have a new task for `A…
-
Rock or pop?
Write a program that classifies small audio files into different genres using machine learning models (you can use a pre-trained model or simple feature extraction).
-
This project classifies audio samples from urban environments into one of 10 classes. The dataset, known as UrbanSound8K, contains 8732 sound excerpts, each 4 seconds or shorter, representing urban so…
-
Hello author, thank you for your excellent work!
I want to use other datasets for video parsing training. I found the "`.py`" file for video feature extraction in directory `cpsp_avvp/scripts`. How…
-
# Benchmark
## Introduce
In the field of deep learning for audio, the mel spectrogram is the most commonly used audio feature. The performance of mel spectrogram features can be benchmarked and co…
-
### Have you completed your first issue?
- [X] I have completed my first issue
### Guidelines
- [X] I have read the guidelines
- [X] I have the link to my latest merged PR
### Latest Merged PR Lin…
-
**Summary:**
Currently, the project relies on YouTube’s captioning system for lyrics extraction. However, only a limited number of YouTube videos have captions enabled, restricting the number of song…
cmm25 updated
3 weeks ago
-
Notes Animal sounds project
Background
Research questions
logistics Hard drives with TB of audio data by plane
Lot of data
Data on Yoda
Team and collaboration
Scientific problems:
• …
-
During testing, I plan to use another audio feature extraction with a different shape (x, 16, 80). But it is incompatible with the convolution model.
`RuntimeError: Given groups=1, weight of size […