DagsHub / audio-datasets

open-source audio datasets
https://dagshub.com/DagsHub/audio-datasets
141 stars 23 forks source link
audio audio-datasets codepeak codepeak2022 hacktoberfest hacktoberfest-2022 hacktoberfest-2023 hacktoberfest-22 hacktoberfest2022 hacktoberfest22 open-source

Open-source Audio Datasets

banner

What is DagsHub?

DagsHub is a centralized platform to host and manage machine learning projects including code, data, models, experiments, annotations, model registry, and more! DagsHub does the MLOps heavy lifting for its users. Every repository comes with configured S3 storage, an experiment tracking server, and an annotation workspace - all using popular open-source tools like MLflow, DVC, Git, and Label Studio.

What is Hacktoberfest?

Hacktoberfest is a month-long virtual festival of open source! Participants are giving back to the community by completing pull requests, participating in events, and donating to open-source projects. This project is part of Hacktoberfest 2023, where participants enrich the open-source audio datasets hosted on DagsHub.

Quick Start to Contribution

What does the DagsHub community contribute?

This year we'd like to focus our contribution on the audio domain. For that, we added audio data catalog capabilities to DagsHub! You can now upload audio files to DagsHub and see its spectrogram, wave, and even listen to it! You can see a vivid example of this (extremely cool) feature in our Librispeech-ASR-corpus project.

audio-catalog

To help audio practitioners leverage this new feature, we want to enrich open-source audio datasets on DagsHub. This is where you can contribute to the data science community!

How to contribute?