coqui-ai / open-speech-corpora

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
MIT License
1.27k stars 137 forks source link

adding inaGVAD corpus #221

Open DavidDoukhan opened 4 months ago

DavidDoukhan commented 4 months ago

Dear maintainers,

First of all I want to thank you for your useful listing of voice datasets that have been useful for my research.

I'm suggesting to add inaGVAD to this listing, a brand new voice activity detection and speaker gender segmentation that has just been released and presented at LREC 2024. This dataset has been proven to be useful and challenging with respect to state-of-the art datasets and VAD systems (full details are provided in LREC paper).

The access to this dataset is free for academics, and I hope that its mention on your page will contribute to its adoption in the speech analysis community. For copyright compliance issue, its use is restricted to academics and its access requires to comply to INA's GCU

FYI : I've signed the contributor License Agreement.

Pull request guidelines

Welcome to the 🐸open-speech-corpora project! We are excited to see your interest, and we appreciate your support!

This repository is governed by the Contributor Covenant Code of Conduct. For more details, see the CODE_OF_CONDUCT.md file.

Before accepting your pull request, you will be asked to sign a Contributor License Agreement.

This Contributor License Agreement: