SpeechColab / GigaSpeech

Large, modern dataset for speech recognition
Apache License 2.0
649 stars 62 forks source link

What is the source of GigaSpeech Podcast and Audiobook? #101

Closed xiaobobo-bilibili closed 2 years ago

xiaobobo-bilibili commented 2 years ago

I've been searching but couldn't find any website that contains downloadable podcast/ audiobook with captions.

chenguoguo commented 2 years ago

The metadata file contains urls for the original audio/video files

dophist commented 2 years ago

solved. closing