Closed GNroy closed 3 years ago
Hey thanks a lot for the questions! We are still finalizing the dataset so things may change... Trying to answer as much as I can:
if possible, it would be great to make sure that the original youtube data was released under a creative commons license by the creators, and not the youtube license.
We can't claim right on audio, it will be more like the approach of ImageNet.
what's the approach of imagenet? I'm not familiar with their distribution approach
We updated the README and it's ready for download now. You will have to agree to the Terms of Access.
Since the GigaSpeech dataset has been officially released, the questions listed above are explained in our paper and repo's README. I'm closing this.
Thanks for providing a new dataset!
Reading the README left me with a few questions, though:
-- Thanks, Aleksandr Laptev