AI4Bharat / NPTEL2020-Indian-English-Speech-Dataset

NPTEL2020: Speech2Text dataset for Indian-English Accent
72 stars 20 forks source link

Not able to read Data in Colab #13

Open kulshrestha13 opened 2 years ago

kulshrestha13 commented 2 years ago

When I ran your scripts, the file got downloaded in Google Colab Noebook, but I could not read the data. It is a .tar.gz file but when I extracted it, it gave me the following error- 'OSError: Not a gzipped file '

Can you tell me how to extract and read it?

Thanks

GokulNC commented 2 years ago

Which file exactly?

Probably it was not downloaded completely. Did you retry?

hayk314 commented 11 months ago

I am having a similar issue with the train data. Here is the error I get when trying to unzip partaa of the train (or the concatenated version of the train data parts)

gzip: stdin: unexpected end of file
tar: Unexpected EOF in archive
tar: Unexpected EOF in archive
tar: Error is not recoverable: exiting now