Open sulkytejas opened 1 year ago
Did you mean the sample rate to be 16000 not 1600? I'm not sure if that's behind the int64 thing though.
re: how many audio files I should use to train? the answer is probably a lot :)
@LWprogramming Thank you for the reply. I am pretty new to the field and still unaware of the unknowns; here is my collab drive. Can you please help me with what I did do wrong? https://colab.research.google.com/drive/10uHyvlwbhrnA3puvznJ4rQo0UOIsRPSj?usp=sharing
Also, would you happen to know any open-source dataset I can use for training? I went through and extracted some myself but it was not enough.
After training the dataset with 2-sec audio, it generates an int64 dtype
tensor.
Torchaudio.save` does not support the type. So I have to cast it to int32. After saving that file, I get an empty file.Can you guide me on the correct way to save it?![Screenshot 2023-04-02 at 5 41 13 PM](https://user-images.githubusercontent.com/10854204/229388494-c226aeb9-e28b-4e00-aeb5-e1b5fcaa05c5.png)
Also, could you let me know how many audio files I should use to train?