mravanelli / SincNet

SincNet is a neural architecture for efficiently processing raw audio samples.
MIT License

Clarifications for creating sequential batches instead of random sampling. #112

Open Raviteja-banda opened 1 year ago

Raviteja-banda commented 1 year ago

Please correct me if I'm wrong: in speaker_id.py, the function create_batches_rnd() selects a batch of random samples and then takes a random chunk of 200 ms (the chunk length is 3200 samples at a sampling rate of 16000 Hz) from each sample. This way, the same file may be selected in two different batches, and some files may never be selected. As a result, the model might never be trained on some labels. Am I correct in saying so?
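To make the sampling scheme described above concrete, here is a minimal sketch. It is not the code from speaker_id.py: the function names `random_chunk_batch` and `p_never_drawn`, the synthetic signals, and the assumption that files are drawn uniformly with replacement are all illustrative.

```python
import numpy as np

def random_chunk_batch(signals, labels, batch_size, wlen, rng):
    """Draw batch_size files uniformly WITH replacement (assumption),
    then cut one random chunk of wlen samples from each drawn file."""
    file_idx = rng.integers(0, len(signals), size=batch_size)
    batch = np.zeros((batch_size, wlen), dtype=np.float32)
    for row, i in enumerate(file_idx):
        # Last valid start position keeps the chunk inside the signal.
        start = rng.integers(0, len(signals[i]) - wlen + 1)
        batch[row] = signals[i][start:start + wlen]
    return batch, labels[file_idx], file_idx

def p_never_drawn(n_files, batch_size, n_batches):
    """Probability that one given file is never drawn, under uniform
    sampling with replacement: (1 - 1/N) ** (batch_size * n_batches)."""
    return (1.0 - 1.0 / n_files) ** (batch_size * n_batches)

rng = np.random.default_rng(0)
fs, wlen = 16000, 3200                      # values quoted in the question
# Hypothetical data: 100 two-second noise signals, 10 speaker labels.
signals = [rng.standard_normal(2 * fs).astype(np.float32) for _ in range(100)]
labels = np.arange(100) % 10

batch, batch_labels, file_idx = random_chunk_batch(signals, labels, 8, wlen, rng)
chunk_ms = 1000 * wlen / fs                 # chunk duration in milliseconds
```

Under these assumptions, a file can indeed appear in several batches while another is skipped, and `p_never_drawn` shows that the chance of a given file being skipped shrinks as more batches are drawn.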

What is the effect of selecting the files in sequence instead of at random? That way, each audio file is selected exactly once, and all audio samples are guaranteed to be used for training. It also reduces the number of batches and hence the training time. Could someone please clarify these points?
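The sequential alternative proposed above could look roughly like the sketch below. The helper `sequential_batches` is hypothetical and not part of the SincNet repository; it only produces file indices, so a random chunk could still be cut from each selected file.

```python
import numpy as np

def sequential_batches(n_files, batch_size, rng=None):
    """Yield index arrays that visit every file exactly once per pass.

    If rng is given, the visiting order is permuted first, so the
    batches differ between passes while coverage stays complete."""
    order = np.arange(n_files)
    if rng is not None:
        rng.shuffle(order)
    for start in range(0, n_files, batch_size):
        # The final batch may be shorter when batch_size does not divide n_files.
        yield order[start:start + batch_size]

batches = list(sequential_batches(10, 4))
seen = np.concatenate(batches)
```

With this scheme one pass over the data takes ceil(n_files / batch_size) batches, and every label present in the file list is seen in each pass. Whether that improves or hurts accuracy compared with random sampling is not something this sketch can show.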