Closed deciding closed 4 years ago
Hi, The dataloader will load data parallelly. To make each speaker has the uniform probability to be sampled, each loading process must generate different sequence of random numbers. I add jumpahead to make the above idea to work and unfortunately, I haven't tried to remove jumpahead function. Theoretically, if the code can give you "enough randomness", the result should be okay. You can easily write a code snippet to confirm that without jumpahead. (just do data loading and check the loaded utterances). Please let me know if you try that.
@mycrazycracy Hi, there should be no problem of removing jumpahead. I checked:
Thus the misalignment on my side from your result is due to other thing. I will raise another ticket for this performance alignment.
the jumpahead function of random module is eliminated in python3, I removed it in the code, and found that the eer got degradation. I just want to ask is this function useful in model training? I think the os.urandom can already gives us good randomness