I found that the inference results on my custom dataset differ when I run the evaluation code multiple times. The batch size is fixed at 4, the "shuffle" option of the dataloader is set to False, and I don't use "low_resource" during inference. However, if I fix the seed, the results are consistent. Any clues as to why this happens? Thanks.
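
For reference, here is a minimal sketch of what "fixing the seed" could look like, assuming a PyTorch-based evaluation script. The helper name `seed_everything` and the default seed value are hypothetical and not taken from the repository. If generation involves sampling, the outputs depend on the random state, which would be consistent with results changing between unseeded runs; that is only a possible explanation, not a confirmed cause.

```python
import os
import random


def seed_everything(seed: int = 42) -> None:
    """Seed the random number generators available in this environment.

    Hypothetical helper for illustration; NumPy and PyTorch are seeded
    only if they are installed.
    """
    random.seed(seed)
    # Only affects hash randomization in subprocesses started after this point.
    os.environ["PYTHONHASHSEED"] = str(seed)
    try:
        import numpy as np
        np.random.seed(seed)
    except ImportError:
        pass
    try:
        import torch
        torch.manual_seed(seed)
        # Silently ignored when CUDA is not available.
        torch.cuda.manual_seed_all(seed)
    except ImportError:
        pass


if __name__ == "__main__":
    seed_everything(0)
    first = [random.random() for _ in range(3)]
    seed_everything(0)
    second = [random.random() for _ in range(3)]
    print(first == second)
```

Calling such a helper once before building the model and dataloader gives every run the same starting random state. It does not by itself remove nondeterminism from GPU kernels.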