Closed by ShaoDingBao 2 weeks ago
Why are there different pre-trained models corresponding to different datasets? Is there any specific reason for this?
For some datasets we train models with different FLOPs (compute budgets), so the pre-trained models differ across datasets.