jiasenlu / vilbert_beta


What's your pretraining time per epoch? #57

Open yaorong1996 opened 3 years ago

yaorong1996 commented 3 years ago

When I run my code, which is developed from yours, pretraining takes about 3 days per epoch with a batch size of 64 on 8 GPUs.
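For a rough sanity check, the time per optimizer step implied by that figure can be worked out as below. This is only a sketch: `num_samples` is a placeholder value, not the actual size of the pretraining set, and since a batch size of 64 could mean either the total batch or the per-GPU batch, both readings are computed.

```python
import math

def implied_step_time(epoch_days, num_samples, global_batch_size):
    """Return (seconds_per_step, steps_per_epoch) implied by an epoch duration."""
    # Number of optimizer steps needed to see every sample once.
    steps = math.ceil(num_samples / global_batch_size)
    # Convert the epoch duration from days to seconds, then divide by the step count.
    seconds = epoch_days * 24 * 3600 / steps
    return seconds, steps

# Hypothetical dataset size, for illustration only.
num_samples = 3_000_000

# Reading 1: 64 is the total batch across all 8 GPUs.
sec_total, steps_total = implied_step_time(3, num_samples, 64)

# Reading 2: 64 is the per-GPU batch, so the global batch is 64 * 8.
sec_per_gpu, steps_per_gpu = implied_step_time(3, num_samples, 64 * 8)

print(f"total batch 64:   {steps_total} steps, {sec_total:.2f} s/step")
print(f"per-GPU batch 64: {steps_per_gpu} steps, {sec_per_gpu:.2f} s/step")
```

Substituting the real number of training samples gives a seconds-per-step figure that is easier to compare across setups than days per epoch.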