ProjectNUWA / LayoutNUWA

MIT License
134 stars 16 forks source link

How long did it take to train your model? #10

Closed jianantian closed 11 months ago

jianantian commented 11 months ago

In the paper, i find that you run all experiments on 64 NVIDIA V100 GPUs. How long did it take to train your model?

ZetangForward commented 11 months ago

64GPUs will cost more than 120 hours for training for the large single dataset (PubLayNet and RICI). For the domain-agonist setting, it will cost a week. We recommend pre-training the model on the domain-agonist setting for a few days and continually fine-tuning the model on the special dataset for efficiency and effectiveness.