Open mo230761 opened 2 months ago
https://arxiv.org/pdf/2408.12429
You can find the training details in our ongoing paper (Appendix A, Implementation Details). We trained the model on an A100 (40 GB) with a batch size of 4. You can try a different batch size or other techniques for training. Stay tuned; we will upload the code.
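One common technique for fitting training into less GPU memory is gradient accumulation: run several small micro-batches and average their gradients before each optimizer step. The sketch below is only a toy illustration in plain Python, not the training code from the paper; the one-parameter model, the data, and the function names `grad` and `accumulated_grad` are all hypothetical. It shows that averaging micro-batch gradients reproduces the batch-size-4 gradient.

```python
# Hypothetical toy model: y = w * x with squared-error loss (not the paper's model).

def grad(w, batch):
    """Mean gradient of (w*x - y)^2 with respect to w over (x, y) pairs."""
    return sum(2 * (w * x - y) * x for x, y in batch) / len(batch)

def accumulated_grad(w, batch, micro_batch_size):
    """Gradient accumulated over equal-sized micro-batches."""
    chunks = [batch[i:i + micro_batch_size]
              for i in range(0, len(batch), micro_batch_size)]
    # Scale each micro-batch gradient by 1/len(chunks) so the
    # accumulated sum equals the full-batch mean gradient.
    return sum(grad(w, c) / len(chunks) for c in chunks)

data = [(1.0, 2.0), (2.0, 4.1), (3.0, 5.9), (4.0, 8.2)]  # made-up samples

full = grad(0.5, data)                    # one pass with batch size 4
accum_1 = accumulated_grad(0.5, data, 1)  # four micro-batches of size 1
accum_2 = accumulated_grad(0.5, data, 2)  # two micro-batches of size 2
```

The equivalence holds exactly only when the micro-batches are equal in size and the model has no batch-dependent layers (for example, batch normalization), so treat it as a starting point rather than a guaranteed drop-in replacement.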
OK! Thank you!
Presumably what emits the code
I really appreciate your work. Can a 32GB V100 be used for training? Thanks!