Closed joytianya closed 1 year ago
May I ask about the configs of pre-training? For example, did you use dropout?
If you asking about OpenLLaMA, we follow the exact same configuration of LLaMA, so you can find all the hyperparameters in the LLaMA paper.
I didn't find dropout in the paper. May I ask if llama has used dropout?
May I ask about the configs of pre-training? For example, did you use dropout?