Open BrucePeng92 opened 3 months ago
Why use the gpt model when training llama2? How to determine whether the model trained by pretrain_llama_distributed.sh is llama or gpt?
Why use the gpt model when training llama2? How to determine whether the model trained by pretrain_llama_distributed.sh is llama or gpt?