Closed guocuimi closed 1 month ago
set right default value for rope_theta for llama2. also refactor the code a bit to explicately set default values for llama2, llama3 and Yi.
TODOS:
This diff would fix the correctness issue for llama2.
set right default value for rope_theta for llama2. also refactor the code a bit to explicately set default values for llama2, llama3 and Yi.
TODOS: