Open yasuhisa-nakashima opened 1 year ago
Add support for grouped-query attention for Llama 2 70B and Code Llama 34B compatibility.
References:
Add support for grouped-query attention for Llama 2 70B and Code Llama 34B compatibility.
References: