Closed griff4692 closed 3 months ago
See this section of the writeup.
This issue involves implementing the paper below into gpt-fast
and doing a sanity check to see if the profiled attention heads are accurate for llama-3
See this section of the writeup.
This issue involves implementing the paper below into gpt-fast
and doing a sanity check to see if the profiled attention heads are accurate for llama-3