baihuajun24 opened 1 week ago
Hello Eagle Team! I noticed that at https://github.com/SafeAILab/EAGLE/blob/667ba930db7ea0075421f3c7df94ffbc10b93805/eagle/model/modeling_llama_kv.py#L594 you set past_key_value to None in the forward function, compared with the source code at https://github.com/huggingface/transformers/blob/e51d7ac70ab8f3e69d3659226aa838308a668238/src/transformers/models/llama/modeling_llama.py#L324. Could you provide some insight into why you made this change? I am trying to generate responses with code-llama-7b using EAGLE's KVLlamaForCausalLM class, but the results are of much lower quality than those I get with the default AutoModelForCausalLM class. I suspect the KV cache affects the generation.

This modification is due to the use of a pre-allocated KV cache to optimize the efficiency of the base model (this part of the code follows Medusa). In the cat operation at https://github.com/SafeAILab/EAGLE/blob/667ba930db7ea0075421f3c7df94ffbc10b93805/eagle/model/modeling_llama_kv.py#L591-L592, the key and value of the current token have already been written into past_key_value, so there is no need to return them for operations outside the model. The modification itself does not affect model performance, but if you do not reset the length attribute of the KV cache after a generation, subsequent generations will be abnormal.
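The failure mode described above can be illustrated with a minimal sketch. This is a hypothetical simplification, not EAGLE's actual implementation: a fixed-size buffer is filled in place (the analogue of the in-place cat), the forward pass no longer needs to return the current token's key/value, and a `current_length` counter marks how much of the buffer is live. Forgetting to reset that counter between generations makes the next generation treat the previous request's entries as valid context.

```python
class PreallocatedKVCache:
    """Hypothetical sketch of a pre-allocated KV cache (not EAGLE's code).

    Keys/values are written in place into a fixed-size buffer, so the
    model can return None instead of the current token's key/value. The
    caller must reset current_length after each generation; otherwise
    stale entries from the previous request remain part of the context.
    """

    def __init__(self, max_len):
        self.max_len = max_len
        self.buffer = [None] * max_len  # pre-allocated slots
        self.current_length = 0         # number of live entries

    def append(self, kv):
        # In-place analogue of the cat operation: write the current
        # token's key/value into the next free slot.
        self.buffer[self.current_length] = kv
        self.current_length += 1

    def context(self):
        # Only the first current_length entries are the live context.
        return self.buffer[:self.current_length]

    def reset(self):
        # Must be called between generations. Note that the old entries
        # are NOT erased; they are merely marked as dead. Skipping this
        # reset is what produces the "abnormal generation" symptom.
        self.current_length = 0
```

In this sketch the cost of `reset` is O(1) because only the length counter changes; the stale data stays in the buffer, which is exactly why an unreset cache silently poisons the next generation rather than crashing.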