[Feat] Support KV Cache

RUCAIBox / LLMBox

A comprehensive library for implementing LLMs, including a unified training pipeline and comprehensive model evaluation.

MIT License

578 stars 74 forks source link

Closed huyiwen closed 5 months ago

huyiwen commented 5 months ago

Evaluation Method	ppl	prob	generation
Dataset	Hellaswag (0-shot)	MMLU (5-shot)	GSM (8-shot)
Performance (PC)	75.60	46.52	15.16
Performance (Vanilla)	75.59	46.53	14.56
Time (PC)	0:05:48	0:05:51	0:17:13
Time (Vanilla)	0:05:32	0:18:30	2:10:27