thunlp / InfLLM

The code of our paper "InfLLM: Unveiling the Intrinsic Capacity of LLMs for Understanding Extremely Long Sequences with Training-Free Memory"
MIT License

Optimize memory usage, FastChat integration and multiprocessing benchmark #13

Closed guyan364 closed 5 months ago

guyan364 commented 5 months ago

  1. Optimize GPU memory usage
  2. Integrate the FastChat chat CLI
  3. Multi-process benchmarking
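
The multi-process benchmarking in item 3 could, in principle, look like the following minimal sketch: the dataset is split into shards and each shard is evaluated in its own worker process (e.g., one per GPU). This is a hypothetical illustration, not the PR's actual code; `evaluate_shard` and `run_benchmark` are invented placeholder names.

```python
# Hypothetical sketch: shard a benchmark dataset across worker processes,
# each worker evaluating its shard independently (e.g., one per GPU).
from multiprocessing import Pool

def evaluate_shard(shard):
    # Placeholder for per-sample model evaluation; returns dummy
    # per-sample "scores" (here just the sample lengths).
    return [len(sample) for sample in shard]

def run_benchmark(samples, num_workers=4):
    # Strided split of samples into num_workers shards.
    shards = [samples[i::num_workers] for i in range(num_workers)]
    with Pool(num_workers) as pool:
        results = pool.map(evaluate_shard, shards)
    # Flatten per-shard results back into one list.
    return [score for shard_scores in results for score in shard_scores]

if __name__ == "__main__":
    print(run_benchmark(["abc", "de", "fghi", "j"], num_workers=2))
```

In a real benchmark the worker would load the model once per process and write its shard's results to a separate file, which the parent then merges.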

Closes #5, #8, #11, #12