thunlp / InfLLM

The code of our paper "InfLLM: Unveiling the Intrinsic Capacity of LLMs for Understanding Extremely Long Sequences with Training-Free Memory"
MIT License
309 stars 29 forks source link

qwen系列模型支持 #3

Closed fst813 closed 8 months ago

fst813 commented 8 months ago

能否支持下qwen系列模型

huliangbing commented 8 months ago

同问

NaivePawn commented 8 months ago

@fst813 我在patch.py里加了这一段,不知道对不对,实验下来点没降 image

fst813 commented 8 months ago

@NaivePawn 只加这个不对吧?你跑起来了?

NaivePawn commented 8 months ago

@NaivePawn 只加这个不对吧?你跑起来了?

啥不对?我跑起来了,要用qwen1.5那个

fst813 commented 8 months ago

我跑的不是1.5 跑的qwen,改了很多东西

huliangbing commented 8 months ago

效果如何?