issues
search
YuchuanTian
/
DiJiang
[ICML'24 Oral] The official code of "DiJiang: Efficient Large Language Models through Compact Kernelization", a novel DCT-based linear attention mechanism.
https://arxiv.org/abs/2403.19928
86
stars
5
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
一些问题
#5
00ffcc
opened
2 months ago
3
Long inputs cause overflow / underflow
#4
yuji96
opened
2 months ago
2
Wrong Configuration settings in python-2.8/1B
#3
4IK1d
closed
3 months ago
1
Llama 7B?
#2
pharaouk
opened
3 months ago
2
Merge to huggingface/transformers
#1
sepcnt
opened
3 months ago
0