issues
search
jquesnelle
/
yarn
YaRN: Efficient Context Window Extension of Large Language Models
MIT License
1.25k
stars
110
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Testing yarn on practical tasks.
#12
ChenxinAn-fdu
closed
10 months ago
3
Discussion: how to apply this experiment to the llama2 70B model?
#11
ghost
opened
10 months ago
6
Update requirements.txt
#10
adarshxs
closed
8 months ago
0
Compute Requirements
#9
torphix
closed
10 months ago
3
Error about eval/passkey.py
#8
Xnhyacinth
closed
11 months ago
0
Added Block Sparse, Sparse, and regular Flash Attention
#7
arnavdantuluri
closed
10 months ago
0
Linear Scaled Embedding Has Different Implementation?
#6
fahadh4ilyas
closed
10 months ago
1
Best datasets to use for finetuning?
#5
StrangeTcy
opened
11 months ago
0
Finetune Example
#4
M-Chris
closed
10 months ago
3
How to generate plot?
#3
vgoklani
closed
11 months ago
1
License
#2
andreaskoepf
closed
11 months ago
1
Add NTK-Aware interpolation "by parts" correction
#1
bloc97
closed
11 months ago
0
Previous