issues
search
kyegomez
/
LongNet
Implementation of plug in and play Attention from "LongNet: Scaling Transformers to 1,000,000,000 Tokens"
https://discord.gg/qUtxnK2NMf
Apache License 2.0
688
stars
64
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
LongNetTransformer Error
#25
LiJiahao-Alex
opened
4 months ago
3
Feature Request : Enhance Attention Mechanism for Multi-GPU Support
#24
viai957
opened
6 months ago
0
Train Error
#23
bruicecode
opened
8 months ago
4
train error
#22
ZTYyy
opened
11 months ago
8
pip install Error
#21
yangsp5
closed
1 year ago
1
add dilated mask
#20
nullonesix
opened
1 year ago
1
OutOfMemoryError
#19
Qembo154
closed
1 year ago
1
AMD Support
#18
userbox020
closed
1 year ago
1
LongNet can be used for fine-tuning large language models?
#17
mahuixian
closed
1 year ago
1
ModuleNotFoundError: No module named 'LongNet'
#16
copi143
closed
1 year ago
1
where to find any experiments on real dataset?
#15
decoda-huanyi
closed
1 year ago
3
Module not Found Error : 'packaging'
#14
jh9504
closed
1 year ago
1
Incorrect argument type passed into utils.sparsifyIndices()
#13
MiuMiuMiue
closed
1 year ago
2
Import issues
#12
jtrechot
closed
1 year ago
1
example.py does not work
#11
SowreshMS
closed
1 year ago
2
The README usage code failed to run.
#10
LetianLee
closed
1 year ago
1
KeyError: 'module.token_embs.0.gamma'
#9
pokameng
closed
1 year ago
1
RuntimeError: shape '[32, 1, -1, 64, 512]' is invalid for input of size 524288
#8
One-sixth
closed
1 year ago
1
Training with gpus
#7
fangxy100
closed
1 year ago
1
fix SyntaxError: keyword argument repeated: dropout
#6
grahamannett
closed
1 year ago
0
Any demo python I can play with?
#5
AK51
closed
1 year ago
4
Fix typo in README.md
#4
eltociear
opened
1 year ago
0
cant install
#3
jmanhype
closed
1 year ago
12
Basemodel usage
#2
PriNova
closed
1 year ago
2
Link to official implementation, remove misleading citation
#1
elliottower
closed
1 year ago
7