issues
search
kyegomez
/
BitNet
Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in pytorch
https://discord.gg/qUtxnK2NMf
MIT License
1.56k
stars
145
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Bump pypa/gh-action-pypi-publish from 1.9.0 to 1.10.3
#65
dependabot[bot]
opened
8 hours ago
0
Bump pypa/gh-action-pypi-publish from 1.9.0 to 1.10.2
#64
dependabot[bot]
closed
8 hours ago
1
Question: embeddings 3bits?
#63
telamon
opened
2 weeks ago
0
Bump pypa/gh-action-pypi-publish from 1.9.0 to 1.10.1
#62
dependabot[bot]
closed
2 weeks ago
1
Bump pypa/gh-action-pypi-publish from 1.9.0 to 1.10.0
#61
dependabot[bot]
closed
4 weeks ago
1
[BUG] cant compile the cuda
#60
huynhducloi00
opened
2 months ago
1
Fix: Weight quantization sign should be the last operation
#59
jmbrito01
closed
1 day ago
1
Create a javascript version that can run in a web browser?
#58
flatsiedatsie
closed
4 weeks ago
2
NanoGPT sample
#57
izaxon
closed
3 weeks ago
1
Bump pypa/gh-action-pypi-publish from 1.8.14 to 1.9.0
#56
dependabot[bot]
closed
3 months ago
0
BitNet model performs wrose than Base Transformer
#55
johanssontan
closed
2 weeks ago
2
added if else statement to handle post_act_ln
#54
Hiromasa-H
closed
4 months ago
1
[BUG] BitFeedForward(post_act_ln=False) results in a TypeError
#53
Hiromasa-H
closed
4 months ago
3
Expected BitLinear weight to be 1 or -1
#52
sanjeev-bhandari
closed
1 day ago
4
what is the purpose of detach here?
#51
Weitian-Wang-Bosch
closed
5 months ago
2
Fix error in bitlinear algorithm
#50
Mrw33554432
closed
3 months ago
1
[BUG] Bitnet Example Bug
#49
sneilan
closed
4 months ago
6
Consider techniques from official training paper
#48
EwoutH
closed
4 months ago
2
fix grouping in bitlinear.py
#47
Jiangxg
closed
6 months ago
0
1.58bit algorithm implement recommend
#46
princepride
closed
4 months ago
2
Encountering Size Mismatch Error in Updated Code
#45
anonymousA123
closed
4 months ago
3
Bump pypa/gh-action-pypi-publish from 1.8.12 to 1.8.14
#44
dependabot[bot]
closed
6 months ago
0
[BUG] NoneType in sequential module in bit_ffn
#43
jayUyang
closed
4 months ago
1
[BUG] bitlinear fix
#42
jayUyang
closed
4 months ago
5
is this actually working?
#41
fblgit
closed
3 months ago
11
Issue with model size after replacing BitLinear layer into a HF model (say Llama2-7b-chat)[BUG]
#40
mriganktiwari
closed
4 months ago
3
Revert "Jp"
#39
kyegomez
closed
7 months ago
0
Google Drive Link to model weights is broken
#38
SinanAkkoyun
closed
7 months ago
1
Fixed shape of beta and gamma for proper broadcasting
#37
dariocazzani
closed
7 months ago
0
Requesting a Text-to-Text translation example
#36
TMammadov
closed
4 months ago
1
The output of BitLinear is quite abnormal
#35
Jiangxg
closed
6 months ago
6
ImportError: cannot import name 'BitLinear15bs' from 'bitnet.bitbnet_b158'[BUG]
#34
Bobby-youngking
closed
6 months ago
6
Update bitlinear.py
#33
ramonpeter
closed
7 months ago
3
[BUG] residual connection wrong?
#32
qianlong0502
closed
7 months ago
1
Bump pypa/gh-action-pypi-publish from 1.8.11 to 1.8.12
#31
dependabot[bot]
closed
7 months ago
0
Update bitlinear.py
#30
ramonpeter
closed
7 months ago
2
Jp
#29
Sunwood-ai-labs
closed
7 months ago
1
fix inference bug
#28
shi3z
closed
4 months ago
1
does not have support for mistral, gemma, etc and generate error [BUG] ?
#27
NickyDark1
closed
3 months ago
5
Parts of the BitLinear code doesn't match paper (before bit1.58)
#26
qqqllppp
closed
5 months ago
2
Question about weight quantization methodology memory savings
#25
nnethercott
closed
5 months ago
1
[BUG]multi-head attention is noop for BITLINEAR
#24
Bsdnbo
closed
7 months ago
1
[BUG] Loss drops, model still produces gibberish?
#23
MichelNivard
closed
3 months ago
5
where to download bitnet model ?
#22
dibu28
closed
7 months ago
4
About 'replace_hf.py'
#21
chyoob
closed
3 months ago
3
[BUG] Google drive link in readme.md is dead
#20
nathanielhudson
closed
7 months ago
1
[BUG] Tensor size mismatch from train.py
#19
richardburleigh
closed
7 months ago
7
[BUG] Can't install with pipenv, pip
#18
calliope-pro
closed
7 months ago
1
need a distributed training example
#17
sosofun
closed
7 months ago
3
[BUG] I tried using BitLinear in nanoRWKV but got the error.
#16
win10ogod
closed
9 months ago
3
Next