issues
search
microsoft
/
torchscale
Foundation Architecture for (M)LLMs
https://aka.ms/GeneralAI
MIT License
3.01k
stars
202
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
fix a bug that overrides the default constructed output_projection
#9
MatthewChang
closed
1 year ago
1
don't need attn weight in decoder
#8
buaahsh
closed
1 year ago
0
Typo in Paper
#7
hu-po
closed
1 year ago
1
requires dict.txt and sentencepiece.bpe.model
#6
lpyhdzx
closed
1 year ago
2
fix typo
#5
kashif
closed
1 year ago
1
remove lambda
#4
kashif
closed
1 year ago
1
Can't pickle
#3
kashif
closed
1 year ago
1
does torchscale functionalities can impove modeling generality and capability in case of Session-Based Recommendation system
#2
deep-matter
closed
1 year ago
1
Fix decoder_embed_dim in Fairseq example
#1
buaahsh
closed
1 year ago
0
Previous