issues
search
OpenBMB
/
ModelCenter
Efficient, Low-Resource, Distributed transformer implementation based on BMTrain
https://modelcenter.readthedocs.io
Apache License 2.0
243
stars
30
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
TypeError: Block.forward() got an unexpected keyword argument 'past_key_value'[BUG]
#47
kunling-cxk
opened
5 months ago
0
[FEATURE] Can we have an example of transfering ModelCenter weights into HF weights?
#46
MohamedAdelNaguib
opened
7 months ago
0
[BUG] TypeError: linear(): argument 'input' (position 1) must be Tensor, not NoneType when running get started code
#45
jiangzizi
opened
9 months ago
0
[BUG] Following Quick Start and in step 3 "Prepare the dataset" encountering "KeyError: 'label'"
#44
jiangzizi
opened
9 months ago
0
fix roberta dtype bug
#43
XWang20
closed
1 year ago
0
模型加载问题
#42
ftgreat
opened
1 year ago
0
[BUG] llama outputting random gibberish
#41
w32zhong
opened
1 year ago
1
Add LLaMA
#40
Achazwl
closed
1 year ago
0
Add: LLaMa Model
#39
huangyuxiang03
closed
1 year ago
0
How can I use my own dataset while using ModelCenter?
#38
lhj-git
opened
1 year ago
1
[BUG] cpm1 finetuning error ---- AttributeError: 'BaseModelOutput' object has no attribute 'index_select'
#37
pikaqqqqqq
opened
2 years ago
0
[BUG] Error in the demonstration code
#36
Yangyi-Chen
closed
2 years ago
0
A report of misspelling in the file cpm1.py
#35
Kunlun-Zhu
opened
2 years ago
1
[FEATURE] generate method
#34
h-peng17
closed
2 years ago
0
[BUG] Pretrained GPT2 model has an incorrect size compared with the config file.
#33
alphaGem
closed
2 years ago
5
[FEATURE] Make the dimensions of linear spaces distinguishable
#32
alphaGem
opened
2 years ago
1
FIX bugs and some interface problems
#31
MayDomine
closed
2 years ago
0
add_t5_cache
#30
Clancy-Zhu
closed
2 years ago
0
fix use_cache interfache for gpt,bert,cpm3 and fix glm position id bug
#29
MayDomine
closed
2 years ago
0
[BUG] can't import model_center.tools
#28
xrandx
closed
2 years ago
1
add t5 scale config
#27
MayDomine
closed
2 years ago
0
ModelCenter1.0.4
#26
MayDomine
closed
2 years ago
0
add distributed dataset
#25
a710128
closed
2 years ago
0
Sparse attention & Longformer
#24
MayDomine
closed
2 years ago
0
Merge pull request #1 from OpenBMB/main
#23
MayDomine
closed
2 years ago
0
Longformer & SparseAttention
#22
MayDomine
closed
2 years ago
0
fix cache in gpt2
#21
zh-zheng
closed
2 years ago
0
[FEATURE] support model.from_pretrained without the need of init distributed
#20
Jiaxin-Wen
opened
2 years ago
0
add test for vit(forward and backward)
#19
MayDomine
closed
2 years ago
0
Question in ModelCenter/model_center/layer/transformer.py
#18
Kunlun-Zhu
closed
2 years ago
5
Shared key value
#17
THUCSTHanxu13
closed
2 years ago
0
vit reconstructed
#16
MayDomine
closed
2 years ago
0
Vit reconstruct
#15
MayDomine
closed
2 years ago
0
[BUG] Get nan when calculating cross entropy loss.
#14
alphaGem
closed
2 years ago
3
T51.1 & MT5
#13
Achazwl
closed
2 years ago
0
test_backward
#12
MayDomine
closed
2 years ago
0
vit for bmtrain
#11
qyc-98
closed
2 years ago
0
vit for bmtrain
#10
qyc-98
closed
2 years ago
0
[FEATURE] Add custom config parameters in **from_pretrained**
#9
xcjthu
closed
2 years ago
1
CPM模型加载异常
#8
xikaluo
closed
2 years ago
2
ModelForLM
#7
QiaoZiqing
opened
2 years ago
0
update readme
#6
zh-zheng
closed
2 years ago
0
update readme
#5
zh-zheng
closed
2 years ago
0
update README-ZH
#4
zh-zheng
closed
2 years ago
0
Update docs and README:add quick start
#3
zh-zheng
closed
2 years ago
0
README
#2
Achazwl
closed
2 years ago
0
transpose
#1
a710128
closed
2 years ago
0