EleutherAI/oslo
OSLO: Open Source for Large-scale Optimization
https://oslo.eleuther.ai
173 stars, 29 forks
issues
TP does not work on Flan-T5
#215
shreyansh26
opened
11 months ago
0
is there any plan to support llama model?
#214
yuyaxiong
opened
1 year ago
0
Feature/auto model test
#213
koliaok
closed
1 year ago
1
Give num_concurrent as argument
#212
jinwonkim93
closed
1 year ago
0
[Fix] Refactor ZeRO Directory Structure
#211
yhna940
closed
1 year ago
2
Auto model test integration
#210
koliaok
closed
1 year ago
1
[Fix] rpc not working on aws
#209
jinwonkim93
closed
1 year ago
2
[Add] copyright for ZeRO3
#208
yhna940
closed
1 year ago
0
[Feature] Enhance Distributed Data Parallel wrapper with ZeRO
#207
yhna940
closed
1 year ago
1
[Fix] Change the logger of ZeRO3
#206
yhna940
closed
1 year ago
0
[Add] ZeRO3 unit test
#205
yhna940
closed
1 year ago
0
Ohwi/pp use message queue
#204
ohwi
closed
1 year ago
2
To apply FlashAttention
#203
dyanos
opened
1 year ago
1
Add requirement for lightseq2
#202
jinwonkim93
closed
1 year ago
0
[Feature] new interface for zero
#201
jinwonkim93
closed
1 year ago
0
BART PreTraining code
#200
hmy831004
closed
1 year ago
0
Add DP on Trainer
#199
tree-park
closed
1 year ago
0
Apply Multi-head Attention Layer of LightSeq2 on BERT
#198
dyanos
closed
1 year ago
6
Applied Multi-head Attention Layer of LightSeq2 on BERT
#197
dyanos
closed
1 year ago
1
[Feature] Add GradScaler for ZeroOptim
#196
nijkah
opened
1 year ago
3
[Del] grad handle free storage func
#195
yhna940
closed
1 year ago
0
Remove typos related to distributed tensor
#194
KKIEEK
closed
1 year ago
0
Revert "Resolve merge conflicts for ZeRO3"
#193
KKIEEK
closed
1 year ago
0
Resolve merge conflicts for ZeRO3
#192
KKIEEK
closed
1 year ago
1
Task Variable Naming change
#191
hmy831004
closed
1 year ago
0
[Add] Distributed Logger Utility for PyTorch
#190
yhna940
closed
1 year ago
0
[Del] Remove DistributedTensor
#189
yhna940
closed
1 year ago
0
Trainer updates additional functions
#188
tree-park
closed
1 year ago
1
[Fix] Minor Issues in Data Parallel and Chunk Utils
#187
yhna940
closed
1 year ago
0
TODO : Zero optimizer deparallel
#186
hyeinhyun
opened
1 year ago
0
[Enhance] Support ZeRO3
#185
yhna940
closed
1 year ago
2
[Refact] Mv dist tensor level
#184
yhna940
closed
1 year ago
0
[zero] Suggests a minor change to confusing variable names in the ZeRO optimizer
#183
yhna940
closed
1 year ago
0
Change save_pretrained and from_parallelized annotation
#182
koliaok
closed
1 year ago
2
[Enhance] add zero3 opt
#181
yhna940
closed
1 year ago
1
fixed save_pretrained
#180
jason9693
closed
1 year ago
5
Revert ViT TP2D logic
#179
KKIEEK
closed
1 year ago
0
[Enhance] Improve Zero3 Implementation: Search Utility, Consolidation, and In-Place Dist Tensor Conversion
#178
yhna940
closed
1 year ago
1
[Fix] minor bug for single output in _DistributedDataParallel
#177
yhna940
closed
1 year ago
2
[MOD] BART TASK Update
#176
hmy831004
closed
1 year ago
5
TODO : Documentation for Expert Parallel
#175
scsc0511
opened
1 year ago
0
Remove torch version 2.0.0 dependency in
#174
koliaok
closed
1 year ago
1
[Fix] Support gradient accumulation for DDP
#173
KKIEEK
closed
1 year ago
1
Make decoder-only models to be able to generate with `inputs_embeds`
#172
ingyuseong
closed
1 year ago
1
Add restarting model from saved model and fix bug
#171
tree-park
closed
1 year ago
0
adding tutorial docs for DataParallel, ZeroOptimizer
#170
jinwonkim93
closed
1 year ago
3
Wrong import in zero
#169
jinwonkim93
closed
1 year ago
3
Prototyping FSDP
#168
KKIEEK
closed
1 year ago
2
[Fix] zero optimizer w/ tensor parallel test
#167
yhna940
closed
1 year ago
4
import ParallelMode
#166
bzantium
closed
1 year ago
0