mosaicml/llm-foundry
LLM training code for Databricks foundation models
https://www.databricks.com/blog/introducing-dbrx-new-state-art-open-llm
Apache License 2.0
3.84k stars
503 forks
issues
Nicer error message for undefined symbol
#1339
dakinggg
closed
3 days ago
0
Avoid HF race condition
#1338
dakinggg
closed
3 days ago
0
Add CLI for train.py
#1337
KuuCi
opened
3 days ago
0
Deepcopy config in callbacks_with_config
#1336
mvpatel2000
closed
3 days ago
0
Add a config arg to just save an hf checkpoint
#1335
dakinggg
closed
3 days ago
0
Adding a child class of hf's rotary embedding to make hf generate work on multiple gpus.
#1334
ShashankMosaicML
closed
4 days ago
0
Fix registry for callbacks with configs
#1333
mvpatel2000
closed
4 days ago
0
Currently multi-gpu generate does not work with hf.generate for hf checkpoints. This PR fixes that.
#1332
ShashankMosaicML
closed
4 days ago
0
Bump onnx from 1.14.0 to 1.16.1
#1331
dependabot[bot]
closed
5 days ago
0
Update datasets requirement from <2.20,>=2.19 to >=2.20.0,<2.21
#1330
dependabot[bot]
opened
5 days ago
0
Bump onnxruntime from 1.15.1 to 1.18.1
#1329
dependabot[bot]
closed
5 days ago
0
Bump einops from 0.7.0 to 0.8.0
#1328
dependabot[bot]
closed
5 days ago
0
Update transformers requirement from <4.41,>=4.40 to >=4.42.3,<4.43
#1327
dependabot[bot]
closed
5 days ago
0
Bump version
#1326
b-chu
closed
5 days ago
2
Composer lora weights conversion
#1325
zhao-lun
closed
4 days ago
3
Fixing sequence_id =-1 bug, adding tests
#1324
ShashankMosaicML
closed
5 days ago
0
Registry docs update
#1323
dakinggg
closed
5 days ago
0
Add dependabot
#1322
dakinggg
closed
5 days ago
0
`HUGGING_FACE_HUB_TOKEN` -> `HF_TOKEN`
#1321
dakinggg
closed
5 days ago
0
Extra serverless
#1320
XiaohanZhangCMU
closed
5 days ago
0
Extra for serverless
#1319
XiaohanZhangCMU
closed
1 week ago
0
Refactor hf checkpointer for config transformations
#1318
irenedea
closed
1 week ago
0
Remove databricks-connect from all-cpu dep
#1317
XiaohanZhangCMU
closed
1 week ago
0
Update databricks connect version
#1316
XiaohanZhangCMU
opened
1 week ago
0
Provide default seed value in TrainConfig, matching EvalConfig
#1315
mvpatel2000
closed
1 week ago
0
Relax hf hub pin
#1314
dakinggg
closed
5 days ago
0
Error if metadata matches existing keys
#1313
dakinggg
closed
5 days ago
2
Bump recommended images to 2.3.1 and remove 2.3.0 CI
#1312
dakinggg
closed
1 week ago
0
Fix 4 gpu tests
#1311
dakinggg
closed
1 week ago
0
Bump ci-testing to 0.0.9
#1310
dakinggg
closed
1 week ago
0
Test versioned GPU tests
#1309
b-chu
closed
1 week ago
0
External library usage interface
#1308
moeiniamir
opened
1 week ago
0
Upgrade ci testing to 0.0.8
#1307
dakinggg
closed
1 week ago
0
Update CI test to v0.0.8
#1306
KuuCi
closed
1 week ago
1
Remove codeql workflow
#1305
dakinggg
closed
1 week ago
0
Avoid circular import in hf checkpointer
#1304
dakinggg
closed
1 week ago
0
Bumping mlflow version to include buffering
#1303
JackZ-db
closed
1 week ago
0
Add Retries to run_query
#1302
KuuCi
closed
1 week ago
1
Ignore mosaicml logger for exception if excepthook is active
#1301
jjanezhang
closed
1 week ago
0
Add `all` transforms to train script
#1300
dakinggg
closed
2 weeks ago
0
Allows interweaving of arbitrary kinds of 'attention' layers, like sliding window, reuse prev layer kv cache etc.
#1299
ShashankMosaicML
closed
6 days ago
1
Allow passing in lbl_process_group directly
#1298
dakinggg
closed
2 weeks ago
0
Bump composer to 0.23.4
#1297
mvpatel2000
closed
2 weeks ago
0
Fix grad accum typing
#1296
dakinggg
closed
2 weeks ago
0
[Do Not Merge] Test patch
#1295
mvpatel2000
closed
2 weeks ago
0
Bump min composer version to 0.23.3
#1294
dakinggg
closed
2 weeks ago
0
Small refactor for update batch size
#1293
dakinggg
closed
2 weeks ago
0
Removing logging exception through update run metadata
#1292
jjanezhang
opened
2 weeks ago
0
Unable to use self developed pre-trained model for finetuning in MosaicML
#1291
sauravgrd
closed
1 week ago
1
Extendability refactors
#1290
dakinggg
closed
2 weeks ago
1