issues
search
mosaicml
/
llm-foundry
LLM training code for Databricks foundation models
https://www.databricks.com/blog/introducing-dbrx-new-state-art-open-llm
Apache License 2.0
3.99k
stars
525
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
`HUGGING_FACE_HUB_TOKEN` -> `HF_TOKEN`
#1321
dakinggg
closed
3 months ago
0
Extra serverless
#1320
XiaohanZhangCMU
closed
3 months ago
0
Extra for serverless
#1319
XiaohanZhangCMU
closed
3 months ago
0
Refactor hf checkpointer for config transformations
#1318
irenedea
closed
3 months ago
0
Remove databricks-connect from all-cpu dep
#1317
XiaohanZhangCMU
closed
3 months ago
0
Update databricks connect version
#1316
XiaohanZhangCMU
closed
2 months ago
0
Provide default seed value in TrainConfig, matching EvalConfig
#1315
mvpatel2000
closed
3 months ago
0
Relax hf hub pin
#1314
dakinggg
closed
3 months ago
0
Error if metadata matches existing keys
#1313
dakinggg
closed
3 months ago
2
Bump recommended images to 2.3.1 and remove 2.3.0 CI
#1312
dakinggg
closed
3 months ago
0
Fix 4 gpu tests
#1311
dakinggg
closed
3 months ago
0
Bump ci-testing to 0.0.9
#1310
dakinggg
closed
3 months ago
0
Test versioned GPU tests
#1309
b-chu
closed
3 months ago
0
External library usage interface
#1308
moeiniamir
opened
3 months ago
0
Upgrade ci testing to 0.0.8
#1307
dakinggg
closed
3 months ago
0
Update CI test to v0.0.8
#1306
KuuCi
closed
3 months ago
1
Remove codeql workflow
#1305
dakinggg
closed
3 months ago
0
Avoid circular import in hf checkpointer
#1304
dakinggg
closed
3 months ago
0
Bumping mlflow version to include buffering
#1303
JackZ-db
closed
3 months ago
0
Add Retries to run_query
#1302
KuuCi
closed
3 months ago
1
Ignore mosaicml logger for exception if excephook is active
#1301
jjanezhang
closed
3 months ago
0
Add `all` transforms to train script
#1300
dakinggg
closed
3 months ago
0
Allows interweaving of arbitrary kinds of 'attention' layers, like sliding window, reuse prev layer kv cache etc.
#1299
ShashankMosaicML
closed
3 months ago
1
Allow passing in lbl_process_group directly
#1298
dakinggg
closed
3 months ago
0
Bump composer to 0.23.4
#1297
mvpatel2000
closed
3 months ago
0
Fix grad accum typing
#1296
dakinggg
closed
3 months ago
0
[Do Not Merge] Test patch
#1295
mvpatel2000
closed
3 months ago
0
Bump min composer version to 0.23.3
#1294
dakinggg
closed
3 months ago
0
Small refactor for update batch size
#1293
dakinggg
closed
3 months ago
0
Removing logging exception through update run metadata
#1292
jjanezhang
closed
2 months ago
1
Unable to use self developed pre-trained model for finetuning in MosaicML
#1291
sauravgrd
closed
3 months ago
1
Extendability refactors
#1290
dakinggg
closed
3 months ago
1
MPT training with ALiBi and Flash Attention 2
#1289
rickgit16
closed
3 months ago
4
Add TE to setup
#1288
j316chuck
closed
3 months ago
0
Add missing dependency group
#1287
dakinggg
closed
3 months ago
1
Fix backwards compatibility for ICL arg
#1286
dakinggg
closed
3 months ago
0
Bump mlflow to 2.13.2
#1285
KuuCi
closed
3 months ago
1
Fix typo in CI
#1284
dakinggg
closed
3 months ago
0
Add optional logging of text output to EvalOutputLogging
#1283
sjawhar
closed
3 months ago
8
Update README.md to use variables
#1282
milocress
closed
3 months ago
0
Adds CI for torch 2.3.1
#1281
dakinggg
closed
3 months ago
1
Fix TE HF checkpoint saving
#1280
j316chuck
closed
3 months ago
1
Add loggers by default if env vars are populated
#1279
aspfohl
opened
3 months ago
0
Make expandable segments on by default
#1278
b-chu
closed
3 months ago
0
Fix packing + streaming + resumption
#1277
dakinggg
closed
3 months ago
0
Allow multiprocessing when preparing ICL dataset
#1276
sanjari-orb
opened
3 months ago
8
Add torch 2.3.1 docker images
#1275
dakinggg
closed
3 months ago
0
Update Dockerfile
#1274
j316chuck
closed
3 months ago
0
Update Dockerfile with TE main
#1273
j316chuck
closed
3 months ago
2
Managing Timeout on Training Errors and Simultaneous Restart of All Nodes in LLM Foundry
#1272
germanjke
closed
3 months ago
1
Previous
Next