mosaicml llm-foundry issues

mosaicml / llm-foundry

LLM training code for Databricks foundation models

https://www.databricks.com/blog/introducing-dbrx-new-state-art-open-llm

Apache License 2.0

3.99k stars 525 forks source link

issues

Newest

Newest Most commented Recently updated Oldest Least commented Least recently updated

`HUGGING_FACE_HUB_TOKEN` -> `HF_TOKEN`

#1321 dakinggg closed 3 months ago
0
Extra serverless

#1320 XiaohanZhangCMU closed 3 months ago
0
Extra for serverless

#1319 XiaohanZhangCMU closed 3 months ago
0
Refactor hf checkpointer for config transformations

#1318 irenedea closed 3 months ago
0
Remove databricks-connect from all-cpu dep

#1317 XiaohanZhangCMU closed 3 months ago
0
Update databricks connect version

#1316 XiaohanZhangCMU closed 2 months ago
0
Provide default seed value in TrainConfig, matching EvalConfig

#1315 mvpatel2000 closed 3 months ago
0
Relax hf hub pin

#1314 dakinggg closed 3 months ago
0
Error if metadata matches existing keys

#1313 dakinggg closed 3 months ago
2
Bump recommended images to 2.3.1 and remove 2.3.0 CI

#1312 dakinggg closed 3 months ago
0
Fix 4 gpu tests

#1311 dakinggg closed 3 months ago
0
Bump ci-testing to 0.0.9

#1310 dakinggg closed 3 months ago
0
Test versioned GPU tests

#1309 b-chu closed 3 months ago
0
External library usage interface

#1308 moeiniamir opened 3 months ago
0
Upgrade ci testing to 0.0.8

#1307 dakinggg closed 3 months ago
0
Update CI test to v0.0.8

#1306 KuuCi closed 3 months ago
1
Remove codeql workflow

#1305 dakinggg closed 3 months ago
0
Avoid circular import in hf checkpointer

#1304 dakinggg closed 3 months ago
0
Bumping mlflow version to include buffering

#1303 JackZ-db closed 3 months ago
0
Add Retries to run_query

#1302 KuuCi closed 3 months ago
1
Ignore mosaicml logger for exception if excephook is active

#1301 jjanezhang closed 3 months ago
0
Add `all` transforms to train script

#1300 dakinggg closed 3 months ago
0
Allows interweaving of arbitrary kinds of 'attention' layers, like sliding window, reuse prev layer kv cache etc.

#1299 ShashankMosaicML closed 3 months ago
1
Allow passing in lbl_process_group directly

#1298 dakinggg closed 3 months ago
0
Bump composer to 0.23.4

#1297 mvpatel2000 closed 3 months ago
0
Fix grad accum typing

#1296 dakinggg closed 3 months ago
0
[Do Not Merge] Test patch

#1295 mvpatel2000 closed 3 months ago
0
Bump min composer version to 0.23.3

#1294 dakinggg closed 3 months ago
0
Small refactor for update batch size

#1293 dakinggg closed 3 months ago
0
Removing logging exception through update run metadata

#1292 jjanezhang closed 2 months ago
1
Unable to use self developed pre-trained model for finetuning in MosaicML

#1291 sauravgrd closed 3 months ago
1
Extendability refactors

#1290 dakinggg closed 3 months ago
1
MPT training with ALiBi and Flash Attention 2

#1289 rickgit16 closed 3 months ago
4
Add TE to setup

#1288 j316chuck closed 3 months ago
0
Add missing dependency group

#1287 dakinggg closed 3 months ago
1
Fix backwards compatibility for ICL arg

#1286 dakinggg closed 3 months ago
0
Bump mlflow to 2.13.2

#1285 KuuCi closed 3 months ago
1
Fix typo in CI

#1284 dakinggg closed 3 months ago
0
Add optional logging of text output to EvalOutputLogging

#1283 sjawhar closed 3 months ago
8
Update README.md to use variables

#1282 milocress closed 3 months ago
0
Adds CI for torch 2.3.1

#1281 dakinggg closed 3 months ago
1
Fix TE HF checkpoint saving

#1280 j316chuck closed 3 months ago
1
Add loggers by default if env vars are populated

#1279 aspfohl opened 3 months ago
0
Make expandable segments on by default

#1278 b-chu closed 3 months ago
0
Fix packing + streaming + resumption

#1277 dakinggg closed 3 months ago
0
Allow multiprocessing when preparing ICL dataset

#1276 sanjari-orb opened 3 months ago
8
Add torch 2.3.1 docker images

#1275 dakinggg closed 3 months ago
0
Update Dockerfile

#1274 j316chuck closed 3 months ago
0
Update Dockerfile with TE main

#1273 j316chuck closed 3 months ago
2
Managing Timeout on Training Errors and Simultaneous Restart of All Nodes in LLM Foundry

#1272 germanjke closed 3 months ago
1

Previous Next