allenai/OLMo
Modeling, training, eval, and inference code for OLMo
https://allenai.org/olmo
Apache License 2.0 · 4.19k stars · 390 forks
Issues (sorted by: Newest)
#642 What did OLMo 1B converge to? · sidereior · opened 1 day ago · 0 comments
#641 Resuming training on unsharded checkpoint · lecifire · opened 1 day ago · 0 comments
#640 Multi node training · shahizat · opened 2 days ago · 0 comments
#639 MoE · Muennighoff · opened 5 days ago · 0 comments
#638 WIP: Scaling laws pipeline · AkshitaB · opened 6 days ago · 0 comments
#637 muP implementation · AkshitaB · opened 6 days ago · 0 comments
#636 Add option to skip optim steps for 0 grad params · epwalsh · opened 1 week ago · 0 comments
#635 Unit tests · AkshitaB · opened 1 week ago · 0 comments
#634 Fix Z-loss calculation · epwalsh · closed 1 week ago · 4 comments
#633 Tokenizer with relative path import fails when using olmo as pip library · viking-sudo-rm · opened 1 week ago · 0 comments
#632 How the 1B and 7B model are initialized? · sanyalsunny111 · opened 1 week ago · 0 comments
#631 Make hf_olmo support AutoModelForCausalLM · 2015aroras · closed 1 week ago · 0 comments
#629 Amberish 7B hero run · epwalsh · opened 2 weeks ago · 0 comments
#628 Olmo tiny scripts · ananyahjha93 · closed 1 week ago · 0 comments
#627 Key 'https://olmo_checkpoints' not in 'TrainConfig' · jeqcho · opened 2 weeks ago · 1 comment
#626 Inspect training data improvements · 2015aroras · closed 2 weeks ago · 0 comments
#625 What is the true MLP ratio for OLMo 7B? · jeqcho · closed 2 weeks ago · 2 comments
#624 Make olmo-core checkpointer more robust on weka · epwalsh · closed 2 weeks ago · 0 comments
#623 HF dataset loading optimizations · 2015aroras · closed 3 weeks ago · 0 comments
#622 Cant use LORA · bdytx5 · opened 3 weeks ago · 6 comments
#621 Config for Amberish experiments at 1B · drschwenk · opened 3 weeks ago · 0 comments
#620 Running Amber experiments at 7B · epwalsh · opened 3 weeks ago · 0 comments
#619 Add most OLMo 1.7-7B checkpoints · 2015aroras · closed 3 weeks ago · 0 comments
#618 Normal baselines · AkshitaB · opened 3 weeks ago · 0 comments
#617 added git ref to the config keys · drschwenk · opened 3 weeks ago · 0 comments
#616 Chameleon stability experiments · AkshitaB · opened 3 weeks ago · 0 comments
#615 Officially add OLMo-core as a dependency · epwalsh · closed 3 weeks ago · 0 comments
#614 Make include_instance_metadata a kwarg of build_train_dataloader · 2015aroras · closed 3 weeks ago · 0 comments
#613 Make include_instance_metadata a kwarg of build_train_dataloader · 2015aroras · closed 3 weeks ago · 0 comments
#612 adding DDP to the codebase · ananyahjha93 · closed 3 weeks ago · 3 comments
#611 Read and use tokenizer identifier from config · 2015aroras · closed 3 weeks ago · 0 comments
#610 [HF Converter] Get tokenizer path from config as default · 2015aroras · closed 3 weeks ago · 0 comments
#609 Finetuning config file · joellliu · opened 3 weeks ago · 2 comments
#608 How many tokens were trained for 7B model. · mathfinder · opened 3 weeks ago · 1 comment
#607 Rewrite initialization · AkshitaB · closed 3 weeks ago · 2 comments
#606 now accepts wandb project and entity as options · drschwenk · closed 1 month ago · 2 comments
#605 Add option to record step size metrics from AdamW · epwalsh · opened 1 month ago · 0 comments
#604 Adds a tool that diffs two wandb runs · dirkgr · closed 3 weeks ago · 1 comment
#603 Unshard without passing checkpointer type · 2015aroras · closed 1 month ago · 1 comment
#602 fixed host-device sync at each clipping step · ananyahjha93 · closed 1 month ago · 0 comments
#601 Fixes clipping · ananyahjha93 · closed 1 month ago · 0 comments
#600 Remove usages of Auto* methods in hf_olmo tests · 2015aroras · closed 1 month ago · 0 comments
#599 Merging the train-olmo-large branch · dirkgr · closed 1 month ago · 0 comments
#598 is_causal=attention_bias is None · nkkbr · opened 1 month ago · 1 comment
#597 Default eos_token_id in `scripts/prepare-tulu-data.py` · y0mingzhang · closed 1 month ago · 1 comment
#596 why is the total_grad_norm increasing across training? · ryanyxw · opened 1 month ago · 5 comments
#595 Expose memmap_dtype in the data configuration · leon-g-xu · closed 4 weeks ago · 2 comments
#594 Expose memmap dtype in data config · leon-g-xu · closed 4 weeks ago · 2 comments
#593 Inspect training data without data indices · 2015aroras · closed 1 month ago · 0 comments
#592 training directly from object storage? · joellliu · closed 3 weeks ago · 2 comments