issues
search
allenai
/
OLMo
Modeling, training, eval, and inference code for OLMo
https://allenai.org/olmo
Apache License 2.0
4.75k
stars
485
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
OLMo-2 held-out validation data
#755
chawins
opened
22 hours ago
0
More checkpoint information
#754
dirkgr
closed
1 day ago
0
Figure for plotting Pareto frontier (Flops x Perf)
#753
kyleclo
opened
1 day ago
0
Add OLMo 2 checkpoint converter and update docs
#752
2015aroras
closed
1 day ago
0
Update README.md
#751
revbucket
opened
1 day ago
0
Legal Whammy for 7B
#750
dirkgr
closed
1 day ago
0
Barely Legal Whammy
#749
dirkgr
closed
1 day ago
0
Add test and train sets to in-loop oe-eval (for ladder work)
#748
liujch1998
closed
1 week ago
0
Add intermediate size to hf_olmo
#747
2015aroras
closed
1 week ago
2
Difference between 0724 and 0424 7B models
#746
jiahai-feng
opened
2 weeks ago
0
Documentation Improvements
#745
aman-17
closed
1 day ago
0
dependency issue when running scripts/unshard.py
#744
viking-sudo-rm
closed
3 weeks ago
2
TypeError - running example code
#743
KPK101
opened
3 weeks ago
0
Improved support for Google Storage
#742
dirkgr
closed
3 weeks ago
0
Fail to load tokenizer for checkpoints
#741
tresiwald
opened
1 month ago
0
Adds support for converting from safetensors
#740
soldni
opened
1 month ago
0
Peteish13
#739
dirkgr
closed
1 week ago
1
Annealing configs
#738
dirkgr
closed
3 weeks ago
1
Error Encountered During Multi-Node Pretraining with Torchrun
#737
Zehui127
opened
1 month ago
0
Create an eval-only script for existing ckpts
#736
liujch1998
opened
1 month ago
1
fixed up changelog
#735
revbucket
closed
1 month ago
0
reduce the dataset size - update readme for default conda environment
#734
amazloumi
closed
1 month ago
0
Update version.py
#733
revbucket
closed
1 month ago
0
OLMo Checkpoints Website Down?
#732
jhsansom
closed
1 month ago
2
Adding script for processing many intermediate checkpoints at once for offline evals
#731
IanMagnusson
opened
1 month ago
2
Add regression tests for training
#730
2015aroras
opened
1 month ago
1
I added some script to help people set up the env on vista
#729
leo-liuzy
closed
1 month ago
0
Getting training data by sources
#728
chawins
closed
1 month ago
2
Compile support for peteish13
#727
dirkgr
closed
1 month ago
0
Missing OLMo checkpoints
#726
mirandrom
opened
1 month ago
1
Fix build errors
#725
2015aroras
closed
1 month ago
0
Update LUMI scripts
#724
2015aroras
closed
1 month ago
0
docker
#723
jacky080808
closed
1 month ago
1
8-bit allgather support
#722
yaroslavvb
opened
2 months ago
1
Bump torch version
#721
vwxyzjn
closed
1 month ago
1
Set CUDA device before initializing process group
#720
2015aroras
closed
2 months ago
0
[HF OLMo] Add flash attention and gradient checkpointing support
#719
2015aroras
closed
2 months ago
0
Fix mmlu bpb bug only scoring answer=A questions
#718
OyvindTafjord
closed
2 months ago
1
Added ability to try loading the latest checkpoint from save folders
#717
2015aroras
closed
2 months ago
1
Performance degrades after converting checkpoint to HF
#716
ahmadshapiro
closed
2 weeks ago
1
Expected Data Format
#715
aflah02
opened
3 months ago
1
Which mmlu validation setting is recommend?
#714
mathfinder
opened
3 months ago
1
Criteria for Selecting acc vs. len_norm Metrics
#713
mathfinder
closed
2 weeks ago
1
Fix bug in bpb tasks from oe-eval, add 0-shot csqa and social_iqa
#712
OyvindTafjord
closed
3 months ago
0
fix unbound qkv
#711
epwalsh
closed
3 months ago
1
Version dolma flan change
#710
IanMagnusson
closed
3 months ago
0
Add some docs about debugging
#709
2015aroras
closed
3 months ago
0
Docs model ladder
#708
IanMagnusson
opened
3 months ago
2
Add OLMoE checkpoints and run config
#707
2015aroras
opened
3 months ago
2
OLMoThreadError: generator thread data thread 0 failed
#706
ybdesire
closed
1 month ago
2
Next