issues
search
mlcommons
/
training
Reference implementations of MLPerf™ training benchmarks
https://mlcommons.org/en/groups/training
Apache License 2.0
1.57k
stars
548
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Running udnet3d on multiple GPUS
#750
luiceur
opened
22 hours ago
0
Add MLCube implementation for llama2
#749
davidjurado
opened
1 week ago
1
Update megatron-lm reference to run on hopper gpus
#748
ShriyaPalsamudram
opened
1 week ago
1
Change submission date from v40 to v41
#747
ShriyaPalsamudram
closed
6 days ago
1
Problem downloading S3 bucket
#746
mahmoodn
closed
2 weeks ago
0
Add `distribution_strategy` and `all_reduce_alg` flags to TensorFlow BERT pretraining
#745
rapsealk
opened
3 weeks ago
1
Bus error (core dumped) in graph_neural_network
#744
abrarfuad27
opened
3 weeks ago
1
Improve TensorFlow compatibility in BERT scripts
#743
rapsealk
opened
3 weeks ago
4
Scope of ML based benchmarks in MLPerf.
#741
rakshithgb-fujitsu
opened
4 weeks ago
0
`IndexError` in `cross_device_ops` with `MultiWorkerMirroredStrategy`
#740
rapsealk
opened
1 month ago
0
Invalid `local_replica_id` with `MultiWorkerMirroredStrategy`
#739
rapsealk
opened
1 month ago
0
Updating IGB download paths
#738
akhatua2
opened
1 month ago
4
[GNN] Adds example building dockerfile for H100s.
#737
Elnifio
closed
1 month ago
4
llama3 support?
#736
ifelsefi
opened
1 month ago
0
Hardware Configuration
#735
BhAem
opened
1 month ago
0
GNN: update the docker compose file
#734
LiSu
closed
1 month ago
1
[GNN] Fixes the dockerfile
#733
Elnifio
closed
2 months ago
1
DLRM criteo day23 MD5 varify faild
#732
kkkparty
opened
2 months ago
0
Data download for Stable Diffusion fails
#731
coppock
opened
2 months ago
0
[SD] switched to upstream logging (4.0.0-rc2)
#730
ahmadki
closed
3 weeks ago
1
Add missing logging keys for GNN
#729
LiSu
closed
2 months ago
1
(1) adding support for evaluation skipping; (2) updating model and data…
#728
itayhubara
closed
2 months ago
1
Llama2 - LoRA Reference Implementation
#727
rgandikota
opened
2 months ago
2
llama2: fixing DS yaml by adding gradient clipping: 0.3, and small update to …
#726
itayhubara
closed
2 months ago
1
switch to samples_count in logging of llama2_70b_lora
#725
itayhubara
closed
2 months ago
1
MLPerf library version for 4.0 Submission
#724
rgandikota
closed
2 months ago
0
Gradient clipping not working for llama2_70b_lora benchmark
#723
michal2409
opened
3 months ago
1
Alternative method for downloading Llama2 70b
#722
tianmu-li
opened
3 months ago
0
[Stable Diffusion] VAE Moments to image outputs whited out image.
#721
entrpn
opened
3 months ago
1
where is the definition of mlperf_logging
#720
liuxiaoxiao1121
closed
3 months ago
1
OCI runtime create failed
#719
gorleramyasri
opened
3 months ago
0
unable to find image 'mlperf/object_detection'
#718
gorleramyasri
closed
3 months ago
0
Bump gradio from 3.11 to 4.19.2 in /stable_diffusion
#717
dependabot[bot]
opened
4 months ago
1
Add v4.0 suite on Readme
#716
nv-rborkar
closed
3 months ago
1
Bump urllib3 from 1.22 to 1.26.18 in /retired_benchmarks/transformer/tensorflow
#715
dependabot[bot]
opened
4 months ago
1
Bump ip from 1.1.5 to 1.1.9 in /retired_benchmarks/minigo/tensorflow/minigo/oneoffs/joseki
#714
dependabot[bot]
opened
4 months ago
1
Bump axios from 0.19.0 to 0.28.0 in /retired_benchmarks/minigo/tensorflow/minigo/oneoffs/joseki
#713
dependabot[bot]
opened
4 months ago
1
Change dataset download scripts to use Cloudflare buckets directly
#712
morphine00
closed
3 months ago
5
Vulnerability patch: remove joseki from minigo legacy benchmark
#711
pgmpablo157321
closed
4 months ago
1
Potential private information leak in retired benchmark
#710
pgmpablo157321
closed
4 months ago
0
Bump follow-redirects and axios in /retired_benchmarks/minigo/tensorflow/minigo/oneoffs/joseki
#709
dependabot[bot]
opened
4 months ago
1
Bump axios from 0.19.0 to 1.6.0 in /retired_benchmarks/minigo/tensorflow/minigo/oneoffs/joseki
#708
dependabot[bot]
closed
4 months ago
2
Bump gradio from 3.11 to 4.11.0 in /stable_diffusion
#707
dependabot[bot]
closed
4 months ago
2
Bump pillow from 5.2.0 to 10.2.0 in /retired_benchmarks/ssd-v1
#706
dependabot[bot]
opened
4 months ago
1
Bump werkzeug from 0.14.1 to 2.3.8 in /retired_benchmarks/transformer/tensorflow
#705
dependabot[bot]
opened
4 months ago
1
Bump fsevents from 1.2.9 to 1.2.13 in /retired_benchmarks/minigo/tensorflow/minigo/oneoffs/joseki
#704
dependabot[bot]
opened
4 months ago
1
Bump transformers from 4.19.2 to 4.36.0 in /stable_diffusion
#703
dependabot[bot]
opened
4 months ago
1
[SD] v4.0 cleanup and bug fixes
#702
ahmadki
closed
3 months ago
1
Update S3 download instructions
#701
nathanw-mlc
closed
3 months ago
1
[GNN] Reference implementation for GNN node classification
#700
LiSu
closed
3 months ago
8
Next