issues
search
mlcommons
/
training
Reference implementations of MLPerf™ training benchmarks
https://mlcommons.org/en/groups/training
Apache License 2.0
1.57k
stars
548
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Tag v3.0 code release
#650
nv-rborkar
opened
1 year ago
0
Bump requests from 2.18.4 to 2.31.0 in /retired_benchmarks/transformer/tensorflow
#649
dependabot[bot]
opened
1 year ago
1
[DLRM v2] Using the model for the inference reference implementation
#648
pgmpablo157321
opened
1 year ago
5
Update license header
#647
nathanw-mlc
closed
1 year ago
1
Regarding the issue of continuous memory growth during the training process
#646
Daming-wang
opened
1 year ago
0
added train_samples keyword for compliance check
#645
anmolgupt
closed
1 month ago
2
Update README.md
#644
arjunsuresh
closed
1 year ago
1
Image classification reference implementation is failing on Ubuntu 22.04
#643
arjunsuresh
opened
1 year ago
2
Bert pretrain script message "Could not find trained model in model_dir: /tmp/output/"
#642
mahmoodn
opened
1 year ago
5
Language model dataset preparation
#641
mahmoodn
closed
1 year ago
0
Update README.md
#640
anmolgupt
closed
1 year ago
3
Checkpointing DLRMv2
#639
mailvijayasingh
opened
1 year ago
3
Steps for language model
#638
mahmoodn
closed
1 year ago
0
Ask for the access to download the checkpoint of LLM
#637
JJingL
opened
1 year ago
2
[DLRM_DCNv2] Benchmark name in reference implementation
#636
janekl
closed
1 year ago
1
Does DLRM_v2 support H100?
#635
xyyintel
opened
1 year ago
2
The default training script of DLRM v2 does not reach the reported AUC.
#634
Kevin0624
opened
1 year ago
3
[DLRMv2] Update target AUC in README
#633
janekl
closed
1 year ago
2
MLCube integration with Bert
#632
davidjurado
opened
1 year ago
6
Summary table for benchmark suite
#631
TheKanter
opened
1 year ago
0
[GPT3] update megatron-LM reference
#630
ShriyaPalsamudram
closed
1 year ago
5
SSD benchmark with MLCube implementation
#629
davidjurado
closed
1 year ago
3
[DLRMv2_DCNv2] Update Criteo 1TB dataset download link
#628
janekl
closed
1 year ago
1
Bump minimist, mkdirp, loader-fs-cache and handlebars in /retired_benchmarks/minigo/tensorflow/minigo/oneoffs/joseki
#627
dependabot[bot]
closed
1 year ago
2
Ask for help about :'_OpNamespace' 'fbgemm' object has no attribute 'new_managed_tensor'
#626
sea-of-freedom
opened
1 year ago
1
UNET3D - Change DistributedSampler seed to be same across all workers
#625
lhovon
closed
1 year ago
4
Bump werkzeug from 0.14.1 to 2.2.3 in /retired_benchmarks/transformer/tensorflow
#624
dependabot[bot]
closed
1 year ago
2
Object_detection error "cannot import name '_C' from 'maskrcnn_benchmark'"
#623
mahmoodn
closed
1 year ago
1
[DLRMv2] Align optimizer parameters for embeddings and dense layers
#622
janekl
closed
1 year ago
5
[DLRMv2] Resolve benchmark name and use other constants for logging
#621
janekl
closed
1 year ago
1
Bump werkzeug from 0.14.1 to 0.15.5 in /retired_benchmarks/transformer/tensorflow
#620
dependabot[bot]
closed
1 year ago
2
Docker step fails in object_detection benchmark
#619
mahmoodn
closed
1 year ago
2
Segmentation preprocess skips from Case 210 until Case 299
#618
mahmoodn
closed
1 year ago
0
3d_unet paths
#617
mahmoodn
closed
1 year ago
0
[DCNV2] Add MLPerf logging
#616
janekl
closed
1 year ago
4
[DCNV2] Add MLPerf logging
#615
janekl
closed
1 year ago
3
Bump json5 from 1.0.1 to 1.0.2 in /retired_benchmarks/minigo/tensorflow/minigo/oneoffs/joseki
#614
dependabot[bot]
closed
1 year ago
2
test-cla
#613
samiwilf
closed
1 year ago
3
Add MLPerf DLRM v2 benchmark
#612
samiwilf
closed
1 year ago
2
Add MLPerf DLRM v2 benchmark
#611
samiwilf
closed
1 year ago
1
Bump express from 4.17.1 to 4.18.2 in /retired_benchmarks/minigo/tensorflow/minigo/oneoffs/joseki
#610
dependabot[bot]
closed
1 year ago
2
Try to update THC.h and python code for adjust PyTorch 1.13 version
#609
1pikachu
closed
1 year ago
1
Bump qs from 6.5.2 to 6.5.3 in /retired_benchmarks/minigo/tensorflow/minigo/oneoffs/joseki
#608
dependabot[bot]
closed
1 year ago
2
Bump decode-uri-component from 0.2.0 to 0.2.2 in /retired_benchmarks/minigo/tensorflow/minigo/oneoffs/joseki
#607
dependabot[bot]
closed
1 year ago
2
Bump certifi from 2018.4.16 to 2022.12.7 in /retired_benchmarks/transformer/tensorflow
#606
dependabot[bot]
closed
1 year ago
2
Fix for issue #432: R50 readme v1 -> v1.5
#605
itayhubara
closed
1 year ago
2
Does this open source code support multi-GPU training?
#604
yangzhipeng1108
opened
1 year ago
0
ResNet50 -> ResNet-50 in all docs
#603
matthew-frank
closed
1 year ago
1
Clean up object detection
#602
maanug-nv
closed
1 year ago
4
fix typo
#601
TheKanter
closed
1 year ago
1
Previous
Next