issues
search
mlcommons
/
training
Reference implementations of MLPerf™ training benchmarks
https://mlcommons.org/en/groups/training
Apache License 2.0
1.57k
stars
548
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
[GNN] Reference implementation for GNN node classification
#700
LiSu
closed
3 months ago
8
[SD] log number of samples instead of number of iterations
#699
ahmadki
closed
4 months ago
3
adding initial code drop for llm finetune
#698
itayhubara
closed
3 months ago
2
[Unet3d] - Add infinite data loader to align epochs->samples transition
#697
mmarcinkiewicz
opened
5 months ago
1
Add MLCube implementation for Stable Diffusion
#696
davidjurado
opened
5 months ago
1
Add MLCube implementation for 3D Unet
#695
davidjurado
opened
6 months ago
1
run stable diffusion see no space left on device error
#694
gaowayne
opened
6 months ago
2
Unable to download tar file in the mlcommons-training-wg-s3 S3 Bucket
#693
ajscalers
opened
6 months ago
0
updated with code to use our instrumentation (some more README update…
#692
rajveerb
closed
7 months ago
1
error run the rnn speech workload, failed to process data after enter docker
#691
gaowayne
opened
7 months ago
4
failed to build object_detection container with below error on FedoraOS37
#690
gaowayne
opened
7 months ago
3
docker run error for image_segmentation/pytorch test following the guide
#689
gaowayne
opened
7 months ago
2
Command line options in bert training
#688
mahmoodn
opened
7 months ago
0
Stable diffusion training test failed at module 'cv2.dnn' has no attribute 'DictValue'
#687
billcsm
closed
4 months ago
2
MLCube implementation for ResNet
#686
davidjurado
opened
8 months ago
1
[SD] unified val file names
#685
ahmadki
closed
4 months ago
1
[UNET3D] Replace epochs with samples
#684
mmarcinkiewicz
opened
8 months ago
1
Add quick SSD demo
#683
davidjurado
closed
3 months ago
2
[SD] fixed number of training samples
#682
ahmadki
closed
4 months ago
1
[SD] a small indentation fix
#681
ahmadki
closed
9 months ago
1
Switch dataset locations from Google Drive to MLCommons Cloud
#680
nathanw-mlc
closed
4 months ago
9
How to run dlrm module with criteo_kaggle dataset?
#679
esharkwang
opened
11 months ago
0
Bump certifi from 2018.4.16 to 2023.7.22 in /retired_benchmarks/transformer/tensorflow
#678
dependabot[bot]
opened
11 months ago
1
[SD] Finalized the benchmark
#677
ahmadki
closed
11 months ago
2
Unable to run unit tests of distributed checkpointing in Megatron-LM
#676
MingjiHan99
opened
11 months ago
0
[SSD] Pinned fiftyone package
#675
ahmadki
closed
11 months ago
1
[LLM] Add S3 details to readme
#674
mikolajblaz
closed
11 months ago
3
does not have storage.objects.list access to the Google Cloud Storage bucket
#673
karpenko-p-n
opened
11 months ago
0
Bump semver and react-scripts in /retired_benchmarks/minigo/tensorflow/minigo/oneoffs/joseki
#672
dependabot[bot]
opened
11 months ago
1
[MaskRCNN bug] when MaskRCNN saves checkpoint after training, an error is reported
#671
Xiao-Yamin
opened
11 months ago
0
[MaskRCNN bug] make_data_loader() method should only return data_loaders[0] when training
#670
Xiao-Yamin
opened
11 months ago
0
AccessDeniedException: 403 does not have storage.objects.list access to the Google Cloud Storage bucket.
#669
zwang92
opened
11 months ago
0
Bump tough-cookie and react-scripts in /retired_benchmarks/minigo/tensorflow/minigo/oneoffs/joseki
#668
dependabot[bot]
opened
11 months ago
1
Bump scipy from 1.5.2 to 1.10.0 in /image_segmentation/pytorch
#667
dependabot[bot]
opened
11 months ago
1
Bump scipy from 1.0.1 to 1.10.0 in /retired_benchmarks/transformer/tensorflow
#666
dependabot[bot]
opened
11 months ago
1
Bump grpcio from 1.11.0 to 1.53.0 in /retired_benchmarks/transformer/tensorflow
#665
dependabot[bot]
opened
11 months ago
1
Bump semver and react-scripts in /retired_benchmarks/minigo/tensorflow/minigo/oneoffs/joseki
#664
dependabot[bot]
closed
11 months ago
2
Stable_diffusion: document embedding size from ViT-H into Unet
#663
matthew-frank
closed
11 months ago
2
Table summarizing benchmark suite
#662
TheKanter
opened
1 year ago
1
Added Stable Diffusion (SD) benchmark - Part 2
#661
ahmadki
closed
1 year ago
2
Fix tensorflow v1 compatibility for bert
#660
arjunsuresh
opened
1 year ago
1
【Bert】Unable to achieve accuracy of 0.72.
#659
BiduCui
closed
7 months ago
0
Bump gradio from 3.11 to 3.34.0 in /stable_diffusion
#658
dependabot[bot]
closed
4 months ago
2
Bump transformers from 4.19.2 to 4.30.0 in /stable_diffusion
#657
dependabot[bot]
closed
4 months ago
2
WIP: Add Stable Diffusion benchmark
#656
ahmadki
closed
1 year ago
1
[DLRM v2] How to modify the default training script of DLRM v2 to train the model with limited GPU memory
#655
JJingL
opened
1 year ago
1
SSD: exception during conversion of dataset to COCO format
#654
ukurkure
closed
4 months ago
4
Are gpt tokenizer model open-source?
#653
xyyintel
opened
1 year ago
1
readme updates
#652
anmolgupt
closed
1 year ago
7
Would be nice to have parameters counts for all models
#651
rakshithvasudev
opened
1 year ago
0
Previous
Next