Open Epliz opened 1 year ago
@JehandadKhan could we provide some way of on-site tuning for the user?
Would it be possible for you to run the benchmark to get a reference? It takes less than 20 minutes to run.
@Epliz Apologies for the lack of response. Can you please re-test with latest ROCm 6.1.2 and check if issue occurs? Thanks!
Hi,
Just to check if I set up my machine with a MI100 GPU correctly I ran the "AI Benchmark" from https://ai-benchmark.com/ranking_deeplearning_detailed.html . The inference speed is pretty good, but the training one is for some sub-benchmarks quite far from where I would imagine it could be.
Installation instructions are at https://ai-benchmark.com/alpha.html .
Results I get:
I installed the miopen kernels for gfx908 through the packaging manager, I am on Ubuntu 22.04.2 LTS, rocm 5.4.3, tensorflow 2.11 .
I would appreciate if you could indicate that it is the performance I should get as of now, or with some tuning it could be better. Given the training scores are not that great compared to the inference ones, I feel like there is something wrong and it should be better.
Best regards, Epliz