Open zjgarvey opened 3 months ago
@zjgarvey added https://github.com/llvm/torch-mlir/issues/3647 to some of the models as we need that along with https://github.com/iree-org/iree/issues/18229
cc @lialan as well. Can you co-ordinate with Zach to track CPU codegen issues.
Also adding https://github.com/llvm/torch-mlir/issues/3651 that needs to be done for supporting broad range of models.
TOTAL TESTS = 18 | Stage | # Passing | % of Total | % of Attempted |
---|---|---|---|---|
Setup | 18 | 100.0% | 100.0% | |
IREE Compilation | 18 | 100.0% | 100.0% | |
Gold Inference | 18 | 100.0% | 100.0% | |
IREE Inference Invocation | 18 | 100.0% | 100.0% | |
Inference Comparison (PASS) | 16 | 88.9% | 88.9% |
TOTAL TESTS = 18 | Stage | # Failed at Stage | % of Total |
---|---|---|---|
Setup | 0 | 0.0% | |
IREE Compilation | 0 | 0.0% | |
Gold Inference | 0 | 0.0% | |
IREE Inference Invocation | 0 | 0.0% | |
Inference Comparison | 2 | 11.1% |
Test was run with the following arguments: Namespace(device='hip://1', backend='rocm', iree_compile_args=['iree-hip-target=gfx942'], mode='cl-onnx-iree', torchtolinalg=False, stages=None, skip_stages=None, benchmark=True, load_inputs=False, groups='all', testfilter='migx', testsfile=None, tolerance=None, verbose=True, rundirectory='test-run', no_artifacts=False, cleanup='0', report=True, report_file='bert-bench-11-5.md', get_metadata=False)
Test | Exit Status | Mean Benchmark Time (ms) | Notes |
---|---|---|---|
migx_bench_bert-large-uncased_16_128 | PASS | 31.207363539631814 | |
migx_bench_bert-large-uncased_16_256 | PASS | 55.50303652834816 | |
migx_bench_bert-large-uncased_16_384 | Numerics | 73.14148765678208 | |
migx_bench_bert-large-uncased_1_128 | PASS | 13.602430612827915 | |
migx_bench_bert-large-uncased_1_256 | PASS | 14.240951777125396 | |
migx_bench_bert-large-uncased_1_384 | PASS | 19.958815195908148 | |
migx_bench_bert-large-uncased_2_128 | PASS | 13.128591842236526 | |
migx_bench_bert-large-uncased_2_256 | PASS | 13.671312931608528 | |
migx_bench_bert-large-uncased_2_384 | PASS | 21.517712740472167 | |
migx_bench_bert-large-uncased_32_128 | PASS | 62.9078254498767 | |
migx_bench_bert-large-uncased_32_256 | PASS | 101.5021381234484 | |
migx_bench_bert-large-uncased_32_384 | Numerics | 143.94597491870323 | |
migx_bench_bert-large-uncased_4_128 | PASS | 14.44128212411286 | |
migx_bench_bert-large-uncased_4_256 | PASS | 17.125056890238607 | |
migx_bench_bert-large-uncased_4_384 | PASS | 26.636395024326745 | |
migx_bench_bert-large-uncased_8_128 | PASS | 18.925565496288442 | |
migx_bench_bert-large-uncased_8_256 | PASS | 27.419584516722423 | |
migx_bench_bert-large-uncased_8_384 | PASS | 41.23994989284113 |
This issue will be used to track compilation failures for migraphx models on CPU and GPU. Compile failures for each model should have a link to an issue with a smaller reproducer in the notes column.
Notes:
migraphx_ORT__bert_base_cased_1
fails on CPU but passes on GPU. Other adjacent models fail for similar reasons on both. Very odd.migraphx_sdxl__unet__model
,migraphx_ORT__bert_large_uncased_1
because they cause a crash (likely OOM)CPU Status Table
The Following report was generated with IREE compiler version iree-org/iree@caacf6c8015b4344b2d9b4a82c2fddc015693831 Torch-mlir version llvm/torch-mlir@2665ed343b19713ba5c1c555b2366a93de8b9d2b
Passing Summary
Fail Summary
Test Run Detail
Test was run with the following arguments: Namespace(device='local-task', backend='llvm-cpu', iree_compile_args=None, mode='cl-onnx-iree', torchtolinalg=True, stages=None, skip_stages=None, benchmark=False, load_inputs=False, groups='all', test_filter='migraphx', testsfile=None, tolerance=None, verbose=True, rundirectory='test-run', no_artifacts=False, cleanup='0', report=True, report_file='mi_10_10.md')
OLD STATUS (Will update and migrate issues to current table)
GPU Status Table
last generated with pip installed iree tools at version
Summary
Test Run Detail
Test was run with the following arguments: Namespace(device='hip://1', backend='rocm', iree_compile_args=['iree-hip-target=gfx942'], mode='onnx-iree', torchtolinalg=False, stages=None, skip_stages=None, load_inputs=False, groups='all', test_filter='migraphx', tolerance=None, verbose=True, rundirectory='test-run', no_artifacts=False, report=True, report_file='9_3_migraphx.md')
Note: GPU missing sd model (runs out of memory and kills the test). Probably happening during native inference, so it might need some looking into.
Performance data with iree-benchmark-module on GPU
Summary
Test Run Detail
Test was run with the following arguments: Namespace(device='local-task', backend='llvm-cpu', iree_compile_args=None, mode='cl-onnx-iree', torchtolinalg=False, stages=None, skip_stages=None, benchmark=True, load_inputs=False, groups='all', test_filter='migraphx', testsfile=None, tolerance=None, verbose=True, rundirectory='test-run', no_artifacts=False, cleanup='0', report=True, report_file='report.md')