GPU: RX6400 (I cannot find this model in all given GPU options)
I was trying to use hipGraph instead of hipStream to accelerate some computation. I find that the difference between performance using stream and graph is minor. I've tested the same program in a cuda manner using Nvidia's GPU and there was significant improvement, so I know for certain that my program was correctly written. My program run on Rocm 5.6.0, then I upgraded it to 5.7.0 and there was no difference in terms of performance. I wonder in which version of Rocm there is some optimization on hipGraph. Also, since I'm using a relatively outdated amd GPU - RX6400, I wonder if hipGraph can only have siginificant influence on some certain models.
Operating System
Ubuntu 22.04.3 LTS(Jammy Jellyfish)
CPU
11th Gen Intel(R) Core(TM) i5-11400
GPU
AMD Radeon VII
ROCm Version
ROCm 5.7.0
ROCm Component
clr, HIP
Steps to Reproduce
I wrote two simple programs to test performance. One uses stream, and another uses graph. I made them txt because github doesn't allow me to upload cpp files. Simply change them to cpp, compile and run the two programs to see the output.
hip_only_stream.txthip_using_graph.txt
(Optional for Linux users) Output of /opt/rocm/bin/rocminfo --support
Problem Description
GPU: RX6400 (I cannot find this model in all given GPU options)
I was trying to use hipGraph instead of hipStream to accelerate some computation. I find that the difference between performance using stream and graph is minor. I've tested the same program in a cuda manner using Nvidia's GPU and there was significant improvement, so I know for certain that my program was correctly written. My program run on Rocm 5.6.0, then I upgraded it to 5.7.0 and there was no difference in terms of performance. I wonder in which version of Rocm there is some optimization on hipGraph. Also, since I'm using a relatively outdated amd GPU - RX6400, I wonder if hipGraph can only have siginificant influence on some certain models.
Operating System
Ubuntu 22.04.3 LTS(Jammy Jellyfish)
CPU
11th Gen Intel(R) Core(TM) i5-11400
GPU
AMD Radeon VII
ROCm Version
ROCm 5.7.0
ROCm Component
clr, HIP
Steps to Reproduce
I wrote two simple programs to test performance. One uses stream, and another uses graph. I made them txt because github doesn't allow me to upload cpp files. Simply change them to cpp, compile and run the two programs to see the output. hip_only_stream.txt hip_using_graph.txt
(Optional for Linux users) Output of /opt/rocm/bin/rocminfo --support
No response
Additional Information
No response