Open SwEngine opened 1 year ago
It should be an interval of 1s, causing the GPU frequency to drop
I tried your first recommendation, and it works! Thank you!
For who want to try to lock the gpu frequency of jetson devices, commands are given below: $ sudo nvpmodel -m 0 $ sudo jetson_clocks
Oh, that's great
When I provide images in a loop without any delay, the processing time for yolov7-face or yolov8-face is short. However, when I feed the images to the detection function one by one, introducing a 1-second time interval between each photo, the processing time becomes longer. What might be causing this issue?
Here are the processing times for images in a loop:
And here are the processing times with 1-second intervals between images:
Upon analyzing the detect function, I've identified that the following line is taking a long time: CHECK(cudaMemcpyAsync(decode_ptr_host[i],decode_ptr_device,sizeof(float)(1+MAX_OBJECTSNUM_BOX_ELEMENT),cudaMemcpyDeviceToHost,stream));
What could be the issue and what can be the solution? CudaMemCpy is slower when images are given one by one. How can I solve this?