This PR improves the performance of image preprocessing for vision models. Images are commonly stored in either NCHW or NHWC layout, and NCHW is generally better suited to GPU-based computation. Performance is improved further by binding the preprocessing for loops to GPU hardware threads so that loop iterations run in parallel.
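To illustrate the two ideas, here is a minimal sketch in plain Python; it is not the code in this PR. It assumes the preprocessing reorders an NHWC buffer into NCHW, which the description does not state explicitly, and the function name `nhwc_to_nchw_flat` is hypothetical. The single loop over `tid` stands in for thread binding: every iteration writes exactly one output element and is independent of the others, so on a GPU each `tid` could be handled by one hardware thread.

```python
def nhwc_to_nchw_flat(src, n, h, w, c):
    """Reorder a flat NHWC buffer into a flat NCHW buffer (illustrative only)."""
    total = n * c * h * w
    if len(src) != total:
        raise ValueError("src length does not match n*h*w*c")
    dst = [0] * total
    # Each `tid` plays the role of one GPU thread: it owns one output element.
    for tid in range(total):
        # Decompose the flat NCHW output index into (b, ch, y, x).
        x = tid % w
        y = (tid // w) % h
        ch = (tid // (w * h)) % c
        b = tid // (w * h * c)
        # Read the same logical element from its NHWC position.
        dst[tid] = src[((b * h + y) * w + x) * c + ch]
    return dst


# One 2x2 image with 3 interleaved channels (NHWC order).
image_nhwc = list(range(12))
image_nchw = nhwc_to_nchw_flat(image_nhwc, n=1, h=2, w=2, c=3)
print(image_nchw)  # [0, 3, 6, 9, 1, 4, 7, 10, 2, 5, 8, 11]
```

Because consecutive `tid` values write consecutive output addresses, this mapping would give neighbouring threads adjacent writes, which is one reason a flat output index is a common choice for this kind of kernel.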