This pull request introduces a significant performance improvement by offloading the resize operation in IOAdapter and the compute_weight operation in FlowFormer++ to CUDA, whenever CUDA is available. This change aims to reduce CPU usage substantially and leverage GPU acceleration for enhanced processing speed.
Changes
Modified the resize method in IOAdapter to detect CUDA availability and execute on GPU when possible.
Updated FlowFormer++'s compute_weight function to perform computations on CUDA instead of the CPU.
Summary
This pull request introduces a significant performance improvement by offloading the
resize
operation in IOAdapter and thecompute_weight
operation in FlowFormer++ to CUDA, whenever CUDA is available. This change aims to reduce CPU usage substantially and leverage GPU acceleration for enhanced processing speed.Changes
resize
method in IOAdapter to detect CUDA availability and execute on GPU when possible.compute_weight
function to perform computations on CUDA instead of the CPU.