intel / torch-xpu-ops

Apache License 2.0
15 stars 8 forks source link

upsample_bilinear2d fast kernel path support #427

Open majing921201 opened 4 weeks ago

majing921201 commented 4 weeks ago

CUDA add one kernel for performance optimization to support partial channel last case. https://github.com/pytorch/pytorch/blob/df85f34a14dd30f784418624b05bd52b12ab8b0b/aten/src/ATen/native/cuda/UpSampleBilinear2d.cu#L290

performance enhancement is not in 2.5 scope

dvrogozh commented 3 weeks ago

Is this issue to track something beyond what's being currently done in https://github.com/intel/torch-xpu-ops/pull/422?

chuanqi129 commented 6 days ago

@majing921201 please help to double check it. If this issue have been done, please close it