Is there any document for performance benchmark result vs pytorch2.1 compile mode?

microsoft / antares

Antares: an automatic engine for multi-platform kernel generation and optimization. Supporting CPU, CUDA, ROCm, DirectX12, GraphCore, SYCL for CPU/GPU, OpenCL for AMD/NVIDIA, Android CPU/GPU backends.

Other

444 stars 45 forks source link

Hi @fsword73. Thanks for your questions.

First, Pytorch 2.1 compile mode & Antares don't have any conflicts between each other which allows users to enable both to maximize everything, so I'm confused of the context to "move this software stack".

For benchmarking, your suggestion is great, can you share some concrete model repos that are optimized by Pytorch 2.1 compile mode? We didn't keep track on Pytorch 2.1 compile mode in the past. To avoid any unfairness of comparison, we'll use your suggested repo to do benchmarking.

microsoft / antares

Is there any document for performance benchmark result vs pytorch2.1 compile mode? #377