facebookincubator / AITemplate

AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.
Apache License 2.0
4.54k stars 363 forks source link

Add reciprocal operator #1023

Closed muchulee8 closed 1 month ago

muchulee8 commented 1 month ago

Summary: Add reciprocal

Differential Revision: D62000543

facebook-github-bot commented 1 month ago

This pull request was exported from Phabricator. Differential Revision: D62000543

facebook-github-bot commented 1 month ago

This pull request was exported from Phabricator. Differential Revision: D62000543

facebook-github-bot commented 1 month ago

This pull request has been merged in facebookincubator/AITemplate@bfb1dc240b6cafcd7f10f9c9c6dae1635248b2d8.