PaddlePaddle / Paddle

PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)
http://www.paddlepaddle.org/
Apache License 2.0
21.66k stars 5.44k forks source link

[XPU] Add rms_norm and rms_norm_grad op #63989

Open cqulilujia opened 2 weeks ago

cqulilujia commented 2 weeks ago

PR Category

Custom Device

PR Types

New features

Description

  1. 添加XPU rms_norm和rms_norm_grad算子
  2. 在PR #60848 中,添加了rms_norm_grad的GPU反向算子,但未支持norm_weight_bias输入,而XPU支持weight_bias输入,因此同时修改了rms_norm_grad相关的接口以支持norm_weight_bias
paddle-bot[bot] commented 2 weeks ago

你的PR提交成功,感谢你对开源项目的贡献! 请关注后续CI自动化测试结果,详情请参考Paddle-CI手册。 Your PR has been submitted. Thanks for your contribution! Please wait for the result of CI firstly. See Paddle CI Manual for details.

paddle-ci-bot[bot] commented 1 week ago

Sorry to inform you that 8aade47's CIs have passed for more than 7 days. To prevent PR conflicts, you need to re-run all CIs manually.