PaddlePaddle / Paddle

PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)
http://www.paddlepaddle.org/
Apache License 2.0
21.66k stars 5.44k forks source link

Opt rope #64043

Open Vvsmile opened 1 week ago

Vvsmile commented 1 week ago

PR Category

CINN

PR Types

Performance

Description

pcard-82035 Optimize the Rope subgraph with a dynamic shape and get the performance improvement from 221 us to 138 us, about 37% speedup.

f4f8e899ad76773d587a63acc651e3ab

paddle-bot[bot] commented 1 week ago

你的PR提交成功,感谢你对开源项目的贡献! 请关注后续CI自动化测试结果,详情请参考Paddle-CI手册。 Your PR has been submitted. Thanks for your contribution! Please wait for the result of CI firstly. See Paddle CI Manual for details.

paddle-ci-bot[bot] commented 4 days ago

Sorry to inform you that ee9ef40's CIs have passed for more than 7 days. To prevent PR conflicts, you need to re-run all CIs manually.