Closed bukejiyu closed 3 months ago
Thanks for your contribution!
Thank you for your submission! We really appreciate it. Like many open source projects, we ask that you sign our Contributor License Agreement before we can accept your contribution.
Attention: Patch coverage is 0%
with 282 lines
in your changes missing coverage. Please review.
Project coverage is 55.63%. Comparing base (
65e721e
) to head (b791375
). Report is 243 commits behind head on develop.
:umbrella: View full report in Codecov by Sentry.
:loudspeaker: Have feedback on the report? Share it here.
PR types
PR changes
Description
paddle inference_mode 集成xft cpu kernel 机器8463B 输入/输出 128/15 bs=1 静态图llama 测速 next_tokens: 100+ms 48线程 动态图llama 测速 next_tokens: 70+ms