PaddlePaddle / Paddle

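PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (PaddlePaddle core framework: high-performance single-machine and distributed training, and cross-platform deployment, for deep learning and machine learning)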
http://www.paddlepaddle.org/
Apache License 2.0
21.63k stars 5.44k forks

fix gpubox training #63905

Open danleifeng opened 1 week ago

danleifeng commented 1 week ago

PR Category

Parameter Server

PR Types

Bug fixes

Description

fix gpubox training:

  1. fix several bugs in gpubox training (using the latest develop version and Python 3.10)
  2. fix several bugs that affected the correctness of gpubox accuracy
  3. add set_date for ps load model and pull sparse (same as pslib-gpups)
  4. support so_parser training
  5. add trainer cache to improve gpubox training performance

Pcard-83349

paddle-bot[bot] commented 1 week ago

Your PR has been submitted. Thanks for your contribution to this open-source project! Please wait for the automated CI test results first. See the Paddle CI Manual for details.

paddle-ci-bot[bot] commented 3 days ago

Sorry to inform you that the CI runs for 39ca857 passed more than 7 days ago. To prevent PR conflicts, you need to re-run all CIs manually.