Closed Wong4j closed 3 months ago
您好,我们已经收到了您的问题,会安排技术人员尽快解答您的问题,请耐心等待。请您再次检查是否提供了清晰的问题描述、复现代码、环境&版本、报错信息等。同时,您也可以通过查看官网API文档、常见问题、历史Issue、AI社区来寻求解答。祝您生活愉快~
Hi! We've received your issue and please be patient to get responded. We will arrange technicians to answer your questions as soon as possible. Please make sure that you have posted enough message to demo your request. You may also check out the API,FAQ,Github Issue and AI community to get the answer.Have a nice day!
There's a related issue that was opened last year and remains unresolved. (https://github.com/PaddlePaddle/Paddle/issues/32927)
This bug can be reproduced with my code by setting EPOCH_NUM = 100
and persistent_workers=False
.
persistent_workers=True
还不是很稳定,我们还在优化这块的逻辑,根据你当前的背景,如果只需要训练一定的steps数,可以尝试先把dataset的__len__
设置为steps * batch_size来做训练的中止,当前DataLoader训练中途做break有可能发生资源没有释放
已排期,Q2内修复~
已排期,Q2内修复~
请问是否有更新?
@heavengate said this task will be targeted in 23 Q3
@heavengate said this task will be targeted in 23 Q3
@onecatcn done?
@heavengate said this task will be targeted in 23 Q3
@onecatcn done?
not yet
@xysheng-baidu sheng will investigate the issue
Same here
Same here
The problem has not been solved yet, we will deal with it as soon as possible.
感谢修复,本地测试develop没问题。
bug描述 Describe the Bug
As a benchmark, I only need to train a few steps per epoch. So, I add a
break
in the loop. For example:It works fine if I set
persistent_workers=False
. But after settingpersistent_workers=True
, I got this error:Here is the complete code to reproduce:
其他补充信息 Additional Supplementary Information
No response