Open csyourui opened 1 year ago
Hi, Is it working well without setting this?
Thank you for your reply. It works well if I do not set preprocessing_num_workers
. However, I am just curious about why it does not work when this parameter is added.
Can you reproduce this problem? Or is it just an issue with my environment?
遇到同样问题,加载数据时,没办法并行处理加载
same question
遇到同样问题,加载数据时,没办法并行处理加载
FYI: We've located the bug, and dev team needs to perform a small-scale refactoring to fix. We will do ASAP and sorry for the inconvenience 🙏
遇到同样问题,加载数据时,没办法并行处理加载
FYI: Bug fixed, please see https://github.com/OptimalScale/LMFlow/pull/845 🤗
Describe the bug tokenizer map in
hf_decoder_model
use multipreprocessing_num_workers
will returnTypeError: cannot pickle 'torch._C._distributed_c10d.ProcessGroup' object
To Reproduce Steps to reproduce the behavior:
add
--preprocessing_num_workers 20 \
toscripts/run_finetune.sh
just start:
Screenshots
package versions: