Open 631068264 opened 1 year ago
Hi 631068264, how did you solve this bug?
Hi all,
I had the same issue as yours and eventually pinpointed the problem in one of my custom datasets. The __getitem__
function of my custom dataset class reads a piece of shared content from the SAME file as part of the prompt, which looks like:
with open('path/to/a/single/system/message/file, 'r') as f:
instruction = f.read()
This read
operation can cause race and deadlock in high-concurrency scenarios, which is one of the possible causes to this issue. Writing the content as a static string instead of reading it from file solved my problem. Hope it be of help to yours :)
Describe the bug Follow this doc , prepare finetune data
cc_sbu_align.zip just choice 2 to 14 jpgto train and translate the caption to Chinese.
error log