@liu-jc thanks for opening this, I could reproduce it. Looks like the culprit is
dataloader_num_workers: 11
If I set instead
dataloader_num_workers: 1
then everything runs fine. Maybe one check we should do internally is that the number of worker processes set does not exceed the number of datasets provided.
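A minimal sketch of the kind of check suggested above, assuming the training script knows both the configured worker count and the number of datasets before it builds the data loader. The helper name `cap_num_workers` and its arguments are hypothetical, made up for illustration, and are not taken from the repository or from the actual fix:

```python
import logging

logger = logging.getLogger(__name__)


def cap_num_workers(requested_workers: int, num_datasets: int) -> int:
    """Hypothetical helper: never use more workers than there are datasets."""
    if requested_workers > num_datasets:
        logger.warning(
            "dataloader_num_workers=%d exceeds the number of datasets (%d); "
            "using %d worker(s) instead.",
            requested_workers,
            num_datasets,
            num_datasets,
        )
        return num_datasets
    return requested_workers
```

With the configuration from this report (one dataset, `dataloader_num_workers: 11`), such a check would reduce the worker count to 1, which matches the setting that runs fine.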
Hi @lostella,
Thanks for the quick reply! I also just found that it works fine with more datasets. I suspected a num_workers problem but hadn't tried reducing it. Thanks for confirming the solution.
@liu-jc could you confirm that #157 is the required fix? Thanks!
Hi @lostella,
I tested it. It works with your fix :)
Bug report checklist
Describe the bug
When I put a single dataset in the config file like the following:
I get a ValueError:
Basically, this happens because probs is an empty list: probs: [], iterables: []. I am not sure why it would be empty. I think this might be a bug. Has anyone else faced the same issue?
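One plausible way to end up with empty lists, sketched under an assumption that this report does not confirm: the datasets and their sampling probabilities are split across data loader workers with a strided slice, so each worker only keeps every `num_workers`-th entry. The function `shard_for_worker` is a hypothetical name used for illustration only:

```python
from itertools import islice


def shard_for_worker(items, worker_id, num_workers):
    """Assumed sharding scheme: worker i keeps items i, i + n, i + 2n, ..."""
    return list(islice(items, worker_id, None, num_workers))


datasets = ["dataset_0"]  # a single dataset, as in this report
probabilities = [1.0]
num_workers = 11          # dataloader_num_workers from the config

# Map each worker id to the (iterables, probs) pair it would receive.
shards = {
    worker_id: (
        shard_for_worker(datasets, worker_id, num_workers),
        shard_for_worker(probabilities, worker_id, num_workers),
    )
    for worker_id in range(num_workers)
}

# Workers whose share of the probabilities is empty.
empty_workers = [w for w, (_, probs) in shards.items() if not probs]
```

Under this assumption only worker 0 receives the dataset, and every other worker sees `probs: []` and `iterables: []`, which is consistent with the error going away when the number of workers does not exceed the number of datasets.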
Expected behavior
I think it should run smoothly.
To reproduce
Full script:
Environment description
Operating system:
Python version: Python 3.11.5
PyTorch version: 2.3.1+cu121
HuggingFace transformers version: 4.41.2
HuggingFace accelerate version: 0.30.1