Open zhangzef opened 2 weeks ago
Hi,
LLaVa does support batched generation, see here and here for example code snippets.
I also wonder why you are passing the images + text through the processor but then not using the
inputs
created?
thank you for your reply! it just the test code, but the second code will be blocking in the mapping processing
@zhangzef indeed, using the transformers version 4.41.2 also doesn't run for me, yet updating it to the latest 4.44.2 works. I will see what was wrong with the older version, doesn't seem to be related to LLaVa code per se as the code hasn't changed drastically for a long time
@zucchini-nlp actually worth it to add multiprocessing tests WDYT?
@ArthurZucker not sure I got you, do you mean adding a test with datasets
? Aren't we supposed to test that within datasets
repo?
FYI, I tried to get the same error by simple multiprocessing.Pool
but it works fine, and I didn't have time to dive into the issue yet
I meant to make sure our processors work with multiprocessing pools! Not necessarily dataset. Cool if that worked!
System Info
transformers
version: 4.41.2Who can help?
@ArthurZucker No response
Information
Tasks
examples
folder (such as GLUE/SQuAD, ...)Reproduction
When I used the llava processor for multiprocess preprocessing of my data set, the program seemed to get stuck. It paused at the beginning of the mapping phase, but when I switched the processor to CLIP it was able to map normally.
Code that works properly:
Code that doesn't work:
and for the detail discussion could see this issue https://github.com/huggingface/trl/issues/1964#issue-2484568153
Expected behavior
...