Closed yassine9666 closed 2 months ago
I want to train the model on another urban dataset. One folder should contain the segmentation masks and the other the images, right? Also, do I need the caption.json for my new dataset? Thank you :)
Hi @yassine9666, yes, you would need the captions for training. You could use a VL-captioning model, e.g., BLIP or LLaVA, to generate them :)
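In case it helps, here is a rough sketch of how the captions could be generated with BLIP through Hugging Face `transformers`. It is not this repo's own tooling. The layout of `caption.json` is an assumption (a flat mapping from image file name to caption), so please check it against the caption file shipped with the original dataset and adapt it. The helper names `make_blip_captioner` and `build_caption_file`, the checkpoint `Salesforce/blip-image-captioning-base`, and the 30-token cap are choices made for illustration.

```python
import json
from pathlib import Path
from typing import Callable, Dict

# Assumed set of image extensions; extend if your dataset uses others.
IMAGE_EXTS = {".jpg", ".jpeg", ".png"}


def make_blip_captioner(model_name: str = "Salesforce/blip-image-captioning-base") -> Callable[[Path], str]:
    """Return a function that captions one image file with BLIP."""
    # Imported lazily so the rest of the script works without these packages.
    from PIL import Image
    from transformers import BlipForConditionalGeneration, BlipProcessor

    processor = BlipProcessor.from_pretrained(model_name)
    model = BlipForConditionalGeneration.from_pretrained(model_name)

    def caption(path: Path) -> str:
        image = Image.open(path).convert("RGB")
        inputs = processor(images=image, return_tensors="pt")
        output_ids = model.generate(**inputs, max_new_tokens=30)
        return processor.decode(output_ids[0], skip_special_tokens=True).strip()

    return caption


def build_caption_file(image_dir: str, out_path: str, caption_fn: Callable[[Path], str]) -> Dict[str, str]:
    """Caption every image in image_dir and write the result as JSON.

    ASSUMPTION: the output is {"<file name>": "<caption>", ...}; the format
    the training code expects may differ.
    """
    captions = {}
    for path in sorted(Path(image_dir).iterdir()):
        if path.suffix.lower() in IMAGE_EXTS:
            captions[path.name] = caption_fn(path)
    Path(out_path).write_text(json.dumps(captions, indent=2), encoding="utf-8")
    return captions
```

You would call it as `build_caption_file("images", "caption.json", make_blip_captioner())`, where `images` is a placeholder for your image folder. The captioner is passed in as a function, so swapping BLIP for LLaVA only means replacing `make_blip_captioner`.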