microsoft / unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
https://aka.ms/GeneralAI
MIT License
19.11k stars 2.44k forks source link

train text diffuser on customized dataset #1408

Open lwb2099 opened 6 months ago

lwb2099 commented 6 months ago

Describe Model I am using textdiffuser: Hi, I am training textdiffuser using my customized dataset, and I wonder how to build segmentation mask information. It seems that there is no code for generating segmentation mask from raw image. Can you provide the code fro preparation? or should we use UNet for generating segmentation mask.

JingyeChen commented 6 months ago

https://github.com/microsoft/unilm/blob/master/textdiffuser/inference.py#L434C33-L434C33

simajiucai commented 1 month ago

https://github.com/microsoft/unilm/blob/master/textdiffuser/inference.py#L434C33-L434C33 in the code, the segmentation mask is generated by a user-defined prompt. but how can we build the segmentation mask from raw image? @JingyeChen