microsoft / unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
https://aka.ms/GeneralAI
MIT License
20.12k stars 2.55k forks

train text diffuser on customized dataset #1408

Open lwb2099 opened 10 months ago

lwb2099 commented 10 months ago

Model I am using: TextDiffuser. Hi, I am training TextDiffuser on a customized dataset, and I wonder how to build the segmentation mask information. There seems to be no code for generating segmentation masks from raw images. Can you provide the code for this preparation step? Or should we use a U-Net to generate the segmentation masks?

JingyeChen commented 10 months ago

https://github.com/microsoft/unilm/blob/master/textdiffuser/inference.py#L434C33-L434C33

simajiucai commented 6 months ago

https://github.com/microsoft/unilm/blob/master/textdiffuser/inference.py#L434C33-L434C33 In that code, the segmentation mask is generated from a user-defined prompt. But how can we build the segmentation mask from a raw image? @JingyeChen
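
To make the question concrete, here is a minimal sketch of one possible workaround, not the authors' method: if character-level bounding boxes for a raw image are already available (for example from an external OCR tool), they can be rasterized into a class-index mask. The function name `boxes_to_mask`, the `ALPHABET` string, and the index convention (0 for background, characters starting at 1) are all hypothetical and would need to be checked against whatever mask format the TextDiffuser training code actually expects.

```python
import numpy as np

# Hypothetical alphabet: index 0 is reserved for background, characters start at 1.
# This is an assumption, not the label set used by TextDiffuser.
ALPHABET = "abcdefghijklmnopqrstuvwxyz0123456789"
CHAR_TO_INDEX = {ch: i + 1 for i, ch in enumerate(ALPHABET)}


def boxes_to_mask(height, width, char_boxes):
    """Rasterize character boxes into a (height, width) class-index mask.

    char_boxes: iterable of (char, x0, y0, x1, y1) in pixel coordinates,
    with x1 and y1 exclusive. Characters outside ALPHABET are skipped.
    """
    mask = np.zeros((height, width), dtype=np.uint8)
    for ch, x0, y0, x1, y1 in char_boxes:
        idx = CHAR_TO_INDEX.get(ch.lower())
        if idx is None:
            continue  # unknown character: leave as background
        # Clip the box to the image bounds before filling.
        x0, x1 = max(0, x0), min(width, x1)
        y0, y1 = max(0, y0), min(height, y1)
        if x0 < x1 and y0 < y1:
            mask[y0:y1, x0:x1] = idx
    return mask


if __name__ == "__main__":
    # Two characters on a tiny 4x6 canvas; the second box extends past the edge.
    demo = boxes_to_mask(4, 6, [("a", 0, 0, 2, 2), ("B", 4, 1, 10, 3)])
    print(demo)
```

Box-shaped masks like this are only a coarse approximation of per-pixel character shapes, so whether they are good enough for training remains an open question here.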