-
Hello! We are attempting to perform AR translation from a different language font to English. However, we've encountered the following issue.
- adding alphabets instead of representing the input cha…
-
**Describe**
Model I am using: TextDiffuser, on a Windows machine with a GPU.
I'm wondering whether it's possible to run inference.py for the "text_to_image" model without training. I have already do…
-
**Describe**
Model I am using: TextDiffuser
I found that there are some index numbers starting with "50001" in the MARIO-LAION dataset, but I did not find the corresponding subfolder in the meta inf…
-
**Describe**
Model I am using: TextDiffuser
Hi, thanks for the great work. I'm trying to train the model on a portion of the MARIO-LAION image dataset (~50k images).
But currently the images generat…
-
**Describe**
Model I am using: TextDiffuser-2
I am running inference with TextDiffuser-2. My inference command:
`CUDA_VISIBLE_DEVICES=6 python inference_textdiffuser2_t2i_full.py \
--pr…
-
**Describe**
Model I am using (UniLM, MiniLM, LayoutLM ...):
I think we need evaluation code, like the code provided for TextDiffuser-1, to reproduce the results on the MARIOEval benchmark or other datasets.
-
**Describe**
I visualized the segmentation mask and the image pair. It seems that a pair sharing the same name (e.g., 00000/000000012.jpg and 00000/000000012/charseg.npy) does not match. The segm…
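
Before comparing pixel content, it may help to rule out a path-level mix-up. The sketch below only encodes the naming pattern from the example above (`00000/000000012.jpg` paired with `00000/000000012/charseg.npy`); the helper names `charseg_path_for` and `unpaired` are hypothetical and not part of the TextDiffuser code. It does not check whether the mask content matches the image, which would still need a visual overlay.

```python
from pathlib import PurePosixPath


def charseg_path_for(image_path: str) -> str:
    """Derive the expected mask path from an image path.

    Assumes the layout shown in the issue: the mask lives in a folder
    named after the image (without extension) and is called charseg.npy.
    """
    image = PurePosixPath(image_path)
    return str(image.with_suffix("") / "charseg.npy")


def unpaired(image_paths, charseg_paths):
    """Return the images whose expected charseg.npy is not in charseg_paths."""
    available = set(charseg_paths)
    return [p for p in image_paths if charseg_path_for(p) not in available]


if __name__ == "__main__":
    images = ["00000/000000012.jpg", "00000/000000013.jpg"]
    masks = ["00000/000000012/charseg.npy"]
    print(charseg_path_for(images[0]))
    print(unpaired(images, masks))
```

If every image has its expected mask file and the pairs still look wrong when overlaid, the mismatch is in the data itself, not in how the paths were joined.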
-
What is inpainting_mask in Zero-shot Inpainting? Should we mask raw_pil_image first, and will the model then inpaint the masked region? Thanks a lot!
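
For context, here is a minimal sketch of the generic mask-compositing convention used in mask-based inpainting: pixels where the mask is 1 are taken from the generated image, and pixels where it is 0 are kept from the original. This is an assumption about the usual convention, not a confirmed description of how TextDiffuser treats `inpainting_mask`; the function name `composite` and the nested-list image representation are illustrative only.

```python
def composite(original, generated, mask):
    """Blend two same-sized grayscale images given as nested lists.

    Assumed convention: mask value 1 means "use the generated pixel",
    mask value 0 means "keep the original pixel".
    """
    if not (len(original) == len(generated) == len(mask)):
        raise ValueError("images and mask must have the same height")
    result = []
    for o_row, g_row, m_row in zip(original, generated, mask):
        if not (len(o_row) == len(g_row) == len(m_row)):
            raise ValueError("images and mask must have the same width")
        result.append([g if m else o for o, g, m in zip(o_row, g_row, m_row)])
    return result


if __name__ == "__main__":
    original = [[10, 20], [30, 40]]
    generated = [[99, 98], [97, 96]]
    mask = [[0, 1], [0, 0]]  # only the top-right pixel is regenerated
    print(composite(original, generated, mask))  # [[10, 98], [30, 40]]
```

Under this convention the original image does not need to be blanked out beforehand, because the mask alone decides which pixels are replaced. Whether the repository follows the same polarity is the open question here.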
-
**Describe the bug**
The provided links for the metadata and URLs are invalid.
https://layoutlm.blob.core.windows.net/textdiffuser/laion-ocr.zip
https://layoutlm.blob.core.windows.net/textdiffuser/…
-
Hello, I am trying to use TextDiffuser-2. While training M1 for layout planning, I found that there is no flag for setting the type of loss, whereas in the paper the authors said such…