about the layoutDM+ in paper "Retrieval-Augmented Layout Transformer for Content-Aware Layout Generation"

CyberAgentAILab / layout-dm

LayoutDM: Discrete Diffusion Model for Controllable Layout Generation [Inoue+, CVPR2023]

https://cyberagentailab.github.io/layout-dm

Apache License 2.0

210 stars 23 forks source link

about the layoutDM+ in paper "Retrieval-Augmented Layout Transformer for Content-Aware Layout Generation" #32

Closed Usernamezhx closed 8 months ago

Usernamezhx commented 8 months ago

Thank you for your contribution to the layout work. I read the paper "Retrieval-Augmented Layout Transformer". it point that: LayoutDM† [19] is a discrete state-space diffusion model that can handle many constrained generation tasks. Since the model is originally designed for content-agnostic layout generation, we extend the model to accept an input image. Can you provide some ideas about the layoutDM+. thanks very much.

naoto0804 commented 8 months ago

Hi, thank you for your interest in our projects!

LayoutDM+ inserts the cross-attention layer (typical one in Transformer decoder) into the building block to attend flattened image feature maps.

Usernamezhx commented 8 months ago

thanks for your reply.