Fantasy-Studio / Paint-by-Example

Paint by Example: Exemplar-based Image Editing with Diffusion Models
https://arxiv.org/abs/2211.13227

Transformer mapper #14

Closed Teoge closed 1 year ago

Teoge commented 1 year ago

Hi, thanks for your excellent work.

I noticed that you use a transformer to map CLIP image embeddings to Stable Diffusion conditions: https://github.com/Fantasy-Studio/Paint-by-Example/blob/main/ldm/modules/encoders/modules.py#L144-L149 But the sequence dimension of the CLIP image embedding is 1, which means the attention in the transformer does nothing. I find this a little confusing. Is there a particular reason for it?

Thanks.

Fantasy-Studio commented 1 year ago

We appreciate your interest in our work. When the number of tokens is 1, a transformer block is effectively equivalent to 3 FC layers, since attention over a single token is trivial; this is why the paper describes the mapper as several FC layers rather than transformer blocks. In the beginning we explored using all 257 tokens of the CLIP image embedding, and therefore implemented the mapper with transformer blocks to decode them. Because the two are equivalent for a single token, we did not replace the transformer with FC layers afterwards, which keeps the ablation study against the 257-token variant cleaner.
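
For anyone else wondering why the two are equivalent: with a sequence length of 1, the softmax in self-attention is taken over a single position, so the attention weights are always 1 and each token only passes through the value and output projections, i.e. a stack of per-token linear (FC) layers. A minimal PyTorch sketch of this collapse (not the repository's code; the dimension and module choice here are illustrative):

```python
import torch
import torch.nn as nn

torch.manual_seed(0)

dim = 768
x = torch.randn(1, 1, dim)  # (batch, seq_len=1, dim): a single image token

# Multi-head self-attention over a length-1 sequence.
attn = nn.MultiheadAttention(dim, num_heads=8, batch_first=True)
attn_out, attn_weights = attn(x, x, x)

print(attn_weights)  # all ones: softmax over one position is trivial

# Reproduce the same output using only linear layers from the module:
# the value projection followed by the output projection.
qkv = nn.functional.linear(x, attn.in_proj_weight, attn.in_proj_bias)
_, _, v = qkv.chunk(3, dim=-1)   # keep only the value projection
linear_out = attn.out_proj(v)

print(torch.allclose(attn_out, linear_out, atol=1e-6))  # True
```

So for one token, the attention sub-layer reduces to two linear maps, and together with the block's MLP the whole thing behaves like a few stacked FC layers, as stated above.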