AIGText / Glyph-ByT5

[ECCV2024] This is the official inference code for the papers "Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering" and "Glyph-ByT5-v2: A Strong Aesthetic Baseline for Accurate Multilingual Visual Text Rendering".
https://glyph-byt5.github.io/
Apache License 2.0

🚀🚀🚀 🔥🔥🔥 Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering

This is the official implementation of Glyph-ByT5 and Glyph-ByT5-v2, introduced in "Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering" and "Glyph-ByT5-v2: A Strong Aesthetic Baseline for Accurate Multilingual Visual Text Rendering".

News

⛽⛽⛽ Contact: yuhui.yuan@microsoft.com

2024.06.28: For the time being, we have removed the weights and code that may have relied on potentially unauthorized datasets. We will update the checkpoints after the Microsoft RAI process.

:high_brightness: Highlights

[Image gallery: paragraph, design, scene, and multilingual text rendering examples]

:wrench: Usage

For a detailed guide on Glyph-SDXL and Glyph-SDXL-v2 inference, see this folder.

For a detailed guide on Glyph-ByT5 alignment pretraining, see this folder.

:mailbox_with_mail: Citation

If you find this code useful in your research, please consider citing:

@article{liu2024glyph,
  title={Glyph-byt5: A customized text encoder for accurate visual text rendering},
  author={Liu, Zeyu and Liang, Weicong and Liang, Zhanhao and Luo, Chong and Li, Ji and Huang, Gao and Yuan, Yuhui},
  journal={arXiv preprint arXiv:2403.09622},
  year={2024}
}

and

@article{liu2024glyphv2,
  title={Glyph-ByT5-v2: A Strong Aesthetic Baseline for Accurate Multilingual Visual Text Rendering},
  author={Liu, Zeyu and Liang, Weicong and Zhao, Yiming and Chen, Bohan and Li, Ji and Yuan, Yuhui},
  journal={arXiv preprint arXiv:2406.10208},
  year={2024}
}