AIGText / Glyph-ByT5

This is an official inference code of the paper "Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering" and "Glyph-ByT5-v2: A Strong Aesthetic Baseline for Accurate Multilingual Visual Text Rendering""
https://glyph-byt5.github.io/
Apache License 2.0
367 stars 20 forks source link

Why change bbox subtle will lead the text generation worse? #15

Open 5RJ opened 3 days ago

5RJ commented 3 days ago

Hello, thanks for your work to push this area. However, I can not reimplement your inference result, like shower.json in examples/ The json is in applement files. shower_ori.json

I got the result as follows: image

And I also find that when get the bbox smaller or larger, the generated texts gonna be uncontrolble, I'm curious about the reason: this is the result of get bbox width smaller: the json is shower_small.json image

this is the result of get bbox width larger: the json is shower_large.json image