Open Winnie202 opened 1 year ago
Hi @Winnie202, awesome results!! I'm glad you could reconstruct the small text. We could try generating synthetic scene text as a follow-up.
Do you have any good suggestions for improving the reconstruction of the text in these scenes?
Thank you for sharing the code. I used taming-transformers to do image reconstruction on Street View images, but the smaller text regions are not reconstructed well. If I train this model on this type of dataset, could it improve VQGAN's reconstruction of small text, like these: