Open Richar-Du opened 1 year ago
Should probably upgrade transformers https://github.com/huggingface/transformers/issues/22903
I have updated transformers to 4.30.2, but it doesn't work. The input of the processor is:
and I want to use pix2struct-large to generate its corresponding html. However, now the generated text is just: '<>'
@nbroad1881 @younesbelkada
Hi, I also encountered the same problem. I took a screenshot of the left subgraph of Figure 1 in the pix2struct paper, and the pix2struct-large model can only output the same '<>'. This is severely inconsistent with expectations and I am quite confused. I am eagerly anticipating the response from the author. Thanks a lot.
PS: my transformer version is 4.31.0.
+1
@kentonl, is there a prompt for pretraining?
+1
+1
Thanks for your awesome work!
I want to utilize the model to generate the HTML of an image, so I choose the pre-trained checkpoint without fine-tuning. However, the generated output is very short. For example, the following code only generate
<img_src=image>
without any detailed struct.My transformers version is 4.28.0. Do you know how to solve this problem? Thanks in advance :)