AILab-CVC / SEED

Official implementation of SEED-LLaMA (ICLR 2024).
https://ailab-cvc.github.io/seed
Other
515 stars 30 forks source link

Does model has Chinese OCR ability? #38

Open luohao123 opened 2 months ago

luohao123 commented 2 months ago

Hi, have 2 questions wanna ask:

  1. Does the model has OCR ability, unlike llava, it limited on English OCR ability in vision encoder, does this has?
  2. If the model performance is not good, what's it's limitation? Is in the Tokenzier, or LLM?