kyegomez / ScreenAI

Implementation of the ScreenAI model from the paper: "A Vision-Language Model for UI and Infographics Understanding"
https://discord.gg/GYbXvDGevY
MIT License
244 stars, 26 forks