cooelf / Auto-GUI

Official implementation for "You Only Look at Screens: Multimodal Chain-of-Action Agents" (Findings of ACL 2024)
https://arxiv.org/abs/2309.11436
Apache License 2.0
174 stars 15 forks source link

Incorrect Scroll map #13

Closed neerajanand321 closed 2 months ago

neerajanand321 commented 3 months ago

Hi, Thanks for the nice work. I think in utils_data, co-ordinates for left and right are wrong, for left instead of [[0.8000, 0.5000], [0.2000, 0.5000]] it should be [[0.5, 0.8], [0.5, 0.2]] and for right it should be [[0.5, 0.2], [0.5, 0.8]]

cooelf commented 2 months ago

Yes. There is a mistake in the old script. It does not align with the latest version of the paper (which has fixed the coordinates). We have just updated the script. Thanks.