issues
search
OSU-NLP-Group
/
SeeAct
[ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large multimodal models (LMMs) such as GPT-4V(ision).
https://osu-nlp-group.github.io/SeeAct/
Other
571
stars
69
forks
source link
Add SOM Grounding and Update README
#44
Closed
boyuanzheng010
closed
1 month ago