[ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large multimodal models (LMMs) such as GPT-4V(ision).
Thank you for your demo. I tried to generate screenshots but I didn't find these files: "../data/formal_manual_selection/task_id_dicts/30_selected.pkl"
"../data/source_data/20_chocies/test_website_outputs_top50.json"
Look forward to your reply.
Thank you for your demo. I tried to generate screenshots but I didn't find these files: "../data/formal_manual_selection/task_id_dicts/30_selected.pkl" "../data/source_data/20_chocies/test_website_outputs_top50.json" Look forward to your reply.