No browser event when set experiment_split="element_attribute" in generate_prompt in seeact.py

Yeah, you are right. SeeAct v0.1.0 only supports the text-choice grounding strategy.

We have not added the other two grounding strategies in SeeAct v0.1.0, since they were not working well in our offline experiments. We will support that and OSS models in later versions. (We are still busy for some other things. Sorry for this)

And by the way, SeeAct is easy to expand, so you can also try something like combining different grounding strategies and input information (We will also add some expansions like this in later updates.). We did many ablation studies like combining text_choice and image_annotation at the start of the project.

I've seen people doing such things in recent papers. For example

VisualWebArena:
- screenshot+captioning+HTML(similar to screenshot+text_choice+HTML),
- screenshot + image_annotation+HTML;
WebVoyager:
- screenshot+image_annotation+text content (the same thing I implemented in the online text_choice, hence essentially combining text_choice and image_annotatoin)

And we will definitely make more expansions other than these to build better web agents. Stay tuned.

OSU-NLP-Group / SeeAct

No browser event when set experiment_split="element_attribute" in generate_prompt in seeact.py #14