web-arena-x / webarena

Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents"
https://webarena.dev
Apache License 2.0

Question about the 179 human trajectories #171

Open dgjun32 opened 3 months ago

dgjun32 commented 3 months ago

Is it possible to extract the sequence of HTML (or AXTree) and actions from the trace.zip file?

shuyanzhou commented 3 months ago

Extracting the HTML should be possible. You can find the HTML files after uncompressing each trace. The trace viewer also shows that the web pages are stored as HTML.
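As a rough illustration of the uncompress-and-look step, here is a minimal sketch using only the Python standard library. The thread does not describe the layout inside the archive, so the function name `extract_html_files`, the output directory, and the content-sniffing heuristic are all assumptions rather than part of the WebArena tooling.

```python
import zipfile
from pathlib import Path


def extract_html_files(trace_zip: str, out_dir: str) -> list[Path]:
    """Unzip a trace archive and return the extracted files that look like HTML.

    Hypothetical helper: the archive layout is not specified in the thread,
    so every extracted file is checked instead of assuming a fixed folder.
    """
    out = Path(out_dir)
    with zipfile.ZipFile(trace_zip) as zf:
        zf.extractall(out)

    html_files = []
    for path in sorted(out.rglob("*")):
        if not path.is_file():
            continue
        # Stored pages may lack a file extension, so also sniff the first bytes.
        head = path.read_bytes()[:512].lstrip().lower()
        has_html_suffix = path.suffix.lower() in {".html", ".htm"}
        if has_html_suffix or head.startswith((b"<!doctype html", b"<html")):
            html_files.append(path)
    return html_files
```

Calling `extract_html_files("trace.zip", "trace_out")` would return the candidate pages as `Path` objects. Whether their order matches the order of the recorded steps is not something this sketch can guarantee.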

Once you have the HTML files, you can reload them in a browser and extract the AXTree.
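One possible way to do the reload-and-extract step is sketched below. It assumes Playwright for Python with Chromium installed and uses the Chrome DevTools Protocol call `Accessibility.getFullAXTree`. The helper names `fetch_ax_nodes` and `format_ax_nodes` are hypothetical, and this is not the script referenced elsewhere in this thread.

```python
def fetch_ax_nodes(html: str) -> list[dict]:
    """Render an HTML string in headless Chromium and return the raw AX nodes.

    Assumes Playwright and its Chromium build are installed.
    """
    # Imported lazily so the formatting helper below works without Playwright.
    from playwright.sync_api import sync_playwright

    with sync_playwright() as p:
        browser = p.chromium.launch()
        page = browser.new_page()
        page.set_content(html)
        # CDP sessions are only available for Chromium-based browsers.
        client = page.context.new_cdp_session(page)
        nodes = client.send("Accessibility.getFullAXTree")["nodes"]
        browser.close()
    return nodes


def format_ax_nodes(nodes: list[dict]) -> list[str]:
    """Turn CDP AX nodes into one '[id] role name' line each, skipping ignored nodes."""
    lines = []
    for node in nodes:
        if node.get("ignored"):
            continue
        role = node.get("role", {}).get("value", "")
        name = node.get("name", {}).get("value", "")
        lines.append(f"[{node['nodeId']}] {role} {name!r}")
    return lines
```

A saved page can be read with `Path(...).read_text()` and passed through `format_ax_nodes(fetch_ax_nodes(html))`. Pages loaded this way lose their original URL, so relative stylesheets and scripts may not resolve, which can change the resulting tree.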

cc @oootttyyy IIRC, you have the scripts to extract AXTrees. Can you share them? Thanks!

oootttyyy commented 2 months ago

Hi @dgjun32, the extraction script is in #87.