ServiceNow / BrowserGym

BrowserGym, a gym environment for web task automation in the Chromium browser.
Other
332 stars 43 forks source link

Task hanging in webarena #206

Open recursix opened 1 month ago

recursix commented 1 month ago

Logs were not in debug mode:

2024-10-24 05:41:36,037 - 766755 - browsergym.experiments.loop - INFO - Running experiment GenericAgent-gpt-4o-mini-2024-07-18_on_visualwebarena.241_16 in:
  /home/toolkit/agentlab_results/2024-10-24_02-15-43_GenericAgent-gpt-4o-mini-2024-07-18_on_visualwebarena/2024-10-24_02-15-44_GenericAgent-gpt-4o-mini-2024-07-18_on_visualwebarena.241_16
2024-10-24 05:41:56,460 - 766755 - root - WARNING - Agent is in goal mode, but multiple user messages are present in the chat. Consider switching to `enable_chat=True`.
2024-10-24 05:41:56,941 - 766755 - root - WARNING - The content of the message has images, which are not displayed in the string representation.
2024-10-24 05:42:11,611 - 766755 - httpx - INFO - HTTP Request: POST https://api.openai.com/v1/chat/completions "HTTP/1.1 200 OK"
2024-10-24 05:42:13,130 - 766755 - browsergym.experiments.loop - INFO - The current page displays a list of forums, and I need to find the specific recipe post related to the image. Since the image depicts a decorated cake, I will look for a relevant forum, likely under the "food" category. I will click on the "food" link to explore the posts related to recipes.

action:
click('182')

2024-10-24 05:42:15,369 - 766755 - httpx - INFO - HTTP Request: POST https://api.openai.com/v1/chat/completions "HTTP/1.1 200 OK"
2024-10-24 05:42:16,707 - 766755 - root - WARNING - Agent is in goal mode, but multiple user messages are present in the chat. Consider switching to `enable_chat=True`.
2024-10-24 05:42:16,707 - 766755 - root - WARNING - The content of the message has images, which are not displayed in the string representation.
2024-10-24 05:42:23,861 - 766755 - httpx - INFO - HTTP Request: POST https://api.openai.com/v1/chat/completions "HTTP/1.1 200 OK"
2024-10-24 05:42:24,687 - 766755 - browsergym.experiments.loop - INFO - The current page displays a list of posts in the "food" forum. I need to find a recipe post related to the image of the decorated cake. I will look for posts that might mention cakes or desserts. The first step is to scan through the visible posts and identify any that seem relevant to cakes. I will click on the post titled "[Homemade] Buttercream cake" as it is likely related to the image.

action:
click('640')

2024-10-24 05:42:26,840 - 766755 - httpx - INFO - HTTP Request: POST https://api.openai.com/v1/chat/completions "HTTP/1.1 200 OK"
2024-10-24 05:42:28,075 - 766755 - root - WARNING - Agent is in goal mode, but multiple user messages are present in the chat. Consider switching to `enable_chat=True`.
2024-10-24 05:42:28,076 - 766755 - root - WARNING - The content of the message has images, which are not displayed in the string representation.
2024-10-24 05:42:34,640 - 766755 - httpx - INFO - HTTP Request: POST https://api.openai.com/v1/chat/completions "HTTP/1.1 200 OK"
2024-10-24 05:42:35,471 - 766755 - browsergym.experiments.loop - INFO - I have navigated to the post titled "[Homemade] Buttercream cake," which is likely related to the image of the decorated cake. Now, I need to find the comments section of this post to gather more information about the recipe. I will click on the link that indicates the number of comments, which is "91 comments."

action:
click('657')

2024-10-24 05:42:39,837 - 766755 - httpx - INFO - HTTP Request: POST https://api.openai.com/v1/chat/completions "HTTP/1.1 200 OK"
2024-10-24 05:42:41,874 - 766755 - browsergym.core.observation - WARNING - Failed to extract BrowserGym data from ARIA string: 'Formatting help + Markdown allowed.Formatting help +'
2024-10-24 05:42:42,781 - 766755 - root - WARNING - Agent is in goal mode, but multiple user messages are present in the chat. Consider switching to `enable_chat=True`.
2024-10-24 05:42:42,782 - 766755 - root - WARNING - The content of the message has images, which are not displayed in the string representation.
2024-10-24 05:42:50,557 - 766755 - httpx - INFO - HTTP Request: POST https://api.openai.com/v1/chat/completions "HTTP/1.1 200 OK"
2024-10-24 05:42:51,851 - 766755 - browsergym.experiments.loop - INFO - I have successfully navigated to the comments section of the "[Homemade] Buttercream cake" post. Now, I need to find the specific comment that contains the recipe or any relevant information about it. I will scroll through the comments to locate any mention of the recipe or instructions related to the cake.

action:
scroll(0, 1000)  # Scroll down to view more comments

2024-10-24 05:42:54,114 - 766755 - httpx - INFO - HTTP Request: POST https://api.openai.com/v1/chat/completions "HTTP/1.1 200 OK"
2024-10-24 05:42:56,430 - 766755 - browsergym.core.observation - WARNING - Failed to extract BrowserGym data from ARIA string: 'Formatting help + Markdown allowed.Formatting help +'
2024-10-24 05:42:57,321 - 766755 - root - WARNING - Agent is in goal mode, but multiple user messages are present in the chat. Consider switching to `enable_chat=True`.
2024-10-24 05:42:57,323 - 766755 - root - WARNING - The content of the message has images, which are not displayed in the string representation.
2024-10-24 05:43:03,887 - 766755 - httpx - INFO - HTTP Request: POST https://api.openai.com/v1/chat/completions "HTTP/1.1 200 OK"
2024-10-24 05:43:05,153 - 766755 - browsergym.experiments.loop - INFO - I have scrolled down to view more comments in the "[Homemade] Buttercream cake" post. Now, I need to look for any comments that specifically mention the recipe or provide instructions related to the cake. I will continue scrolling to see if any relevant comments appear.

action:
scroll(0, 1000)  # Scroll down further to view more comments

2024-10-24 05:43:07,276 - 766755 - httpx - INFO - HTTP Request: POST https://api.openai.com/v1/chat/completions "HTTP/1.1 200 OK"
2024-10-24 05:43:09,318 - 766755 - browsergym.core.observation - WARNING - Failed to extract BrowserGym data from ARIA string: 'Formatting help + Markdown allowed.Formatting help +'
2024-10-24 05:43:10,159 - 766755 - root - WARNING - Agent is in goal mode, but multiple user messages are present in the chat. Consider switching to `enable_chat=True`.
2024-10-24 05:43:10,160 - 766755 - root - WARNING - The content of the message has images, which are not displayed in the string representation.
2024-10-24 05:43:18,065 - 766755 - httpx - INFO - HTTP Request: POST https://api.openai.com/v1/chat/completions "HTTP/1.1 200 OK"
2024-10-24 05:43:19,305 - 766755 - browsergym.experiments.loop - INFO - I have scrolled down to view more comments in the "[Homemade] Buttercream cake" post. Now, I need to look for any comments that specifically mention the recipe or provide instructions related to the cake. I will continue scrolling to see if any relevant comments appear. Since I have already scrolled down twice, I will scroll down again to see if I can find any comments that contain the recipe or related information.

action:
scroll(0, 1000)  # Scroll down further to view more comments

2024-10-24 05:43:21,199 - 766755 - httpx - INFO - HTTP Request: POST https://api.openai.com/v1/chat/completions "HTTP/1.1 200 OK"
2024-10-24 05:43:23,408 - 766755 - browsergym.core.observation - WARNING - Failed to extract BrowserGym data from ARIA string: 'Formatting help + Markdown allowed.Formatting help +'
2024-10-24 05:43:24,209 - 766755 - root - WARNING - Agent is in goal mode, but multiple user messages are present in the chat. Consider switching to `enable_chat=True`.
2024-10-24 05:43:24,211 - 766755 - root - WARNING - The content of the message has images, which are not displayed in the string representation.
2024-10-24 05:43:33,398 - 766755 - httpx - INFO - HTTP Request: POST https://api.openai.com/v1/chat/completions "HTTP/1.1 200 OK"
2024-10-24 05:43:34,668 - 766755 - browsergym.experiments.loop - INFO - I have scrolled down multiple times in the comments section of the "[Homemade] Buttercream cake" post, but I haven't found any specific comments that mention the recipe or provide instructions related to the cake. I will now scroll back up to the top of the comments section to see if there are any pinned comments or earlier comments that might contain the recipe or relevant information.

action:
scroll(0, -1000)  # Scroll up to view earlier comments

2024-10-24 05:43:37,251 - 766755 - httpx - INFO - HTTP Request: POST https://api.openai.com/v1/chat/completions "HTTP/1.1 200 OK"
2024-10-24 05:43:39,682 - 766755 - browsergym.core.observation - WARNING - Failed to extract BrowserGym data from ARIA string: 'Formatting help + Markdown allowed.Formatting help +'
2024-10-24 05:43:40,547 - 766755 - root - WARNING - Agent is in goal mode, but multiple user messages are present in the chat. Consider switching to `enable_chat=True`.
2024-10-24 05:43:40,548 - 766755 - root - WARNING - The content of the message has images, which are not displayed in the string representation.
2024-10-24 05:43:51,148 - 766755 - httpx - INFO - HTTP Request: POST https://api.openai.com/v1/chat/completions "HTTP/1.1 200 OK"
2024-10-24 05:43:52,435 - 766755 - browsergym.experiments.loop - INFO - I have scrolled back up to view earlier comments in the "[Homemade] Buttercream cake" post. I need to look for any comments that specifically mention the recipe or provide instructions related to the cake. Since I haven't found any relevant comments yet, I will scroll down again to see if any comments appear that contain the recipe or related information.

action:
scroll(0, 1000)  # Scroll down to view more comments

2024-10-24 05:43:54,640 - 766755 - httpx - INFO - HTTP Request: POST https://api.openai.com/v1/chat/completions "HTTP/1.1 200 OK"
2024-10-24 05:43:57,230 - 766755 - browsergym.core.observation - WARNING - Failed to extract BrowserGym data from ARIA string: 'Formatting help + Markdown allowed.Formatting help +'
2024-10-24 05:43:58,078 - 766755 - root - WARNING - Agent is in goal mode, but multiple user messages are present in the chat. Consider switching to `enable_chat=True`.
2024-10-24 05:43:58,079 - 766755 - root - WARNING - The content of the message has images, which are not displayed in the string representation.
2024-10-24 05:44:17,310 - 766755 - httpx - INFO - HTTP Request: POST https://api.openai.com/v1/chat/completions "HTTP/1.1 200 OK"
2024-10-24 05:44:18,584 - 766755 - browsergym.experiments.loop - INFO - I have scrolled multiple times through the comments section of the "[Homemade] Buttercream cake" post, but I haven't found any specific comments that mention the recipe or provide instructions related to the cake. I will now scroll back down to see if any new comments appear that might contain the recipe or relevant information.

action:
scroll(0, 1000)  # Scroll down to view more comments

2024-10-24 05:44:20,762 - 766755 - httpx - INFO - HTTP Request: POST https://api.openai.com/v1/chat/completions "HTTP/1.1 200 OK"
2024-10-24 05:44:22,757 - 766755 - browsergym.core.observation - WARNING - Failed to extract BrowserGym data from ARIA string: 'Formatting help + Markdown allowed.Formatting help +'
recursix commented 1 month ago
2024-10-24 05:43:37,801 - 824742 - browsergym.experiments.loop - INFO - Running experiment GenericAgent-gpt-4o-mini-2024-07-18_on_visualwebarena.245_5 in:
  /home/toolkit/agentlab_results/2024-10-24_02-15-43_GenericAgent-gpt-4o-mini-2024-07-18_on_visualwebarena/2024-10-24_02-15-44_GenericAgent-gpt-4o-mini-2024-07-18_on_visualwebarena.245_5
2024-10-24 05:43:54,475 - 824742 - root - WARNING - Agent is in goal mode, but multiple user messages are present in the chat. Consider switching to `enable_chat=True`.
2024-10-24 05:43:54,941 - 824742 - root - WARNING - The content of the message has images, which are not displayed in the string representation.
2024-10-24 05:44:02,639 - 824742 - httpx - INFO - HTTP Request: POST https://api.openai.com/v1/chat/completions "HTTP/1.1 200 OK"
2024-10-24 05:44:03,870 - 824742 - browsergym.experiments.loop - INFO - The current page displays a list of forums, but I need to find the specific post related to the image provided. Since the image depicts a cityscape at night, I will search for relevant forums that might contain such posts. The most likely candidates are forums related to cities or photography. I will click on the "photography" forum link to explore posts that may contain the image.

action:
click('256')  # Assuming '256' corresponds to the 'photography' forum link.

2024-10-24 05:44:07,067 - 824742 - root - WARNING - Agent is in goal mode, but multiple user messages are present in the chat. Consider switching to `enable_chat=True`.
2024-10-24 05:44:07,068 - 824742 - root - WARNING - The content of the message has images, which are not displayed in the string representation.
2024-10-24 05:44:13,447 - 824742 - httpx - INFO - HTTP Request: POST https://api.openai.com/v1/chat/completions "HTTP/1.1 200 OK"
2024-10-24 05:44:14,143 - 824742 - browsergym.experiments.loop - INFO - The current page displays a list of articles in the Music forum, but I need to find the specific post related to the image provided. Since the image depicts a cityscape at night, I will look for posts that might relate to photography or cityscapes. I will click on the first article link to explore its content and see if it matches the image.

action:
click('378')  # Clicking on the first article link to check its content.

2024-10-24 05:44:21,175 - 824742 - browsergym.core.observation - WARNING - Failed to extract BrowserGym data from ARIA string: 'Formatting help + Markdown allowed.Formatting help +'
2024-10-24 05:44:22,711 - 824742 - root - WARNING - Agent is in goal mode, but multiple user messages are present in the chat. Consider switching to `enable_chat=True`.
2024-10-24 05:44:22,713 - 824742 - root - WARNING - The content of the message has images, which are not displayed in the string representation.
2024-10-24 05:44:22,782 - 824742 - root - WARNING - The content of the message has images, which are not displayed in the string representation.
2024-10-24 05:44:22,846 - 824742 - root - WARNING - The content of the message has images, which are not displayed in the string representation.
2024-10-24 05:44:22,915 - 824742 - root - WARNING - The content of the message has images, which are not displayed in the string representation.
2024-10-24 05:44:22,982 - 824742 - root - WARNING - The content of the message has images, which are not displayed in the string representation.
2024-10-24 05:44:23,050 - 824742 - root - WARNING - The content of the message has images, which are not displayed in the string representation.
2024-10-24 05:44:23,115 - 824742 - root - WARNING - The content of the message has images, which are not displayed in the string representation.
2024-10-24 05:44:23,181 - 824742 - root - WARNING - The content of the message has images, which are not displayed in the string representation.
2024-10-24 05:44:23,246 - 824742 - root - WARNING - The content of the message has images, which are not displayed in the string representation.
2024-10-24 05:44:23,312 - 824742 - root - WARNING - The content of the message has images, which are not displayed in the string representation.
2024-10-24 05:44:23,385 - 824742 - root - WARNING - The content of the message has images, which are not displayed in the string representation.
2024-10-24 05:44:23,453 - 824742 - root - WARNING - The content of the message has images, which are not displayed in the string representation.
2024-10-24 05:44:23,502 - 824742 - root - WARNING - The content of the message has images, which are not displayed in the string representation.
2024-10-24 05:44:42,818 - 824742 - httpx - INFO - HTTP Request: POST https://api.openai.com/v1/chat/completions "HTTP/1.1 200 OK"
2024-10-24 05:44:44,616 - 824742 - browsergym.experiments.loop - INFO - The current page displays a post in the Music forum, and I can see that there are comments related to the post. I need to navigate to the comments section to find the specific comments related to the image provided. The comments section is already visible, so I will click on the link that indicates the number of comments to access them directly.

action:
click('146')  # Clicking on the link for '175 comments' to navigate to the comments section.

2024-10-24 05:44:52,358 - 824742 - browsergym.core.observation - WARNING - Failed to extract BrowserGym data from ARIA string: 'Formatting help + Markdown allowed.Formatting help +'
2024-10-24 05:44:53,813 - 824742 - root - WARNING - Agent is in goal mode, but multiple user messages are present in the chat. Consider switching to `enable_chat=True`.
2024-10-24 05:44:53,815 - 824742 - root - WARNING - The content of the message has images, which are not displayed in the string representation.
2024-10-24 05:44:53,888 - 824742 - root - WARNING - The content of the message has images, which are not displayed in the string representation.
2024-10-24 05:44:53,958 - 824742 - root - WARNING - The content of the message has images, which are not displayed in the string representation.
2024-10-24 05:44:54,026 - 824742 - root - WARNING - The content of the message has images, which are not displayed in the string representation.
2024-10-24 05:44:54,128 - 824742 - root - WARNING - The content of the message has images, which are not displayed in the string representation.
2024-10-24 05:44:54,213 - 824742 - root - WARNING - The content of the message has images, which are not displayed in the string representation.
2024-10-24 05:44:54,284 - 824742 - root - WARNING - The content of the message has images, which are not displayed in the string representation.
2024-10-24 05:44:54,357 - 824742 - root - WARNING - The content of the message has images, which are not displayed in the string representation.
2024-10-24 05:44:54,429 - 824742 - root - WARNING - The content of the message has images, which are not displayed in the string representation.
2024-10-24 05:44:54,498 - 824742 - root - WARNING - The content of the message has images, which are not displayed in the string representation.
2024-10-24 05:44:54,573 - 824742 - root - WARNING - The content of the message has images, which are not displayed in the string representation.
2024-10-24 05:44:54,649 - 824742 - root - WARNING - The content of the message has images, which are not displayed in the string representation.
2024-10-24 05:44:54,702 - 824742 - root - WARNING - The content of the message has images, which are not displayed in the string representation.
2024-10-24 05:45:04,711 - 824742 - httpx - INFO - HTTP Request: POST https://api.openai.com/v1/chat/completions "HTTP/1.1 200 OK"
2024-10-24 05:45:06,499 - 824742 - browsergym.experiments.loop - INFO - The previous action successfully navigated to the comments section of the post. I can see the comments are visible, and I need to find specific comments related to the image provided. Since the comments are already displayed, I will review them to find any relevant discussions or insights related to the image.

action:
noop(1000)  # Waiting briefly to ensure the comments are fully loaded for review.

2024-10-24 05:45:12,110 - 824742 - browsergym.core.observation - WARNING - Failed to extract BrowserGym data from ARIA string: 'Formatting help + Markdown allowed.Formatting help +'
recursix commented 1 month ago
2024-10-24 06:02:03,409 - 840956 - browsergym.experiments.loop - INFO - Running experiment GenericAgent-gpt-4o-mini-2024-07-18_on_visualwebarena.325_11 in:
  /home/toolkit/agentlab_results/2024-10-24_02-15-43_GenericAgent-gpt-4o-mini-2024-07-18_on_visualwebarena/2024-10-24_02-15-44_GenericAgent-gpt-4o-mini-2024-07-18_on_visualwebarena.325_11
2024-10-24 06:02:15,566 - 840956 - browsergym.core.observation - WARNING - Failed to extract BrowserGym data from ARIA string: 'Formatting help + Markdown allowed.Formatting help +'
2024-10-24 06:02:19,617 - 840956 - httpx - INFO - HTTP Request: POST https://api.openai.com/v1/chat/completions "HTTP/1.1 200 OK"
2024-10-24 06:02:20,190 - 840956 - browsergym.experiments.loop - INFO - The topmost container on the page is likely the main header or navigation area. To determine its color, I will need to inspect the styles applied to the topmost visible element. I will hover over the first visible element in the AXTree to see its properties, including its background color.

action:
hover('27')

2024-10-24 06:02:28,322 - 840956 - browsergym.core.env - INFO - The active page and / or page history has changed during task.validate(). A recovery fix will be applied.
2024-10-24 06:02:29,458 - 840956 - browsergym.core.observation - WARNING - Failed to extract BrowserGym data from ARIA string: 'Formatting help + Markdown allowed.Formatting help +'
2024-10-24 06:02:33,866 - 840956 - httpx - INFO - HTTP Request: POST https://api.openai.com/v1/chat/completions "HTTP/1.1 200 OK"
2024-10-24 06:02:34,411 - 840956 - browsergym.experiments.loop - INFO - The topmost container appears to be the navigation area. I will now check its properties to determine its background color. I will hover over the first visible element in the navigation section to inspect its styles.

action:
hover('27')

2024-10-24 06:02:42,516 - 840956 - browsergym.core.env - INFO - The active page and / or page history has changed during task.validate(). A recovery fix will be applied.
2024-10-24 06:02:44,330 - 840956 - browsergym.core.observation - WARNING - Failed to extract BrowserGym data from ARIA string: 'Formatting help + Markdown allowed.Formatting help +'
2024-10-24 06:02:49,231 - 840956 - httpx - INFO - HTTP Request: POST https://api.openai.com/v1/chat/completions "HTTP/1.1 200 OK"
2024-10-24 06:02:49,789 - 840956 - browsergym.experiments.loop - INFO - The topmost container is the navigation area, which I have already identified. To determine its background color, I will need to inspect the styles applied to this element. I will attempt to hover over the navigation area again to see if I can retrieve the color information.

action:
hover('27')

2024-10-24 06:02:57,914 - 840956 - browsergym.core.env - INFO - The active page and / or page history has changed during task.validate(). A recovery fix will be applied.