Closed recursix closed 4 days ago
Showing Max 2 stack traces:
2024-10-28 01:38:58,428 - 2180615 - browsergym.experiments.loop - INFO - Running experiment GenericAgent-gpt-4o-mini-2024-07-18_on_visualwebarena.282_22 in: /home/toolkit/agentlab_results/2024-10-24_21-33-14_GenericAgent-gpt-4o-mini-2024-07-18_on_visualwebarena/2024-10-28_01-38-10_GenericAgent-gpt-4o-mini-2024-07-18_on_visualwebarena.282_22 2024-10-28 01:38:58,431 - 2180615 - httpx - DEBUG - load_ssl_context verify=True cert=None trust_env=True http2=False 2024-10-28 01:38:58,432 - 2180615 - httpx - DEBUG - load_verify_locations cafile='/home/toolkit/micromamba/envs/ui-assist/lib/python3.12/site-packages/certifi/cacert.pem' 2024-10-28 01:38:58,458 - 2180615 - browsergym.experiments.loop - DEBUG - Agent created. 2024-10-28 01:38:58,547 - 2180615 - browsergym.experiments.loop - DEBUG - Environment created. 2024-10-28 01:38:58,965 - 2180615 - asyncio - DEBUG - Using selector: EpollSelector 2024-10-28 01:39:01,707 - 2180615 - httpx - DEBUG - load_ssl_context verify=True cert=None trust_env=True http2=False 2024-10-28 01:39:01,708 - 2180615 - httpx - DEBUG - load_verify_locations cafile='/home/toolkit/micromamba/envs/ui-assist/lib/python3.12/site-packages/certifi/cacert.pem' 2024-10-28 01:39:01,717 - 2180615 - httpx - DEBUG - load_ssl_context verify=True cert=None trust_env=True http2=False ... ...truncated middle of the log ... 2024-10-28 01:39:09,115 - 2180615 - browsergym.core.env - DEBUG - _activate_page_from_js(page) called, page=<Page url='http://visualwebarena.eastus.cloudapp.azure.com:8080/'> 2024-10-28 01:39:12,031 - 2180615 - browsergym.core.env - DEBUG - _activate_page_from_js(page) called, page=<Page url='http://visualwebarena.eastus.cloudapp.azure.com:8080/f/dataisbeautiful'> 2024-10-28 01:39:21,144 - 2180615 - browsergym.experiments.loop - WARNING - Exception uncaught by agent or environment in task visualwebarena.282. TimeoutError: Timeout 10000ms exceeded. 
=========================== logs ===========================
navigating to "http://visualwebarena.eastus.cloudapp.azure.com:8080/f/dataisbeautiful", waiting until "load"
============================================================
Traceback (most recent call last):
  File "/home/toolkit/dev/BrowserGym/browsergym/experiments/src/browsergym/experiments/loop.py", line 216, in run
    step_info.from_reset(
  File "/home/toolkit/dev/BrowserGym/browsergym/experiments/src/browsergym/experiments/loop.py", line 405, in from_reset
    self.obs, env_info = env.reset(seed=seed)
                         ^^^^^^^^^^^^^^^^^^^^
  File "/home/toolkit/micromamba/envs/ui-assist/lib/python3.12/site-packages/gymnasium/wrappers/time_limit.py", line 75, in reset
    return self.env.reset(**kwargs)
           ^^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/toolkit/micromamba/envs/ui-assist/lib/python3.12/site-packages/gymnasium/wrappers/order_enforcing.py", line 61, in reset
    return self.env.reset(**kwargs)
           ^^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/toolkit/dev/BrowserGym/browsergym/core/src/browsergym/core/env.py", line 285, in reset
    task_goal, task_info = self.task.setup(page=self.page)
                           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/toolkit/dev/BrowserGym/browsergym/visualwebarena/src/browsergym/visualwebarena/task.py", line 204, in setup
    page.goto(url)
  File "/home/toolkit/micromamba/envs/ui-assist/lib/python3.12/site-packages/playwright/sync_api/_generated.py", line 9303, in goto
    self._sync(
  File "/home/toolkit/micromamba/envs/ui-assist/lib/python3.12/site-packages/playwright/_impl/_sync_base.py", line 109, in _sync
    return task.result()
           ^^^^^^^^^^^^^
  File "/home/toolkit/micromamba/envs/ui-assist/lib/python3.12/site-packages/playwright/_impl/_page.py", line 473, in goto
    return await self._main_frame.goto(**locals_to_params(locals()))
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/toolkit/micromamba/envs/ui-assist/lib/python3.12/site-packages/playwright/_impl/_frame.py", line 138, in goto
    await self._channel.send("goto", locals_to_params(locals()))
  File "/home/toolkit/micromamba/envs/ui-assist/lib/python3.12/site-packages/playwright/_impl/_connection.py", line 61, in send
    return await self._connection.wrap_api_call(
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/toolkit/micromamba/envs/ui-assist/lib/python3.12/site-packages/playwright/_impl/_connection.py", line 490, in wrap_api_call
    return await cb()
           ^^^^^^^^^^
  File "/home/toolkit/micromamba/envs/ui-assist/lib/python3.12/site-packages/playwright/_impl/_connection.py", line 99, in inner_send
    result = next(iter(done)).result()
             ^^^^^^^^^^^^^^^^^^^^^^^^^
playwright._impl._api_types.TimeoutError: Timeout 10000ms exceeded.
=========================== logs ===========================
navigating to "http://visualwebarena.eastus.cloudapp.azure.com:8080/f/dataisbeautiful", waiting until "load"
============================================================
2024-10-28 01:38:58,366 - 2180618 - browsergym.experiments.loop - INFO - Running experiment GenericAgent-gpt-4o-mini-2024-07-18_on_visualwebarena.305_18 in: /home/toolkit/agentlab_results/2024-10-24_21-33-14_GenericAgent-gpt-4o-mini-2024-07-18_on_visualwebarena/2024-10-28_01-38-11_GenericAgent-gpt-4o-mini-2024-07-18_on_visualwebarena.305_18 2024-10-28 01:38:58,368 - 2180618 - httpx - DEBUG - load_ssl_context verify=True cert=None trust_env=True http2=False 2024-10-28 01:38:58,370 - 2180618 - httpx - DEBUG - load_verify_locations cafile='/home/toolkit/micromamba/envs/ui-assist/lib/python3.12/site-packages/certifi/cacert.pem' 2024-10-28 01:38:58,394 - 2180618 - browsergym.experiments.loop - DEBUG - Agent created. 2024-10-28 01:38:58,455 - 2180618 - browsergym.experiments.loop - DEBUG - Environment created. 2024-10-28 01:38:58,879 - 2180618 - asyncio - DEBUG - Using selector: EpollSelector 2024-10-28 01:39:01,629 - 2180618 - httpx - DEBUG - load_ssl_context verify=True cert=None trust_env=True http2=False 2024-10-28 01:39:01,629 - 2180618 - httpx - DEBUG - load_verify_locations cafile='/home/toolkit/micromamba/envs/ui-assist/lib/python3.12/site-packages/certifi/cacert.pem' 2024-10-28 01:39:01,638 - 2180618 - httpx - DEBUG - load_ssl_context verify=True cert=None trust_env=True http2=False ... ...truncated middle of the log ... 2024-10-28 01:39:08,417 - 2180618 - browsergym.core.env - DEBUG - _activate_page_from_js(page) called, page=<Page url='http://visualwebarena.eastus.cloudapp.azure.com:8080/'> 2024-10-28 01:39:09,799 - 2180618 - browsergym.core.env - DEBUG - _activate_page_from_js(page) called, page=<Page url='http://visualwebarena.eastus.cloudapp.azure.com:8080/f/memes/new'> 2024-10-28 01:39:20,434 - 2180618 - browsergym.experiments.loop - WARNING - Exception uncaught by agent or environment in task visualwebarena.305. TimeoutError: Timeout 10000ms exceeded. 
=========================== logs ===========================
navigating to "http://visualwebarena.eastus.cloudapp.azure.com:8080/f/memes/new", waiting until "load"
============================================================
Traceback (most recent call last):
  File "/home/toolkit/dev/BrowserGym/browsergym/experiments/src/browsergym/experiments/loop.py", line 216, in run
    step_info.from_reset(
  File "/home/toolkit/dev/BrowserGym/browsergym/experiments/src/browsergym/experiments/loop.py", line 405, in from_reset
    self.obs, env_info = env.reset(seed=seed)
                         ^^^^^^^^^^^^^^^^^^^^
  File "/home/toolkit/micromamba/envs/ui-assist/lib/python3.12/site-packages/gymnasium/wrappers/time_limit.py", line 75, in reset
    return self.env.reset(**kwargs)
           ^^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/toolkit/micromamba/envs/ui-assist/lib/python3.12/site-packages/gymnasium/wrappers/order_enforcing.py", line 61, in reset
    return self.env.reset(**kwargs)
           ^^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/toolkit/dev/BrowserGym/browsergym/core/src/browsergym/core/env.py", line 285, in reset
    task_goal, task_info = self.task.setup(page=self.page)
                           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/toolkit/dev/BrowserGym/browsergym/visualwebarena/src/browsergym/visualwebarena/task.py", line 204, in setup
    page.goto(url)
  File "/home/toolkit/micromamba/envs/ui-assist/lib/python3.12/site-packages/playwright/sync_api/_generated.py", line 9303, in goto
    self._sync(
  File "/home/toolkit/micromamba/envs/ui-assist/lib/python3.12/site-packages/playwright/_impl/_sync_base.py", line 109, in _sync
    return task.result()
           ^^^^^^^^^^^^^
  File "/home/toolkit/micromamba/envs/ui-assist/lib/python3.12/site-packages/playwright/_impl/_page.py", line 473, in goto
    return await self._main_frame.goto(**locals_to_params(locals()))
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/toolkit/micromamba/envs/ui-assist/lib/python3.12/site-packages/playwright/_impl/_frame.py", line 138, in goto
    await self._channel.send("goto", locals_to_params(locals()))
  File "/home/toolkit/micromamba/envs/ui-assist/lib/python3.12/site-packages/playwright/_impl/_connection.py", line 61, in send
    return await self._connection.wrap_api_call(
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/toolkit/micromamba/envs/ui-assist/lib/python3.12/site-packages/playwright/_impl/_connection.py", line 490, in wrap_api_call
    return await cb()
           ^^^^^^^^^^
  File "/home/toolkit/micromamba/envs/ui-assist/lib/python3.12/site-packages/playwright/_impl/_connection.py", line 99, in inner_send
    result = next(iter(done)).result()
             ^^^^^^^^^^^^^^^^^^^^^^^^^
playwright._impl._api_types.TimeoutError: Timeout 10000ms exceeded.
=========================== logs ===========================
navigating to "http://visualwebarena.eastus.cloudapp.azure.com:8080/f/memes/new", waiting until "load"
============================================================
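Both traces above fail in `page.goto(url)` during `task.setup`, once Playwright's 10000 ms navigation timeout expires. One possible mitigation, sketched here under assumptions, is to retry the navigation with a longer timeout on each attempt. `goto_with_retry`, its parameters, and the timeout schedule are hypothetical and not part of BrowserGym or Playwright. The only Playwright behavior assumed is that `page.goto` accepts a `timeout` keyword in milliseconds.

```python
import time
from typing import Callable, Sequence, Tuple, Type


def goto_with_retry(
    goto: Callable[..., object],
    url: str,
    timeouts_ms: Sequence[int] = (10_000, 30_000, 60_000),  # hypothetical schedule
    retry_on: Tuple[Type[BaseException], ...] = (TimeoutError,),
    pause_s: float = 0.0,
) -> object:
    """Call goto(url, timeout=...) once per entry in timeouts_ms.

    Returns the first successful result. If every attempt raises one of the
    exception types in retry_on, the last such exception is re-raised.
    """
    if not timeouts_ms:
        raise ValueError("timeouts_ms must contain at least one timeout")

    last_error = None
    for timeout_ms in timeouts_ms:
        try:
            return goto(url, timeout=timeout_ms)
        except retry_on as exc:
            # Remember the failure and try again with the next, longer timeout.
            last_error = exc
            time.sleep(pause_s)
    raise last_error
```

With Playwright this might be called as `goto_with_retry(page.goto, url, retry_on=(playwright.sync_api.TimeoutError,))`. Playwright raises its own `TimeoutError` class, so it would need to be passed explicitly rather than relying on the built-in default. Whether a longer timeout helps depends on why the VisualWebArena server was slow to respond, which the logs do not show.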
2024-10-28 01:40:47,810 - 2180614 - browsergym.experiments.loop - INFO - Running experiment GenericAgent-gpt-4o-mini-2024-07-18_on_visualwebarena.597_12 in: /home/toolkit/agentlab_results/2024-10-24_21-33-14_GenericAgent-gpt-4o-mini-2024-07-18_on_visualwebarena/2024-10-28_01-38-11_GenericAgent-gpt-4o-mini-2024-07-18_on_visualwebarena.597_12 2024-10-28 01:40:47,811 - 2180614 - httpx - DEBUG - load_ssl_context verify=True cert=None trust_env=True http2=False 2024-10-28 01:40:47,813 - 2180614 - httpx - DEBUG - load_verify_locations cafile='/home/toolkit/micromamba/envs/ui-assist/lib/python3.12/site-packages/certifi/cacert.pem' 2024-10-28 01:40:47,835 - 2180614 - browsergym.experiments.loop - DEBUG - Agent created. 2024-10-28 01:40:47,836 - 2180614 - browsergym.experiments.loop - DEBUG - Environment created. 2024-10-28 01:40:49,014 - 2180614 - browsergym.core.env - DEBUG - _activate_page_from_js(page) called, page=<Page url='about:blank'> 2024-10-28 01:40:49,615 - 2180614 - browsergym.core.env - DEBUG - _activate_page_from_js(page) called, page=<Page url='http://visualwebarena.eastus.cloudapp.azure.com:8082/customer/account/login/'> 2024-10-28 01:40:50,545 - 2180614 - browsergym.core.env - DEBUG - _activate_page_from_js(page) called, page=<Page url='http://visualwebarena.eastus.cloudapp.azure.com:8082/customer/account/login/'> 2024-10-28 01:40:50,549 - 2180614 - browsergym.core.env - DEBUG - _activate_page_from_js(page) called, page=<Page url='http://visualwebarena.eastus.cloudapp.azure.com:8082/customer/account/login/'> ... ...truncated middle of the log ... action: click('1625') # This link appears to be for the Microsoft Xbox One Console with Accessories, 1TB HDD - Black. 2024-10-28 01:41:39,838 - 2180614 - browsergym.experiments.loop - DEBUG - Chat info sent. 2024-10-28 01:41:39,838 - 2180614 - browsergym.experiments.loop - DEBUG - Sending action to environment. 
2024-10-28 01:41:39,838 - 2180614 - browsergym.core.env - DEBUG - Executing action 2024-10-28 01:41:39,943 - 2180614 - browsergym.core.env - DEBUG - _activate_page_from_js(page) called, page=<Page url='http://visualwebarena.eastus.cloudapp.azure.com:8082/catalogsearch/result/?q=xbox+one+console'> 2024-10-28 01:41:39,946 - 2180614 - browsergym.core.env - DEBUG - _activate_page_from_js(page) called, page=<Page url='http://visualwebarena.eastus.cloudapp.azure.com:8082/catalogsearch/result/?q=xbox+one+console'> 2024-10-28 01:41:39,948 - 2180614 - browsergym.core.env - DEBUG - _activate_page_from_js(page) called, page=<Page url='http://visualwebarena.eastus.cloudapp.azure.com:8082/catalogsearch/result/?q=xbox+one+console'> 2024-10-28 01:41:39,949 - 2180614 - browsergym.core.env - DEBUG - _activate_page_from_js(page) called, page=<Page url='http://visualwebarena.eastus.cloudapp.azure.com:8082/catalogsearch/result/?q=xbox+one+console'> 2024-10-28 01:41:39,951 - 2180614 - browsergym.core.env - DEBUG - _activate_page_from_js(page) called, page=<Page url='http://visualwebarena.eastus.cloudapp.azure.com:8082/catalogsearch/result/?q=xbox+one+console'> 2024-10-28 01:41:40,979 - 2180614 - browsergym.core.env - DEBUG - _activate_page_from_js(page) called, page=<Page url='http://visualwebarena.eastus.cloudapp.azure.com:8082/microsoft-xbox-one-console-with-accessories-1tb-hdd-black.html'> 2024-10-28 01:41:41,189 - 2180614 - browsergym.core.env - DEBUG - _activate_page_from_js(page) called, page=<Page url='http://visualwebarena.eastus.cloudapp.azure.com:8082/microsoft-xbox-one-console-with-accessories-1tb-hdd-black.html'> 2024-10-28 01:41:41,192 - 2180614 - browsergym.core.env - DEBUG - _activate_page_from_js(page) called, page=<Page url='http://visualwebarena.eastus.cloudapp.azure.com:8082/microsoft-xbox-one-console-with-accessories-1tb-hdd-black.html'> 2024-10-28 01:41:42,957 - 2180614 - browsergym.core.env - DEBUG - Action executed 2024-10-28 01:41:43,512 - 2180614 - 
browsergym.core.env - DEBUG - Active page checked 2024-10-28 01:41:43,513 - 2180614 - browsergym.core.env - DEBUG - User message done 2024-10-28 01:41:43,513 - 2180614 - browsergym.core.env - DEBUG - Initiating task validation 2024-10-28 01:41:43,691 - 2180614 - urllib3.connectionpool - DEBUG - Starting new HTTP connection (1): visualwebarena.eastus.cloudapp.azure.com:8082 2024-10-28 01:41:43,792 - 2180614 - urllib3.connectionpool - DEBUG - http://visualwebarena.eastus.cloudapp.azure.com:8082 "GET /media/catalog/product/cache/829a59e57f886f8cf0598ffca4f8a940/B/0/B00KL3WBBC.0.jpg HTTP/11" 200 14830 2024-10-28 01:41:43,860 - 2180614 - urllib3.connectionpool - DEBUG - Starting new HTTP connection (1): visualwebarena.eastus.cloudapp.azure.com:8082 2024-10-28 01:41:43,947 - 2180614 - urllib3.connectionpool - DEBUG - http://visualwebarena.eastus.cloudapp.azure.com:8082 "GET /media/catalog/product/cache/829a59e57f886f8cf0598ffca4f8a940/B/0/B00KL3WBBC.1.jpg HTTP/11" 200 17349 2024-10-28 01:41:43,976 - 2180614 - browsergym.experiments.loop - WARNING - Exception uncaught by agent or environment in task visualwebarena.597. 
TypeError: 'NoneType' object is not callable
Traceback (most recent call last):
  File "/home/toolkit/dev/BrowserGym/browsergym/experiments/src/browsergym/experiments/loop.py", line 246, in run
    step_info.from_step(env, action, obs_preprocessor=agent.obs_preprocessor)
  File "/home/toolkit/dev/BrowserGym/browsergym/experiments/src/browsergym/experiments/loop.py", line 379, in from_step
    self.obs, self.reward, self.terminated, self.truncated, env_info = env.step(action)
                                                                       ^^^^^^^^^^^^^^^^
  File "/home/toolkit/micromamba/envs/ui-assist/lib/python3.12/site-packages/gymnasium/wrappers/time_limit.py", line 57, in step
    observation, reward, terminated, truncated, info = self.env.step(action)
                                                       ^^^^^^^^^^^^^^^^^^^^^
  File "/home/toolkit/micromamba/envs/ui-assist/lib/python3.12/site-packages/gymnasium/wrappers/order_enforcing.py", line 56, in step
    return self.env.step(action)
           ^^^^^^^^^^^^^^^^^^^^^
  File "/home/toolkit/dev/BrowserGym/browsergym/core/src/browsergym/core/env.py", line 415, in step
    reward, done, user_message, task_info = self._task_validate()
                                            ^^^^^^^^^^^^^^^^^^^^^
  File "/home/toolkit/dev/BrowserGym/browsergym/core/src/browsergym/core/env.py", line 440, in _task_validate
    reward, done, user_message, info = self.task.validate(self.page, self.chat.messages)
                                       ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/toolkit/dev/BrowserGym/browsergym/visualwebarena/src/browsergym/visualwebarena/task.py", line 259, in validate
    score = self.evaluator(
            ^^^^^^^^^^^^^^^
  File "/home/toolkit/micromamba/envs/ui-assist/lib/python3.12/site-packages/visualwebarena/evaluation_harness/evaluators.py", line 636, in __call__
    cur_score = evaluator(trajectory, config_file, page)
                ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "<@beartype(visualwebarena.evaluation_harness.evaluators.PageImageEvaluator.__call__) at 0x7fb75721b2e0>", line 92, in __call__
  File "/home/toolkit/micromamba/envs/ui-assist/lib/python3.12/site-packages/visualwebarena/evaluation_harness/evaluators.py", line 576, in __call__
    pred_ans = self.captioning_fn(
               ^^^^^^^^^^^^^^^^^^^
TypeError: 'NoneType' object is not callable
2024-10-28 01:41:03,079 - 2186124 - browsergym.experiments.loop - INFO - Running experiment GenericAgent-gpt-4o-mini-2024-07-18_on_visualwebarena.598_10 in: /home/toolkit/agentlab_results/2024-10-24_21-33-14_GenericAgent-gpt-4o-mini-2024-07-18_on_visualwebarena/2024-10-28_01-38-11_GenericAgent-gpt-4o-mini-2024-07-18_on_visualwebarena.598_10 2024-10-28 01:41:03,081 - 2186124 - httpx - DEBUG - load_ssl_context verify=True cert=None trust_env=True http2=False 2024-10-28 01:41:03,082 - 2186124 - httpx - DEBUG - load_verify_locations cafile='/home/toolkit/micromamba/envs/ui-assist/lib/python3.12/site-packages/certifi/cacert.pem' 2024-10-28 01:41:03,110 - 2186124 - browsergym.experiments.loop - DEBUG - Agent created. 2024-10-28 01:41:03,171 - 2186124 - browsergym.experiments.loop - DEBUG - Environment created. 2024-10-28 01:41:03,720 - 2186124 - asyncio - DEBUG - Using selector: EpollSelector 2024-10-28 01:41:07,604 - 2186124 - httpx - DEBUG - load_ssl_context verify=True cert=None trust_env=True http2=False 2024-10-28 01:41:07,605 - 2186124 - httpx - DEBUG - load_verify_locations cafile='/home/toolkit/micromamba/envs/ui-assist/lib/python3.12/site-packages/certifi/cacert.pem' 2024-10-28 01:41:07,615 - 2186124 - httpx - DEBUG - load_ssl_context verify=True cert=None trust_env=True http2=False ... ...truncated middle of the log ... action: click('1528') # Click on the first headphone option to view details. 2024-10-28 01:41:58,377 - 2186124 - browsergym.experiments.loop - DEBUG - Chat info sent. 2024-10-28 01:41:58,377 - 2186124 - browsergym.experiments.loop - DEBUG - Sending action to environment. 
2024-10-28 01:41:58,377 - 2186124 - browsergym.core.env - DEBUG - Executing action 2024-10-28 01:41:58,494 - 2186124 - browsergym.core.env - DEBUG - _activate_page_from_js(page) called, page=<Page url='http://visualwebarena.eastus.cloudapp.azure.com:8082/catalogsearch/result/?q=Sony+headphones'> 2024-10-28 01:41:58,498 - 2186124 - browsergym.core.env - DEBUG - _activate_page_from_js(page) called, page=<Page url='http://visualwebarena.eastus.cloudapp.azure.com:8082/catalogsearch/result/?q=Sony+headphones'> 2024-10-28 01:41:58,499 - 2186124 - browsergym.core.env - DEBUG - _activate_page_from_js(page) called, page=<Page url='http://visualwebarena.eastus.cloudapp.azure.com:8082/catalogsearch/result/?q=Sony+headphones'> 2024-10-28 01:41:58,502 - 2186124 - browsergym.core.env - DEBUG - _activate_page_from_js(page) called, page=<Page url='http://visualwebarena.eastus.cloudapp.azure.com:8082/catalogsearch/result/?q=Sony+headphones'> 2024-10-28 01:41:58,503 - 2186124 - browsergym.core.env - DEBUG - _activate_page_from_js(page) called, page=<Page url='http://visualwebarena.eastus.cloudapp.azure.com:8082/catalogsearch/result/?q=Sony+headphones'> 2024-10-28 01:41:59,328 - 2186124 - browsergym.core.env - DEBUG - _activate_page_from_js(page) called, page=<Page url='http://visualwebarena.eastus.cloudapp.azure.com:8082/sony-zx110nc-noise-cancelling-headphones.html'> 2024-10-28 01:41:59,547 - 2186124 - browsergym.core.env - DEBUG - _activate_page_from_js(page) called, page=<Page url='http://visualwebarena.eastus.cloudapp.azure.com:8082/sony-zx110nc-noise-cancelling-headphones.html'> 2024-10-28 01:41:59,550 - 2186124 - browsergym.core.env - DEBUG - _activate_page_from_js(page) called, page=<Page url='http://visualwebarena.eastus.cloudapp.azure.com:8082/sony-zx110nc-noise-cancelling-headphones.html'> 2024-10-28 01:42:01,497 - 2186124 - browsergym.core.env - DEBUG - Action executed 2024-10-28 01:42:02,052 - 2186124 - browsergym.core.env - DEBUG - Active page checked 2024-10-28 
01:42:02,052 - 2186124 - browsergym.core.env - DEBUG - User message done 2024-10-28 01:42:02,052 - 2186124 - browsergym.core.env - DEBUG - Initiating task validation 2024-10-28 01:42:02,222 - 2186124 - urllib3.connectionpool - DEBUG - Starting new HTTP connection (1): visualwebarena.eastus.cloudapp.azure.com:8082 2024-10-28 01:42:02,328 - 2186124 - urllib3.connectionpool - DEBUG - http://visualwebarena.eastus.cloudapp.azure.com:8082 "GET /media/catalog/product/cache/829a59e57f886f8cf0598ffca4f8a940/B/0/B01K5CTWPG.0.jpg HTTP/11" 200 17096 2024-10-28 01:42:02,417 - 2186124 - urllib3.connectionpool - DEBUG - Starting new HTTP connection (1): visualwebarena.eastus.cloudapp.azure.com:8082 2024-10-28 01:42:02,487 - 2186124 - urllib3.connectionpool - DEBUG - http://visualwebarena.eastus.cloudapp.azure.com:8082 "GET /media/catalog/product/cache/829a59e57f886f8cf0598ffca4f8a940/B/0/B01K5CTWPG.1.jpg HTTP/11" 200 18600 2024-10-28 01:42:02,518 - 2186124 - browsergym.experiments.loop - WARNING - Exception uncaught by agent or environment in task visualwebarena.598. 
TypeError: 'NoneType' object is not callable
Traceback (most recent call last):
  File "/home/toolkit/dev/BrowserGym/browsergym/experiments/src/browsergym/experiments/loop.py", line 246, in run
    step_info.from_step(env, action, obs_preprocessor=agent.obs_preprocessor)
  File "/home/toolkit/dev/BrowserGym/browsergym/experiments/src/browsergym/experiments/loop.py", line 379, in from_step
    self.obs, self.reward, self.terminated, self.truncated, env_info = env.step(action)
                                                                       ^^^^^^^^^^^^^^^^
  File "/home/toolkit/micromamba/envs/ui-assist/lib/python3.12/site-packages/gymnasium/wrappers/time_limit.py", line 57, in step
    observation, reward, terminated, truncated, info = self.env.step(action)
                                                       ^^^^^^^^^^^^^^^^^^^^^
  File "/home/toolkit/micromamba/envs/ui-assist/lib/python3.12/site-packages/gymnasium/wrappers/order_enforcing.py", line 56, in step
    return self.env.step(action)
           ^^^^^^^^^^^^^^^^^^^^^
  File "/home/toolkit/dev/BrowserGym/browsergym/core/src/browsergym/core/env.py", line 415, in step
    reward, done, user_message, task_info = self._task_validate()
                                            ^^^^^^^^^^^^^^^^^^^^^
  File "/home/toolkit/dev/BrowserGym/browsergym/core/src/browsergym/core/env.py", line 440, in _task_validate
    reward, done, user_message, info = self.task.validate(self.page, self.chat.messages)
                                       ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/toolkit/dev/BrowserGym/browsergym/visualwebarena/src/browsergym/visualwebarena/task.py", line 259, in validate
    score = self.evaluator(
            ^^^^^^^^^^^^^^^
  File "/home/toolkit/micromamba/envs/ui-assist/lib/python3.12/site-packages/visualwebarena/evaluation_harness/evaluators.py", line 636, in __call__
    cur_score = evaluator(trajectory, config_file, page)
                ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "<@beartype(visualwebarena.evaluation_harness.evaluators.PageImageEvaluator.__call__) at 0x7f2f38f731a0>", line 92, in __call__
  File "/home/toolkit/micromamba/envs/ui-assist/lib/python3.12/site-packages/visualwebarena/evaluation_harness/evaluators.py", line 576, in __call__
    pred_ans = self.captioning_fn(
               ^^^^^^^^^^^^^^^^^^^
TypeError: 'NoneType' object is not callable
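In both traces above, `self.captioning_fn` is `None` when `PageImageEvaluator.__call__` tries to call it, so the problem only surfaces at validation time, after the agent has already acted. A fail-fast check is one way to surface a missing captioner earlier and with a clearer message. The class below is a hypothetical stand-in, not the visualwebarena evaluator; how the real evaluator receives its captioning function is not shown in the logs.

```python
from typing import Callable, List, Optional


class ImageEvaluatorSketch:
    """Hypothetical evaluator that depends on an optional captioning function."""

    def __init__(self, captioning_fn: Optional[Callable[[List[str]], List[str]]] = None):
        # Reject values that are neither None nor callable as early as possible.
        if captioning_fn is not None and not callable(captioning_fn):
            raise TypeError("captioning_fn must be callable or None")
        self.captioning_fn = captioning_fn

    def require_captioner(self) -> Callable[[List[str]], List[str]]:
        """Return the captioning function, or raise a descriptive error if unset."""
        if self.captioning_fn is None:
            raise RuntimeError(
                "This evaluator needs a captioning function, but none was configured."
            )
        return self.captioning_fn

    def caption(self, images: List[str]) -> List[str]:
        """Caption the given images using the configured function."""
        return self.require_captioner()(images)
```

Calling `require_captioner()` once before the episode starts would turn the late `TypeError: 'NoneType' object is not callable` into an immediate, descriptive failure. This is only a sketch of the design choice, not a fix for the underlying configuration.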
2024-10-28 01:38:58,353 - 2180614 - browsergym.experiments.loop - INFO - Running experiment GenericAgent-gpt-4o-mini-2024-07-18_on_visualwebarena.248_30 in: /home/toolkit/agentlab_results/2024-10-24_21-33-14_GenericAgent-gpt-4o-mini-2024-07-18_on_visualwebarena/2024-10-28_01-38-10_GenericAgent-gpt-4o-mini-2024-07-18_on_visualwebarena.248_30 2024-10-28 01:38:58,355 - 2180614 - httpx - DEBUG - load_ssl_context verify=True cert=None trust_env=True http2=False 2024-10-28 01:38:58,356 - 2180614 - httpx - DEBUG - load_verify_locations cafile='/home/toolkit/micromamba/envs/ui-assist/lib/python3.12/site-packages/certifi/cacert.pem' 2024-10-28 01:38:58,381 - 2180614 - browsergym.experiments.loop - DEBUG - Agent created. 2024-10-28 01:38:58,440 - 2180614 - browsergym.experiments.loop - DEBUG - Environment created. 2024-10-28 01:38:58,841 - 2180614 - asyncio - DEBUG - Using selector: EpollSelector 2024-10-28 01:39:01,588 - 2180614 - httpx - DEBUG - load_ssl_context verify=True cert=None trust_env=True http2=False 2024-10-28 01:39:01,589 - 2180614 - httpx - DEBUG - load_verify_locations cafile='/home/toolkit/micromamba/envs/ui-assist/lib/python3.12/site-packages/certifi/cacert.pem' 2024-10-28 01:39:01,598 - 2180614 - httpx - DEBUG - load_ssl_context verify=True cert=None trust_env=True http2=False ... ...truncated middle of the log ... action: click('324') 2024-10-28 01:39:35,218 - 2180614 - browsergym.experiments.loop - DEBUG - Chat info sent. 2024-10-28 01:39:35,218 - 2180614 - browsergym.experiments.loop - DEBUG - Sending action to environment. 
2024-10-28 01:39:35,218 - 2180614 - browsergym.core.env - DEBUG - Executing action 2024-10-28 01:39:35,334 - 2180614 - browsergym.core.env - DEBUG - _activate_page_from_js(page) called, page=<Page url='http://visualwebarena.eastus.cloudapp.azure.com:8080/forums'> 2024-10-28 01:39:35,337 - 2180614 - browsergym.core.env - DEBUG - _activate_page_from_js(page) called, page=<Page url='http://visualwebarena.eastus.cloudapp.azure.com:8080/forums'> 2024-10-28 01:39:35,340 - 2180614 - browsergym.core.env - DEBUG - _activate_page_from_js(page) called, page=<Page url='http://visualwebarena.eastus.cloudapp.azure.com:8080/forums'> 2024-10-28 01:39:35,343 - 2180614 - browsergym.core.env - DEBUG - _activate_page_from_js(page) called, page=<Page url='http://visualwebarena.eastus.cloudapp.azure.com:8080/forums'> 2024-10-28 01:39:35,345 - 2180614 - browsergym.core.env - DEBUG - _activate_page_from_js(page) called, page=<Page url='http://visualwebarena.eastus.cloudapp.azure.com:8080/forums'> 2024-10-28 01:39:38,340 - 2180614 - browsergym.core.env - DEBUG - Action executed 2024-10-28 01:39:38,896 - 2180614 - browsergym.core.env - DEBUG - Active page checked 2024-10-28 01:39:38,896 - 2180614 - browsergym.core.env - DEBUG - User message done 2024-10-28 01:39:38,896 - 2180614 - browsergym.core.env - DEBUG - Initiating task validation 2024-10-28 01:39:41,532 - 2180614 - browsergym.experiments.loop - WARNING - Exception uncaught by agent or environment in task visualwebarena.248. 
Error: Execution context was destroyed, most likely because of a navigation
Traceback (most recent call last):
  File "/home/toolkit/dev/BrowserGym/browsergym/experiments/src/browsergym/experiments/loop.py", line 246, in run
    step_info.from_step(env, action, obs_preprocessor=agent.obs_preprocessor)
  File "/home/toolkit/dev/BrowserGym/browsergym/experiments/src/browsergym/experiments/loop.py", line 379, in from_step
    self.obs, self.reward, self.terminated, self.truncated, env_info = env.step(action)
                                                                       ^^^^^^^^^^^^^^^^
  File "/home/toolkit/micromamba/envs/ui-assist/lib/python3.12/site-packages/gymnasium/wrappers/time_limit.py", line 57, in step
    observation, reward, terminated, truncated, info = self.env.step(action)
                                                       ^^^^^^^^^^^^^^^^^^^^^
  File "/home/toolkit/micromamba/envs/ui-assist/lib/python3.12/site-packages/gymnasium/wrappers/order_enforcing.py", line 56, in step
    return self.env.step(action)
           ^^^^^^^^^^^^^^^^^^^^^
  File "/home/toolkit/dev/BrowserGym/browsergym/core/src/browsergym/core/env.py", line 415, in step
    reward, done, user_message, task_info = self._task_validate()
                                            ^^^^^^^^^^^^^^^^^^^^^
  File "/home/toolkit/dev/BrowserGym/browsergym/core/src/browsergym/core/env.py", line 440, in _task_validate
    reward, done, user_message, info = self.task.validate(self.page, self.chat.messages)
                                       ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/toolkit/dev/BrowserGym/browsergym/visualwebarena/src/browsergym/visualwebarena/task.py", line 259, in validate
    score = self.evaluator(
            ^^^^^^^^^^^^^^^
  File "/home/toolkit/micromamba/envs/ui-assist/lib/python3.12/site-packages/visualwebarena/evaluation_harness/evaluators.py", line 636, in __call__
    cur_score = evaluator(trajectory, config_file, page)
                ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "<@beartype(visualwebarena.evaluation_harness.evaluators.PageImageEvaluator.__call__) at 0x7fb75721b2e0>", line 92, in __call__
  File "/home/toolkit/micromamba/envs/ui-assist/lib/python3.12/site-packages/visualwebarena/evaluation_harness/evaluators.py", line 532, in __call__
    elements = page.query_selector_all(locator)
               ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/toolkit/micromamba/envs/ui-assist/lib/python3.12/site-packages/playwright/sync_api/_generated.py", line 8237, in query_selector_all
    self._sync(self._impl_obj.query_selector_all(selector=selector))
  File "/home/toolkit/micromamba/envs/ui-assist/lib/python3.12/site-packages/playwright/_impl/_sync_base.py", line 109, in _sync
    return task.result()
           ^^^^^^^^^^^^^
  File "/home/toolkit/micromamba/envs/ui-assist/lib/python3.12/site-packages/playwright/_impl/_page.py", line 337, in query_selector_all
    return await self._main_frame.query_selector_all(selector)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/toolkit/micromamba/envs/ui-assist/lib/python3.12/site-packages/playwright/_impl/_frame.py", line 304, in query_selector_all
    await self._channel.send("querySelectorAll", dict(selector=selector)),
    ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/toolkit/micromamba/envs/ui-assist/lib/python3.12/site-packages/playwright/_impl/_connection.py", line 61, in send
    return await self._connection.wrap_api_call(
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/toolkit/micromamba/envs/ui-assist/lib/python3.12/site-packages/playwright/_impl/_connection.py", line 490, in wrap_api_call
    return await cb()
           ^^^^^^^^^^
  File "/home/toolkit/micromamba/envs/ui-assist/lib/python3.12/site-packages/playwright/_impl/_connection.py", line 99, in inner_send
    result = next(iter(done)).result()
             ^^^^^^^^^^^^^^^^^^^^^^^^^
playwright._impl._api_types.Error: Execution context was destroyed, most likely because of a navigation
2024-10-28 01:39:41,563 - 2180614 - browsergym.core.env - DEBUG - _activate_page_from_js(page) called, page=<Page url='http://visualwebarena.eastus.cloudapp.azure.com:8080/f/pics'>
2024-10-28 01:38:58,350 - 2180616 - browsergym.experiments.loop - INFO - Running experiment GenericAgent-gpt-4o-mini-2024-07-18_on_visualwebarena.249_32 in: /home/toolkit/agentlab_results/2024-10-24_21-33-14_GenericAgent-gpt-4o-mini-2024-07-18_on_visualwebarena/2024-10-28_01-38-10_GenericAgent-gpt-4o-mini-2024-07-18_on_visualwebarena.249_32
2024-10-28 01:38:58,351 - 2180616 - httpx - DEBUG - load_ssl_context verify=True cert=None trust_env=True http2=False
2024-10-28 01:38:58,352 - 2180616 - httpx - DEBUG - load_verify_locations cafile='/home/toolkit/micromamba/envs/ui-assist/lib/python3.12/site-packages/certifi/cacert.pem'
2024-10-28 01:38:58,375 - 2180616 - browsergym.experiments.loop - DEBUG - Agent created.
2024-10-28 01:38:58,434 - 2180616 - browsergym.experiments.loop - DEBUG - Environment created.
2024-10-28 01:38:58,841 - 2180616 - asyncio - DEBUG - Using selector: EpollSelector
2024-10-28 01:39:01,545 - 2180616 - httpx - DEBUG - load_ssl_context verify=True cert=None trust_env=True http2=False
2024-10-28 01:39:01,546 - 2180616 - httpx - DEBUG - load_verify_locations cafile='/home/toolkit/micromamba/envs/ui-assist/lib/python3.12/site-packages/certifi/cacert.pem'
2024-10-28 01:39:01,555 - 2180616 - httpx - DEBUG - load_ssl_context verify=True cert=None trust_env=True http2=False
... ...truncated middle of the log ...
action: click('110')
2024-10-28 01:39:17,341 - 2180616 - browsergym.experiments.loop - DEBUG - Chat info sent.
2024-10-28 01:39:17,341 - 2180616 - browsergym.experiments.loop - DEBUG - Sending action to environment.
2024-10-28 01:39:17,341 - 2180616 - browsergym.core.env - DEBUG - Executing action
2024-10-28 01:39:17,451 - 2180616 - browsergym.core.env - DEBUG - _activate_page_from_js(page) called, page=<Page url='http://visualwebarena.eastus.cloudapp.azure.com:8080/forums/all'>
2024-10-28 01:39:17,454 - 2180616 - browsergym.core.env - DEBUG - _activate_page_from_js(page) called, page=<Page url='http://visualwebarena.eastus.cloudapp.azure.com:8080/forums/all'>
2024-10-28 01:39:17,457 - 2180616 - browsergym.core.env - DEBUG - _activate_page_from_js(page) called, page=<Page url='http://visualwebarena.eastus.cloudapp.azure.com:8080/forums/all'>
2024-10-28 01:39:17,459 - 2180616 - browsergym.core.env - DEBUG - _activate_page_from_js(page) called, page=<Page url='http://visualwebarena.eastus.cloudapp.azure.com:8080/forums/all'>
2024-10-28 01:39:17,461 - 2180616 - browsergym.core.env - DEBUG - _activate_page_from_js(page) called, page=<Page url='http://visualwebarena.eastus.cloudapp.azure.com:8080/forums/all'>
2024-10-28 01:39:20,472 - 2180616 - browsergym.core.env - DEBUG - Action executed
2024-10-28 01:39:21,026 - 2180616 - browsergym.core.env - DEBUG - Active page checked
2024-10-28 01:39:21,026 - 2180616 - browsergym.core.env - DEBUG - User message done
2024-10-28 01:39:21,026 - 2180616 - browsergym.core.env - DEBUG - Initiating task validation
2024-10-28 01:39:21,149 - 2180616 - browsergym.experiments.loop - WARNING - Exception uncaught by agent or environment in task visualwebarena.249.
Error: Execution context was destroyed, most likely because of a navigation
Traceback (most recent call last):
  File "/home/toolkit/dev/BrowserGym/browsergym/experiments/src/browsergym/experiments/loop.py", line 246, in run
    step_info.from_step(env, action, obs_preprocessor=agent.obs_preprocessor)
  File "/home/toolkit/dev/BrowserGym/browsergym/experiments/src/browsergym/experiments/loop.py", line 379, in from_step
    self.obs, self.reward, self.terminated, self.truncated, env_info = env.step(action)
                                                                       ^^^^^^^^^^^^^^^^
  File "/home/toolkit/micromamba/envs/ui-assist/lib/python3.12/site-packages/gymnasium/wrappers/time_limit.py", line 57, in step
    observation, reward, terminated, truncated, info = self.env.step(action)
                                                       ^^^^^^^^^^^^^^^^^^^^^
  File "/home/toolkit/micromamba/envs/ui-assist/lib/python3.12/site-packages/gymnasium/wrappers/order_enforcing.py", line 56, in step
    return self.env.step(action)
           ^^^^^^^^^^^^^^^^^^^^^
  File "/home/toolkit/dev/BrowserGym/browsergym/core/src/browsergym/core/env.py", line 415, in step
    reward, done, user_message, task_info = self._task_validate()
                                            ^^^^^^^^^^^^^^^^^^^^^
  File "/home/toolkit/dev/BrowserGym/browsergym/core/src/browsergym/core/env.py", line 440, in _task_validate
    reward, done, user_message, info = self.task.validate(self.page, self.chat.messages)
                                       ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/toolkit/dev/BrowserGym/browsergym/visualwebarena/src/browsergym/visualwebarena/task.py", line 259, in validate
    score = self.evaluator(
            ^^^^^^^^^^^^^^^
  File "/home/toolkit/micromamba/envs/ui-assist/lib/python3.12/site-packages/visualwebarena/evaluation_harness/evaluators.py", line 636, in __call__
    cur_score = evaluator(trajectory, config_file, page)
                ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "<@beartype(visualwebarena.evaluation_harness.evaluators.PageImageEvaluator.__call__) at 0x7f52afb932e0>", line 92, in __call__
  File "/home/toolkit/micromamba/envs/ui-assist/lib/python3.12/site-packages/visualwebarena/evaluation_harness/evaluators.py", line 532, in __call__
    elements = page.query_selector_all(locator)
               ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/toolkit/micromamba/envs/ui-assist/lib/python3.12/site-packages/playwright/sync_api/_generated.py", line 8237, in query_selector_all
    self._sync(self._impl_obj.query_selector_all(selector=selector))
  File "/home/toolkit/micromamba/envs/ui-assist/lib/python3.12/site-packages/playwright/_impl/_sync_base.py", line 109, in _sync
    return task.result()
           ^^^^^^^^^^^^^
  File "/home/toolkit/micromamba/envs/ui-assist/lib/python3.12/site-packages/playwright/_impl/_page.py", line 337, in query_selector_all
    return await self._main_frame.query_selector_all(selector)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/toolkit/micromamba/envs/ui-assist/lib/python3.12/site-packages/playwright/_impl/_frame.py", line 304, in query_selector_all
    await self._channel.send("querySelectorAll", dict(selector=selector)),
    ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/toolkit/micromamba/envs/ui-assist/lib/python3.12/site-packages/playwright/_impl/_connection.py", line 61, in send
    return await self._connection.wrap_api_call(
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/toolkit/micromamba/envs/ui-assist/lib/python3.12/site-packages/playwright/_impl/_connection.py", line 490, in wrap_api_call
    return await cb()
           ^^^^^^^^^^
  File "/home/toolkit/micromamba/envs/ui-assist/lib/python3.12/site-packages/playwright/_impl/_connection.py", line 99, in inner_send
    result = next(iter(done)).result()
             ^^^^^^^^^^^^^^^^^^^^^^^^^
playwright._impl._api_types.Error: Execution context was destroyed, most likely because of a navigation
2024-10-28 01:39:21,177 - 2180616 - browsergym.core.env - DEBUG - _activate_page_from_js(page) called, page=<Page url='http://visualwebarena.eastus.cloudapp.azure.com:8080/forums'>
2024-10-28 01:39:21,236 - 2180616 - browsergym.core.env - DEBUG - _activate_page_from_js(page) called, page=<Page url='http://visualwebarena.eastus.cloudapp.azure.com:8080/forums'>
2024-10-28 01:39:21,238 - 2180616 - browsergym.core.env - DEBUG - _activate_page_from_js(page) called, page=<Page url='http://visualwebarena.eastus.cloudapp.azure.com:8080/forums'>
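The trace above fails inside `page.query_selector_all(locator)` while the page is still navigating (the log shows the URL moving from `/forums/all` to `/forums` around the same timestamp). One possible workaround, sketched here as an assumption and not as what BrowserGym or visualwebarena actually do, is to retry the validation call when this specific error message appears. `retry_on_navigation` and its parameters are hypothetical names introduced for illustration; the helper uses only the standard library and matches on the error text shown in the traceback.

```python
import time

# Substring copied from the Playwright error message in the traceback above.
CONTEXT_DESTROYED = "Execution context was destroyed"


def retry_on_navigation(fn, attempts=3, delay_s=0.5):
    """Call fn() and retry when it fails with the 'context destroyed' error.

    Hypothetical helper: not part of BrowserGym, visualwebarena, or Playwright.
    Any other exception, or the last failed attempt, is re-raised unchanged.
    """
    for attempt in range(1, attempts + 1):
        try:
            return fn()
        except Exception as exc:
            # Only swallow the navigation race; everything else propagates.
            if CONTEXT_DESTROYED not in str(exc) or attempt == attempts:
                raise
            # Give the in-flight navigation time to settle before retrying.
            time.sleep(delay_s)
```

With a Playwright `page`, this might be used as `retry_on_navigation(lambda: page.query_selector_all(locator))`. Whether a retry is appropriate here depends on why the page navigates during validation, which the log alone does not show.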
I'll close this issue. Please open a new one if you still see errors.
16x : Exception uncaught by agent or environment in task.
TimeoutError:
Timeout 10000ms exceeded.
========================
Showing Max 2 stack traces:
8x : Exception uncaught by agent or environment in task.
TypeError:
'NoneType' object is not callable
Showing Max 2 stack traces:
2x : Exception uncaught by agent or environment in task.
Error:
Execution context was destroyed, most likely because of a navigation
Showing Max 2 stack traces: