Closed fardinabbasi closed 10 months ago
@fardinabbasi It looks like the remote server doesn't have access to the comet.com server. If it doesn't, then you can still run your code, changing one line:
Change:
from comet_ml import Experiment
to:
from comet_ml import OfflineExperiment as Experiment
That will create a zip file on the remore machine that you can retrieve and log to comet. You will loose live updates, however.
For more information, see: https://www.comet.com/docs/v2/api-and-sdk/python-sdk/advanced/running-offline/
Does that help?
What is your question related to?
What is your question?
I am running an RLlib experiment on a remote server using the bash shell, using Comet as callbacks. However, I am encountering an error where the run will not be logged. Here are the details of the issue:
Code
COMET WARNING: Failed to check backend version at URL: 'https://www.comet.com/clientlib/isAlive/ver' COMET ERROR: Run will not be logged For more details, please refer to: https://www.comet.com/docs/v2/api-and-sdk/python-sdk/warnings-errors/ Traceback (most recent call last): File "/mainfs/scratch/sb5e19/.conda/envs/py39/lib/python3.9/site-packages/urllib3/connection.py", line 203, in _new_conn sock = connection.create_connection( File "/mainfs/scratch/sb5e19/.conda/envs/py39/lib/python3.9/site-packages/urllib3/util/connection.py", line 85, in create_connection raise err File "/mainfs/scratch/sb5e19/.conda/envs/py39/lib/python3.9/site-packages/urllib3/util/connection.py", line 73, in create_connection sock.connect(sa) OSError: [Errno 101] Network is unreachable The above exception was the direct cause of the following exception:
Traceback (most recent call last): File "/mainfs/scratch/sb5e19/.conda/envs/py39/lib/python3.9/site-packages/urllib3/connectionpool.py", line 790, in urlopen response = self._make_request( File "/mainfs/scratch/sb5e19/.conda/envs/py39/lib/python3.9/site-packages/urllib3/connectionpool.py", line 491, in _make_request raise new_e File "/mainfs/scratch/sb5e19/.conda/envs/py39/lib/python3.9/site-packages/urllib3/connectionpool.py", line 467, in _make_request self._validate_conn(conn) File "/mainfs/scratch/sb5e19/.conda/envs/py39/lib/python3.9/site-packages/urllib3/connectionpool.py", line 1092, in _validate_conn conn.connect() File "/mainfs/scratch/sb5e19/.conda/envs/py39/lib/python3.9/site-packages/urllib3/connection.py", line 611, in connect self.sock = sock = self._new_conn() File "/mainfs/scratch/sb5e19/.conda/envs/py39/lib/python3.9/site-packages/urllib3/connection.py", line 218, in _new_conn raise NewConnectionError( urllib3.exceptions.NewConnectionError: <urllib3.connection.HTTPSConnection object at 0x2b4b3692b580>: Failed to establish a new connection: [Errno 101] Network is unreachable
The above exception was the direct cause of the following exception:
Traceback (most recent call last): File "/mainfs/scratch/sb5e19/.conda/envs/py39/lib/python3.9/site-packages/requests/adapters.py", line 486, in send resp = conn.urlopen( File "/mainfs/scratch/sb5e19/.conda/envs/py39/lib/python3.9/site-packages/urllib3/connectionpool.py", line 874, in urlopen return self.urlopen( File "/mainfs/scratch/sb5e19/.conda/envs/py39/lib/python3.9/site-packages/urllib3/connectionpool.py", line 874, in urlopen return self.urlopen( File "/mainfs/scratch/sb5e19/.conda/envs/py39/lib/python3.9/site-packages/urllib3/connectionpool.py", line 874, in urlopen return self.urlopen( File "/mainfs/scratch/sb5e19/.conda/envs/py39/lib/python3.9/site-packages/urllib3/connectionpool.py", line 844, in urlopen retries = retries.increment( File "/mainfs/scratch/sb5e19/.conda/envs/py39/lib/python3.9/site-packages/urllib3/util/retry.py", line 515, in increment raise MaxRetryError(_pool, url, reason) from reason # type: ignore[arg-type] urllib3.exceptions.MaxRetryError: HTTPSConnectionPool(host='www.comet.com', port=443): Max retries exceeded with url: /clientlib/logger/add/run (Caused by NewConnectionError('<urllib3.connection.HTTPSConnection object at 0x2b4b3692b580>: Failed to establish a new connection: [Errno 101] Network is unreachable'))
During handling of the above exception, another exception occurred:
Traceback (most recent call last): File "/mainfs/scratch/sb5e19/.conda/envs/py39/lib/python3.9/site-packages/comet_ml/experiment.py", line 1004, in _start self.alive = self._setup_streamer() File "/mainfs/scratch/sb5e19/.conda/envs/py39/lib/python3.9/site-packages/comet_ml/_online.py", line 312, in _setup_streamer results = self._authenticate() File "/mainfs/scratch/sb5e19/.conda/envs/py39/lib/python3.9/site-packages/comet_ml/_online.py", line 398, in _authenticate run_id_response = self._get_run_id() File "/mainfs/scratch/sb5e19/.conda/envs/py39/lib/python3.9/site-packages/comet_ml/_online.py", line 435, in _get_run_id return self.connection.get_run_id(self.project_name, self.workspace) File "/mainfs/scratch/sb5e19/.conda/envs/py39/lib/python3.9/site-packages/comet_ml/connection.py", line 868, in get_run_id r = self._low_level_http_client.post( File "/mainfs/scratch/sb5e19/.conda/envs/py39/lib/python3.9/site-packages/comet_ml/connection.py", line 571, in post return self.do( File "/mainfs/scratch/sb5e19/.conda/envs/py39/lib/python3.9/site-packages/comet_ml/connection.py", line 677, in do response = session.request( File "/mainfs/scratch/sb5e19/.conda/envs/py39/lib/python3.9/site-packages/requests/sessions.py", line 589, in request resp = self.send(prep, send_kwargs) File "/mainfs/scratch/sb5e19/.conda/envs/py39/lib/python3.9/site-packages/requests/sessions.py", line 703, in send r = adapter.send(request, kwargs) File "/mainfs/scratch/sb5e19/.conda/envs/py39/lib/python3.9/site-packages/requests/adapters.py", line 519, in send raise ConnectionError(e, request=request) requests.exceptions.ConnectionError: HTTPSConnectionPool(host='www.comet.com', port=443): Max retries exceeded with url: /clientlib/logger/add/run (Caused by NewConnectionError('<urllib3.connection.HTTPSConnection object at 0x2b4b3692b580>: Failed to establish a new connection: [Errno 101] Network is unreachable'))
What have you tried?