CoffeaTeam / lpcjobqueue

A dask-jobqueue plugin for the LPC Condor queue
BSD 3-Clause "New" or "Revised" License
12 stars 11 forks source link

Problem in AlmaLinux 9.3 #33

Open alefisico opened 4 months ago

alefisico commented 4 months ago

Hey

I am running in an el9 machine at the LPC. I am running the container coffeateam/coffea-dask:0.7.22-py3.10-ga998f with the lpcjq version 0.3.1, and I am getting this error when I tried to run in condor:

[05/06/24 15:07:08] WARNING  WARNING:distributed.deploy.spec:Cluster closed without starting up         spec.py:450
Traceback (most recent call last):
  File "/opt/conda/lib/python3.10/site-packages/distributed/deploy/spec.py", line 319, in _start
    self.scheduler = cls(**self.scheduler_spec.get("options", {}))
  File "/opt/conda/lib/python3.10/site-packages/distributed/scheduler.py", line 3477, in __init__
    routes = get_handlers(
  File "/opt/conda/lib/python3.10/site-packages/distributed/http/utils.py", line 40, in get_handlers
    module = importlib.import_module(module_name)
  File "/opt/conda/lib/python3.10/importlib/__init__.py", line 126, in import_module
    return _bootstrap._gcd_import(name[level:], package, level)
  File "<frozen importlib._bootstrap>", line 1050, in _gcd_import
  File "<frozen importlib._bootstrap>", line 1027, in _find_and_load
  File "<frozen importlib._bootstrap>", line 1006, in _find_and_load_unlocked
  File "<frozen importlib._bootstrap>", line 688, in _load_unlocked
  File "<frozen importlib._bootstrap_external>", line 883, in exec_module
  File "<frozen importlib._bootstrap>", line 241, in _call_with_frames_removed
  File "/opt/conda/lib/python3.10/site-packages/distributed/http/proxy.py", line 10, in <module>
    from jupyter_server_proxy.handlers import ProxyHandler
  File "/opt/conda/lib/python3.10/site-packages/jupyter_server_proxy/__init__.py", line 3, in <module>
    from .api import IconHandler, ServersInfoHandler
  File "/opt/conda/lib/python3.10/site-packages/jupyter_server_proxy/api.py", line 3, in <module>
    from jupyter_server.base.handlers import JupyterHandler
  File "/opt/conda/lib/python3.10/site-packages/jupyter_server/base/handlers.py", line 24, in <module>
    from jupyter_events import EventLogger
  File "/opt/conda/lib/python3.10/site-packages/jupyter_events/__init__.py", line 3, in <module>
    from .logger import EVENTS_METADATA_VERSION, EventLogger
  File "/opt/conda/lib/python3.10/site-packages/jupyter_events/logger.py", line 19, in <module>
    from .schema import SchemaType
  File "/opt/conda/lib/python3.10/site-packages/jupyter_events/schema.py", line 18, in <module>
    from .validators import draft7_format_checker, validate_schema
  File "/opt/conda/lib/python3.10/site-packages/jupyter_events/validators.py", line 44, in <module>
    JUPYTER_EVENTS_SCHEMA_VALIDATOR = Draft7Validator(
TypeError: create.<locals>.Validator.__init__() got an unexpected keyword argument 'registry'

The above exception was the direct cause of the following exception:

Traceback (most recent call last):
  File "/srv/python/runner.py", line 323, in <module>
    cluster = LPCCondorCluster(**cluster_args)
  File "/srv/.env/lib/python3.10/site-packages/lpcjobqueue/cluster.py", line 246, in __init__
    super().__init__(**kwargs)
  File "/opt/conda/lib/python3.10/site-packages/dask_jobqueue/core.py", line 638, in __init__
    super().__init__(
  File "/opt/conda/lib/python3.10/site-packages/distributed/deploy/spec.py", line 286, in __init__
    self.sync(self._start)
  File "/opt/conda/lib/python3.10/site-packages/distributed/utils.py", line 345, in sync
    return sync(
  File "/opt/conda/lib/python3.10/site-packages/distributed/utils.py", line 412, in sync
    raise exc.with_traceback(tb)
  File "/opt/conda/lib/python3.10/site-packages/distributed/utils.py", line 385, in f
    result = yield future
  File "/opt/conda/lib/python3.10/site-packages/tornado/gen.py", line 767, in run
    value = future.result()
  File "/srv/.env/lib/python3.10/site-packages/lpcjobqueue/cluster.py", line 278, in _start
    await super()._start()
  File "/opt/conda/lib/python3.10/site-packages/distributed/deploy/spec.py", line 330, in _start
    raise RuntimeError(f"Cluster failed to start: {e}") from e
RuntimeError: Cluster failed to start: create.<locals>.Validator.__init__() got an unexpected keyword argument 'registry'

any idea how to fix this?