Open pelacables opened 2 years ago
Were you able to eventually solve it? If so, how?
no :-(
Thanks for letting me know.
In the half day since I asked, I found that the problem for me was that the batch nodes were trying to connect to the hub on the batch nodes themselves, rather than on the actual server where it is running. Setting the correct IP with c.JupyterHub.hub_ip
(and port) and making sure it was reachable from the batch nodes solved the issue.
what is the value, then? cause in my case it was the DNS alias without port/protocol. Maybe that's my issue, too.
I don't have the port/protocol there either. In my case the hub_ip is simply the hostname in the same way that I would use to ssh there, namely co35svhead01
As a separate setting, I have c.JupyterHub.port = 443
and I don't repeat that in the hub_ip
setting or anywhere else.
I suggest you try a separate install/configuration starting from zero and adding things one at the time. I found that often stuff that you think is unrelated instead has unintended consequences, so adding one thing at the time and trying helped me.
I have added the port but it still does not work. Would you mind sharing your conf with me, please?
Sure. I will do that later today.
This is what I have
c.JupyterHub.port = 443
c.JupyterHub.proxy_class = 'jupyterhub_traefik_proxy.TraefikTomlProxy'
c.JupyterHub.spawner_class = 'batchspawner.SlurmSpawner'
import batchspawner
c.SlurmSpawner.req_prologue = """
hostname
source /home/sw/modules.sh
spack env activate JH
"""
c.JupyterHub.hub_ip = "co35svhead01"
c.JupyterHub.ssl_cert = 'cert.pem'
c.JupyterHub.ssl_key = 'key.pem'
c.Spawner.default_url = '/lab'
Note the apparently useless import batchspawner
per their instructions
Thanks. Seems quute like mine, maybe some fw... i hate proxies.
El dv., 29 de jul. 2022, 21:11, Davide @.***> va escriure:
This is what I have
c.JupyterHub.port = 443 c.JupyterHub.proxy_class = 'jupyterhub_traefik_proxy.TraefikTomlProxy' c.JupyterHub.spawner_class = 'batchspawner.SlurmSpawner' import batchspawner c.SlurmSpawner.req_prologue = """ hostname source /home/sw/modules.sh spack env activate JH """ c.JupyterHub.hub_ip = "co35svhead01" c.JupyterHub.ssl_cert = 'cert.pem' c.JupyterHub.ssl_key = 'key.pem' c.Spawner.default_url = '/lab'
Note the apparently useless import batchspawner per their instructions
— Reply to this email directly, view it on GitHub https://github.com/jupyterhub/batchspawner/issues/233#issuecomment-1199866684, or unsubscribe https://github.com/notifications/unsubscribe-auth/AAYX2LS2FSUBKLE2HAUEABLVWQUEXANCNFSM5SFROYAA . You are receiving this because you authored the thread.Message ID: @.***>
Yes, it's a lot of moving parts and dependencies... I used traefik because I could not get node.js
to work for unrelated issues.....
@pelacables Did you get yours working? I'm having the same issue. worker notifies hub, but then nothing…
Unfortunately not. Let me know if you manage to solve the issue, please.
Hi, I believe to have encounter the same issue. I cannot make the process jupyterhub-singleuser to connect to the JupyterHub after it has been successfully spawned and is running. The batch process just times out and peacefully stops.
[I 2022-11-29 14:42:20.232 SingleUserNotebookApp notebookapp:2327] Jupyter Notebook 6.4.12 is running at:
[I 2022-11-29 14:42:20.232 SingleUserNotebookApp notebookapp:2327] http://REDACTED_URL:40331/user/ubuntu
[I 2022-11-29 14:42:20.232 SingleUserNotebookApp notebookapp:2328] Use Control-C to stop this server and shut down all kernels (twice to skip confirmation).
I tried to connect to the Notebook directly in browser with the address given above just to be redirected to address
http://REDACTED_URL:40331/hub/api/oauth2/authorize?client_id=...
and with error
404 : Not Found
You are requesting a page that does not exist!
I believe that the Notebook cannot connect to JupyterHub for authentication.
I noticed that when the batch job is running, a connection to another randomly generated port appears. And the two ports do not match.
$ sudo lsof -i
batchspaw 113935 ubuntu 10u IPv4 2237630 0t0 TCP *:40331 (LISTEN)
jupyterhu 113257 root 9u IPv4 2225954 0t0 TCP *:8000 (LISTEN)
jupyterhu 113257 root 10u IPv4 2227193 0t0 TCP REDACTED_NAME:8000->REDACTED_NAME:55474 (ESTABLISHED)
Could this be the reason why the services do not communicate?
I also see the 404, but what I do not see is the serevr talking to the client. That's why I did not pay attention to the 404. to me it looks like the server is, for some reason, not making the connection to the node endpoint at all.
but at this point I'm not sure about anything :-)
I was getting this same issue and https://github.com/jupyterhub/batchspawner/pull/251 fixed it for me
thanks ! that fixed the issue for me, too.
Bug description
When I strart a server the jobs gets scheduled and successfully starts but the hub cannot talk to it.
Expected behaviour
The hub knows about the server running in the node and the server starts.
Actual behaviour
From the job logs:
From the hub logs:
How to reproduce
Request a job using the batchspawner.
Your personal set up
alembic 1.7.7 anyio 0.0.0 argon2-cffi 21.3.0 argon2-cffi-bindings 0.0.0 asttokens 2.0.5 async-generator 1.10 attrs 21.4.0 Babel 2.9.1 backcall 0.2.0 batchspawner 1.1.0 beautifulsoup4 4.10.0 bleach 4.1.0 certifi 2021.10.8 certipy 0.1.3 cffi 1.15.0 charset-normalizer 2.0.12 colorama 0.4.4 cryptography 36.0.2 debugpy 1.5.1 decorator 5.1.1 defusedxml 0.7.1 deprecation 2.1.0 entrypoints 0.4 executing 0.0.0 flit_core 3.6.0 gitdb 4.0.9 GitPython 3.1.27 greenlet 1.1.2 idna 3.3 iniconfig 0.0.0 ipykernel 6.9.2 ipython 8.1.1 ipython-genutils 0.2.0 ipywidgets 7.6.5 jedi 0.18.1 Jinja2 3.0.3 json5 0.9.6 jsonschema 0.0.0 jupyter-client 7.1.2 jupyter-core 4.9.2 jupyter-packaging 0.11.1 jupyter-resource-usage 0.6.1 jupyter-server 1.15.6 jupyter-server-mathjax 0.2.3 jupyter-telemetry 0.1.0 jupyterhub 2.2.2 jupyterlab 3.3.2 jupyterlab-git 0.34.2 jupyterlab-launcher 0.13.1 jupyterlab-pygments 0.1.2 jupyterlab-server 2.11.2 jupyterlab-widgets 1.0.2 Mako 1.2.0 MarkupSafe 2.1.1 matplotlib-inline 0.1.3 mistune 0.8.4 nbclassic 0.3.7 nbclient 0.5.13 nbconvert 6.4.4 nbdime 3.1.1 nbformat 5.2.0 nest-asyncio 1.5.4 notebook 6.4.10 notebook-shim 0.1.0 oauthenticator 14.2.0 oauthlib 3.2.0 packaging 21.3 pamela 1.0.0 pandocfilters 1.5.0 parso 0.8.3 pexpect 4.8.0 pickleshare 0.7.5 pip 22.0.4 pluggy 0.0.0 poetry 1.1.13 poetry-core 1.0.8 prometheus-client 0.13.1 prompt-toolkit 3.0.28 psutil 5.9.0 ptyprocess 0.7.0 pure-eval 0.0.0 py 1.10.0 pycparser 2.21 Pygments 2.11.2 PyJWT 2.3.0 pyOpenSSL 22.0.0 pyparsing 3.0.7 pyrsistent 0.18.1 pytest 0.0.0 python-dateutil 0.0.0 python-json-logger 2.0.2 pytz 2022.1 pyzmq 22.3.0 requests 2.27.1 ruamel.yaml 0.17.21 ruamel.yaml.clib 0.2.6 semantic-version 2.9.0 Send2Trash 1.8.0 setuptools 60.10.0 setuptools-rust 1.1.2 six 1.16.0 smmap 3.0.1 sniffio 1.2.0 soupsieve 2.3.1 SQLAlchemy 1.4.32 stack-data 0.0.0 terminado 0.13.3 testpath 0.6.0 toml 0.10.2 tomlkit 0.10.0 tornado 6.1 traitlets 5.1.1 typing-extensions 3.10.0.2 urllib3 1.26.9 wcwidth 0.2.5 webencodings 0.5.1 websocket-client 1.3.1 wheel 0.37.1 wrapspawner 1.0.1
Logs
Hub logs (148.191.97.103 resolves to computenode).
Job logs: