SlideRuleEarth / sliderule

Server and client framework for on-demand science data processing in the cloud
https://slideruleearth.io
Other
26 stars 11 forks source link

`FatalError` during `init`, clarify error + how to start cluster? #346

Open rhugonnet opened 9 months ago

rhugonnet commented 9 months ago

Hi @jpswinski, @tsutterley,

After setting up an account with "uw", and following the guidelines in https://slideruleearth.io/web/rtd/user_guide/Private-Clusters.html#getting-started-with-private-clusters, I got the following FatalError:

Executing:

sliderule.init("slideruleearth.io", organization="uw")

I get:

Connection error to endpoint https://uw.slideruleearth.io/source/version ...retrying request
Connection error to endpoint https://uw.slideruleearth.io/source/version ...retrying request
Connection error to endpoint https://uw.slideruleearth.io/source/version ...retrying request
Traceback (most recent call last):
  File "/home/atom/miniconda3/envs/srtm_pene/lib/python3.10/site-packages/IPython/core/interactiveshell.py", line 3508, in run_code
    exec(code_obj, self.user_global_ns, self.user_ns)
  File "<ipython-input-3-5305c5113031>", line 11, in <module>
    sliderule.init("slideruleearth.io", organization="uw")
  File "/home/atom/miniconda3/envs/srtm_pene/lib/python3.10/site-packages/sliderule/sliderule.py", line 678, in init
    return check_version(plugins=plugins) # verify compatibility between client and server versions
  File "/home/atom/miniconda3/envs/srtm_pene/lib/python3.10/site-packages/sliderule/sliderule.py", line 1172, in check_version
    info = get_version()
  File "/home/atom/miniconda3/envs/srtm_pene/lib/python3.10/site-packages/sliderule/sliderule.py", line 1149, in get_version
    rsps = source("version", {})
  File "/home/atom/miniconda3/envs/srtm_pene/lib/python3.10/site-packages/sliderule/sliderule.py", line 785, in source
    raise FatalError("Unable to complete request due to errors")
sliderule.sliderule.FatalError: Unable to complete request due to errors

I took me a bit of time to figure out this might be from the fact that the cluster is not deployed:

Cluster State
    uw is NOT deployed

Following https://slideruleearth.io/web/rtd/user_guide/Private-Clusters.html#starting-and-scaling-a-private-cluster, I also didn't know how long exactly for the cluster to start after using sliderule.update_available_servers (or that I would have to use that call at the very beginning of the script).

Now, I still get version errors:

RuntimeError: Client (version (4, 0, 2)) is incompatible with the server (version (3, 7, 0))

Maybe we could clarify these three aspects in SlideRule:

jpswinski commented 9 months ago

@rhugonnet - Thank you for going through this process and giving us this feedback!

In the short term

In the long term

tsutterley commented 9 months ago

@jpswinski bumping the uw cluster to v4 works on our end.