Closed khl02007 closed 3 years ago
This is surprising to me. I added some checks and made a potential fix. Please upgrade to
kachery-daemon >=1.0.20
and kachery-client >=1.0.11
Here is another type of crash
Traceback (most recent call last):
File "/home/kacheryuser/miniconda3/envs/kachery-env/bin/sortingview-start-back
end", line 6, in <module>
sortingview.start_backend_cli()
File "/home/kacheryuser/miniconda3/envs/kachery-env/lib/python3.8/site-package
s/click/core.py", line 1137, in __call__
return self.main(*args, **kwargs)
File "/home/kacheryuser/miniconda3/envs/kachery-env/lib/python3.8/site-package
s/click/core.py", line 1062, in main
rv = self.invoke(ctx)
File "/home/kacheryuser/miniconda3/envs/kachery-env/lib/python3.8/site-package
s/click/core.py", line 1404, in invoke
return ctx.invoke(self.callback, **ctx.params)
File "/home/kacheryuser/miniconda3/envs/kachery-env/lib/python3.8/site-package
s/click/core.py", line 763, in invoke
return __callback(*args, **kwargs)
File "/home/kacheryuser/miniconda3/envs/kachery-env/lib/python3.8/site-package
s/sortingview/backend/start_backend_cli.py", line 8, in start_backend_cli
start_backend(channel=channel)
File "/home/kacheryuser/miniconda3/envs/kachery-env/lib/python3.8/site-package
s/sortingview/backend/start_backend.py", line 14, in start_backend
kc.run_task_backend(
File "/home/kacheryuser/miniconda3/envs/kachery-env/lib/python3.8/site-packages/kachery_client/task_backend/run_task_backend.py", line 32, in run_task_backend
B.process_events()
File "/home/kacheryuser/miniconda3/envs/kachery-env/lib/python3.8/site-packages/kachery_client/task_backend/TaskBackend.py", line 47, in process_events
self._task_job_manager.process_events()
File "/home/kacheryuser/miniconda3/envs/kachery-env/lib/python3.8/site-package
s/kachery_client/task_backend/TaskJobManager.py", line 52, in process_events
requested_task.update_status(status=job.status, error_message=error_message, result=result)
File "/home/kacheryuser/miniconda3/envs/kachery-env/lib/python3.8/site-package
s/kachery_client/task_backend/RequestedTask.py", line 35, in update_status
_update_task_status(channel=self.registered_task_function.channel, task_id=s
elf.task_id, task_function_id=self._registered_task_function.task_function_id, t
ask_hash=self.task_hash, task_function_type=self.task_function_type, status=stat
us, result=result, error_message=error_message)
File "/home/kacheryuser/miniconda3/envs/kachery-env/lib/python3.8/site-package
s/kachery_client/task_backend/_update_task_status.py", line 15, in _update_task_
status
result_content = simplejson.dumps(result, separators=(',', ':'), indent=None
, allow_nan=False).encode()
File "/home/kacheryuser/miniconda3/envs/kachery-env/lib/python3.8/site-package
s/simplejson/__init__.py", line 398, in dumps
return cls(
File "/home/kacheryuser/miniconda3/envs/kachery-env/lib/python3.8/site-package
s/simplejson/encoder.py", line 296, in encode
chunks = self.iterencode(o, _one_shot=True)
File "/home/kacheryuser/miniconda3/envs/kachery-env/lib/python3.8/site-package
s/simplejson/encoder.py", line 378, in iterencode
return _iterencode(o, 0)
ValueError: Out of range float values are not JSON compliant
Cleaning up parallel job handler
Cleaning up parallel job handler
Cleaning up parallel job handler
Cleaning up parallel job handler
Cleaning up parallel job handler
Cleaning up parallel job handler
Cleaning up parallel job handler
@khl02007 I think this is another example of NaN creeping into the results of some backend tasks. But obviously we don't want that to crash the backend. So I updated the backend so it won't crash on these instances.
You'll need to pip upgrade sortingview to 0.2.25 (and that should automatically bump kachery-client to >= 1.0.12). Then restart the backend, and report any further crashes.
Closing because I think this has been resolved.
kachery-daemon is running Tried upgrading kachery, kachery-daemon, and hither