Open aronchick opened 2 months ago
I tried running the job and the job state was marked as Completed for me. This is an example where Reliable Orchestrator epic would help as today there can be a disconnect between the different components in the network that can result in this out of sync and orphan state. This is a current work in progress
→ bacalhau job describe j-fcc9711f-19fb-48a8-ad7e-0638a6f6041a
ID = j-fcc9711f-19fb-48a8-ad7e-0638a6f6041a
Name = j-fcc9711f-19fb-48a8-ad7e-0638a6f6041a
Namespace = default
Type = batch
State = Completed
Count = 1
Created Time = 2024-08-11 13:56:06
Modified Time = 2024-08-11 13:57:11
Version = 0
Summary
Completed = 1
Job History
TIME TOPIC EVENT
2024-08-11 13:56:06 Submission Job submitted
2024-08-11 13:56:08
2024-08-11 13:57:11
Executions
ID NODE ID STATE DESIRED REV. CREATED MODIFIED COMMENT
e-032daba7 n-e002001e Completed Stopped 6 1m48s ago 43s ago Accepted job
Execution e-032daba7 History
TIME TOPIC EVENT
2024-08-11 13:56:06
2024-08-11 13:56:06
2024-08-11 13:56:08 Requesting Node Accepted job
2024-08-11 13:56:08
2024-08-11 13:56:08
2024-08-11 13:57:11
Standard Output
stress-ng: info: [1] setting to a 1 min, 0 secs run per stressor
stress-ng: info: [1] dispatching hogs: 2 cpu
stress-ng: info: [1] skipped: 0
stress-ng: info: [1] passed: 2: cpu (2)
stress-ng: info: [1] failed: 0
stress-ng: info: [1] metrics untrustworthy: 0
stress-ng: info: [1] successful run completed in 1 min, 0.00 secs
I don't know how i got here.
Job spec:
Job is "running", but history says "completed" and the container is completed.