bacalhau-project / bacalhau

Compute over Data framework for public, transparent, and optionally verifiable computation
https://docs.bacalhau.org
Apache License 2.0
643 stars 85 forks source link

obtuse error message from `bacalhau docker run` when issuing job with invalid parameters #4032

Open frrist opened 1 month ago

frrist commented 1 month ago

Bug Description

Run this job: bacalhau docker run --concurrency=4 ubuntu:latest eho hello world (notice echo is misspelled, meaning its an invalid binary to execute) Observe this result:

Job successfully submitted. Job ID: 69d52def-06b1-4e81-a70d-043df8d292f6
Checking job status... (Enter Ctrl+C to exit at any time, your job will continue running):

    Communicating with the network  ................  done ✅  0.1s
       Creating job for submission  ................  done ✅  0.0s
                   Job in progress  ................  err  ❌  0.6s

Error submitting job: not enough nodes to run job. requested: 1, available: 4, suitable: 0.
• 4 of 4 nodes: job already executed on this node more than once
Job Results By Node:
• Node QmVHCeiL, Qma5yQAk, Qma5yQAk, QmafZ9oC, QmRr9qPT: execution error: failed to start container: executable file not found: Error response from daemon: failed to create task for container: failed to create shim task: OCI runtime create failed: runc create failed: unable to start container process: exec: "eho": executable file not found in $PATH: unknown. execution failed
• Node QmVHCeiL, QmRr9qPT, QmafZ9oC: 
    Accepted job. not enough nodes to run job. requested: 1, available: 4, suitable: 0.
    • 4 of 4 nodes: job already executed on this node more than once

Expected Behavior

I would expect:

Bacalhau Versions

Host Environment

Provide details about the environment where the bug occurred:

Job Specification

(If applicable, provide the job spec used when the issue occurred.)

Logs

Node Logs: https://gist.github.com/frrist/69a9c85891890e114f235aece40ed888