bacalhau-project / bacalhau

Compute over Data framework for public, transparent, and optionally verifiable computation
https://docs.bacalhau.org
Apache License 2.0
679 stars 88 forks

Developing a command for determining the compute cost of a given Bacalhau job. #2500

Closed dhruvmalik007 closed 9 months ago

dhruvmalik007 commented 1 year ago

Context

My team @ extra is building a marketplace for:

Current Workflow:

sequenceDiagram
    user->>ipfs-storage: 1. Stores the raw 3D scan file (in .ply format)
    ipfs-storage-->>bacalhau: 2. Gets the CID of the given file and passes it as a parameter to the reconstruction container
    docker-reconstruction-->>bacalhau: 3. Runs the compute with the input file parameter
    bacalhau-->>ipfs-storage: 4. After the compute job is complete, fetches the reconstructed output file and stores it on IPFS

Here we want to give users a way to understand how much they will need to pay (based on the job's compute requirements) to support the hosting of the infrastructure.

Issue

For this architecture to be commercially viable, we need a feature in the CLI that determines the compute cost of a given job (for example, something of the form bacalhau docker estimate <<image:tag>>).
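To make the request more concrete, here is a minimal sketch of the calculation such a command might perform. Everything in it is an assumption for illustration only: Bacalhau has no estimate subcommand today, and the JobResources and Rates names, the per-hour units, and the linear pricing formula are all hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class JobResources:
    """Resources a job requests (hypothetical shape, not a Bacalhau type)."""
    cpu_cores: float
    memory_gb: float
    gpus: int = 0


@dataclass(frozen=True)
class Rates:
    """Provider prices per hour of use (hypothetical units)."""
    per_cpu_core_hour: float
    per_gb_memory_hour: float
    per_gpu_hour: float


def estimate_cost(resources: JobResources, rates: Rates, expected_hours: float) -> float:
    """Linear estimate: hourly price of the requested resources times runtime."""
    if expected_hours < 0:
        raise ValueError("expected_hours must be non-negative")
    hourly = (
        resources.cpu_cores * rates.per_cpu_core_hour
        + resources.memory_gb * rates.per_gb_memory_hour
        + resources.gpus * rates.per_gpu_hour
    )
    return hourly * expected_hours
```

The expected runtime is the hard part in practice, since it is not known before the job runs; in this sketch the caller has to supply it, for example from previous reconstructions of similar scans.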

I think this will change the whole workflow for the resource providers (i.e. those hosting the cluster of compute/requester nodes), as they could define at deployment time how much they want to charge users for a given compute instance.
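As a rough illustration of the provider side, a node operator could declare prices in a small config file that is read at deployment. The JSON format, the key names, and the load_provider_rates helper below are hypothetical; nothing like this exists in Bacalhau as far as I know.

```python
import json

# Hypothetical rate keys a provider would have to declare.
REQUIRED_KEYS = ("per_cpu_core_hour", "per_gb_memory_hour", "per_gpu_hour")


def load_provider_rates(config_text: str) -> dict:
    """Parse and validate a provider's pricing config (hypothetical JSON format)."""
    raw = json.loads(config_text)
    rates = {}
    for key in REQUIRED_KEYS:
        if key not in raw:
            raise ValueError(f"missing rate: {key}")
        value = float(raw[key])
        if value < 0:
            raise ValueError(f"rate must be non-negative: {key}")
        rates[key] = value
    return rates


# Example config a provider might ship with a node deployment.
example = '{"per_cpu_core_hour": 0.05, "per_gb_memory_hour": 0.01, "per_gpu_hour": 0.5}'
provider_rates = load_provider_rates(example)
```

Validating the rates when the node starts means a misconfigured provider fails early rather than quoting a wrong price to users.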

@aronchick, I'm happy to share further details or a task list based on the community's feedback.

Thanks in advance.

aronchick commented 1 year ago

This is so valuable! Thank you so much - we're adding a bunch of scheduling constructs as we speak - let's see if we can align. cc @wdbaruni

dhruvmalik007 commented 1 year ago

Hi @aronchick, any new updates?