A potential research collaborator is evaluating data platforms for running analysis pipelines on their upcoming very large dataset. They're interested in estimating the cost of running an existing pipeline using the Hail Query framework.
I think that getting one number here is likely very difficult. I do also think that this is a completely reasonable question for them to ask and that it would greatly benefit us to have some kind of documentation on cost estimation. Other collaborators might also have ideas.
A potential research collaborator is evaluating data platforms for running analysis pipelines on their upcoming very large dataset. They're interested in estimating the cost of running an existing pipeline using the Hail Query framework.
I think that getting one number here is likely very difficult. I do also think that this is a completely reasonable question for them to ask and that it would greatly benefit us to have some kind of documentation on cost estimation. Other collaborators might also have ideas.