Closed nithinjoshy closed 6 months ago
Hi @nithinjoshy! We'll need to update the google-batch provider in a future release. The code for specifying a custom machine is here, in the google_v2_base provider, which unfortunately isn't currently shared with the google-batch provider. The code for Batch provider's instance policy will also need to be updated to take in this machine type, based on this documentation.
@wnojopra Just curious if there are any updates on allowing custom machine types with Google Batch?
Hello, I am contributing to this and i have an ongoing PR : https://github.com/DataBiosphere/dsub/pull/285
Thank you @mccstan , appreciate this so much! I was actually starting to look into building my own wrapper around gcloud batch
to get this done, but it's awesome to see this being fixed within dsub :)
@wnojopra My PR is ready for Review.
Hi @nithinjoshy and @lm-jkominek :
We just put out release 0.4.11, which includes:
When you get the chance, can you verify if it resolves your issues?
@wnojopra, thank you for this, much apprish! And sure thing, I will take a look later this week to see how it performs in the wild
@wnojopra, I run 0.4.11 with google-batch
and it did provision the resources that I asked, so I can confirm that it works, at least for me, yay!
One issue I noticed though was that for some reason all the VMs spun up in the us-central1 region instead of us-east1, which I specified via --regions
so something may be amiss in that corner? I submitted the jobs with a command that I normally use with google-cls-v2
, unless batch uses a different param for that?
Hi @lm-jkominek great to hear that everything works except the regions. I've filed #289 to track the region issue. It seems like a quick fix for the next release.
I am attempting to switch to use google batch as the provider for dsub but I am encountering a problem where the machine type for the worker is always "e2-highcpu-2" instead of what I have provided as arguments. I have been using the below call for the last year or so without the "--provider google-batch" line and it has worked which makes me think google-batch is the problem.
I have tried switching to use the following two lines instead of "--machine-type" but still get an "e2-highcpu-2" machine which is not sufficient for my needs.
My jobs are failing with the following error although I am asking about this because I believe that the reason for that is that the memory is insufficient for the program I am running.
I am curious if there is any idea about why it is always e2-highcpu-2 and if there is some way I can change this to get a different machine.
I apologize in advance if I missed that information about this was already written somewhere or this is the wrong place to ask about this.