Azure / doAzureParallel

An R package that allows users to submit parallel workloads in Azure

"Error in e$fun(obj, substitute(ex) ...." on azure batch example script #353

Open mohammadullah opened 5 years ago

mohammadullah commented 5 years ago


Description: Hello, this is my first time writing an issue on GitHub, and I am a first-time user of Azure Batch clusters. As a beginner, I started by trying the example given in this tutorial:

https://docs.microsoft.com/en-us/azure/batch/tutorial-r-doazureparallel

I copied and pasted the code exactly as it is. Everything seems to run fine: all 5 nodes are up and running, and the Progress section shows 100%. But when it comes to merging, I see the following error:

| Progress: 100.00% (10/10) | Running: 0 | Queued: 0 | Completed: 10 | Failed: 0 |
Tasks have completed. Merging results...
An error has occurred in the merge task of the job 'job20190510161411'. Error handling is set to 'stop' and has proceeded to terminate the job. The user will have to handle deleting the job. If this is not the correct behavior, change the errorhandling property to 'pass' or 'remove' in the foreach object. Use the 'getJobFile' function to obtain the logs. For more information about getting job logs, follow this link: https://github.com/Azure/doAzureParallel/blob/master/docs/90-troubleshooting.md#viewing-files-directly-from-compute-node
Error in e$fun(obj, substitute(ex), parent.frame(), e$data) : object 'results' not found

After going through some other issues on this page, I disabled merging with the enableCloudCombine = FALSE option. I then get the following error:

Error in e$fun(obj, substitute(ex), parent.frame(), e$data) : object 'results' not found

Any help is highly appreciated.

Thanks Moeen

mohammadullah commented 5 years ago

Update:

It seems that the following returns the result, although the error is still there. This might be a data/result return issue.

setAutoDeleteJob(FALSE)
result2 <- getJobResult("jobxxxxxxxxxxxx")

Moeen

muneer9211 commented 5 years ago

I had the same issue and contacted Microsoft. They have suggested a quick fix, which worked for me.

devtools::install_github("Azure/rAzureBatch", ref="fix/v3.7-support")
devtools::install_github("Azure/doAzureParallel", ref="fix/v3.7-support")

Install these packages from their working branch. Hopefully it works for you.

quison1 commented 5 years ago

I am having the same issue - with the sample code as well as a program that worked a few months ago.

Installing the above packages didn't work for me, and neither did getJobResult(). I get the following error:

Error in getJobResult("job20190613085024") : Error getting job result: Maxmium number of retries (3) reached
<?xml version="1.0" encoding="UTF-8"?>
BlobNotFound
The specified blob does not exist.
RequestId:01abc386-801e-001b-2ac5-21b75d000000
Time:2019-06-13T08:52:27.3624120Z

Appreciate any help.

mohammadullah commented 5 years ago

I'm not sure about your particular "blob" issue, but to use getJobResult, first turn auto-delete off with setAutoDeleteJob(FALSE). You can also set wait = FALSE in the foreach options to get the job ID automatically. Check this documentation:

https://github.com/Azure/doAzureParallel/blob/master/docs/51-long-running-job.md
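For reference, the workaround described in this thread might look roughly like the sketch below. It assumes a cluster has already been created and registered as in the tutorial, and that the wait option is passed through .options.azure as in the long-running-job documentation linked above. The loop body is a hypothetical placeholder, not the tutorial's code.

```r
library(doAzureParallel)

# Assumption: a cluster is already registered with registerDoAzureParallel(),
# as in the tutorial.

# Keep the job and its results in storage after completion so that
# getJobResult() can still find them.
setAutoDeleteJob(FALSE)

# With wait = FALSE, foreach submits the job and returns its ID
# instead of blocking until the results are merged.
opt <- list(wait = FALSE)

job_id <- foreach(i = 1:10, .options.azure = opt) %dopar% {
  sqrt(i)  # hypothetical placeholder for the real task body
}

# Later, once the job has completed, fetch the results by job ID.
results <- getJobResult(job_id)
```

Since auto-delete is turned off here, the job stays around until it is deleted manually.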