sourav-mazumder / Data-Science-Extensions


Fail silently #1

Open joe4k opened 6 years ago

joe4k commented 6 years ago

Thanks @sourav-mazumder for providing this capability; it is a critical enabler in leveraging the distributed nature of Spark when calling REST APIs such as the Watson services on IBM Cloud. One suggestion: occasionally, the REST APIs being called return timeout errors, and currently a single timeout kills the job completely. It would be good to add a flag that lets Spark continue processing the other records and, for the records that time out, return a message indicating the timeout.
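To make the request concrete, here is a minimal sketch of the behavior I have in mind. It is plain Python rather than this library's code, and every name in it (`process_records`, `call_api`, `fail_silently`, `fake_api`) is hypothetical, made up for illustration only. The assumption is that the REST call raises a timeout exception for a slow record.

```python
import socket
from typing import Any, Callable, Dict, Iterable, Iterator


def process_records(
    records: Iterable[Dict[str, Any]],
    call_api: Callable[[Dict[str, Any]], Any],
    fail_silently: bool = True,
) -> Iterator[Dict[str, Any]]:
    """Call the API once per record (hypothetical helper, not the library's API).

    With fail_silently=True, a timeout on one record yields a row carrying a
    timeout message instead of aborting the remaining records.
    """
    for record in records:
        try:
            yield {"input": record, "output": call_api(record), "error": None}
        except (TimeoutError, socket.timeout) as exc:
            if not fail_silently:
                # Current behavior: the first timeout propagates and ends the job.
                raise
            yield {"input": record, "output": None, "error": f"timeout: {exc}"}


def fake_api(record: Dict[str, Any]) -> int:
    """Stand-in for a REST call; pretends that record 2 times out."""
    if record["id"] == 2:
        raise TimeoutError("no response within the time limit")
    return record["id"] * 10


results = list(process_records([{"id": 1}, {"id": 2}, {"id": 3}], fake_api))
for row in results:
    print(row)
```

Records 1 and 3 come back with their outputs, and record 2 comes back with an `error` field instead of stopping the run. Passing `fail_silently=False` keeps today's behavior. In a Spark job, the same per-record logic could presumably run inside something like `mapPartitions`, but how it would fit into this data source is up to the maintainer.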