[Closed] dfsnow closed this 2 months ago
One question I had related to error handling while I was reading this code: a few of the client methods are wrapped in `try`/`except` blocks that prevent them from raising an exception, but what happens if a code block that is not protected in this way causes the job to error out before it can upload its logs to CloudWatch? How will we get alerted that the job has failed? I wonder if it's worth wrapping `main()` in a big `try`/`except` block that logs the exception, attempts to ship logs to CloudWatch, and alerts us before raising the exception.
So, the way I'd originally set this up, it wasn't possible to use the Spark logger in the `except` block, since the logger wouldn't exist if the main loop failed.
In the process of refactoring I discovered you totally can just use both the Spark AND Python loggers at the same time. So, I switched over to generic Python logging in lieu of passing around the Spark logger. This gets us:
I think it's a much better design overall, but curious to see what you think. Lots of changes here, so re-requesting review @jeancochrane!
This PR adds logging and error handling using the log4j driver pulled from the Spark session context. I chose to use this logger rather than standard Python logging because I want to capture the Spark output and intersperse it with Python logging.
This PR also adds a new `AWSClient` class that can trigger Glue jobs and upload finished log files to CloudWatch. I refactored the GitHub session class to more closely match the AWS one. Here's an example log output in CloudWatch.
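A class like the one described might be sketched as follows. The method names and internals are assumptions, not the PR's actual implementation; the Glue and CloudWatch Logs calls (`start_job_run`, `create_log_stream`, `put_log_events`) are real boto3 APIs, and the clients are injected so the class can be exercised without AWS access (in the job you would pass `boto3.client("glue")` and `boto3.client("logs")`):

```python
import time


class AWSClient:
    """Minimal sketch of a client that triggers Glue jobs and ships logs."""

    def __init__(self, glue_client, logs_client):
        # Injected boto3 clients (or fakes, for testing)
        self.glue = glue_client
        self.logs = logs_client

    def run_glue_job(self, job_name: str) -> str:
        # Start a Glue job run and return its run ID
        response = self.glue.start_job_run(JobName=job_name)
        return response["JobRunId"]

    def upload_logs(self, log_group: str, log_stream: str, lines: list) -> None:
        # Create a log stream, then ship each finished log line to CloudWatch
        self.logs.create_log_stream(
            logGroupName=log_group, logStreamName=log_stream
        )
        events = [
            {"timestamp": int(time.time() * 1000), "message": line}
            for line in lines
        ]
        self.logs.put_log_events(
            logGroupName=log_group, logStreamName=log_stream, logEvents=events
        )
```

Injecting the clients rather than constructing them inside the class keeps the error-handling path testable without real AWS credentials.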