Open pregazzoni opened 8 years ago
This looks good Pierre. Would these steps fo into a new task called something like Setup that the user would just run once with gradle?
Will it also work on basic clusters?
@snowch need to look into this more closely as I believe you would need to become oozie user to do this (so need root). Same is true for basic.
I am also inquiring if this could become default though so it is there by default in the shared lib to start with.
Ah, cool. Thanks @pregazzoni
In order for oozie spark job to run in Yarn we need the spark-assembly.jar to be in job path. Right now we get the jar for the cluster (webhdfs) and then put (webhdfs) it into the $jobDir/lib directory. This takes over few minutes.
Another way would be too have the lib in the oozie shared lib directory by default.
As oozie, you can do:
Once this is done, there is no need to put the jar under $jobDir/lib as it will be automatically picked from the oozie shared lib.