gluent / goe

GOE: a simple and flexible way to copy data from an Oracle Database to Google BigQuery.
Apache License 2.0

Spark standalone make target should be updated #132

Open nj1973 opened 6 months ago

nj1973 commented 6 months ago

The Spark version, configuration and associated JAR files need to be refreshed.

Spark version: We should look for a more recent version in line with supported Dataproc versions.

JAR files: We should find a more recent combination of JAR files, but also cater for not including the JAR files at all. It is better to supply package addresses via offload.env than to host JAR files. Perhaps we need to support both.
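
As a rough illustration of what "support both" could look like, here is a minimal sketch that builds the dependency flags for a Spark launcher script such as spark-submit, which accepts `--packages` (comma-separated Maven coordinates) and `--jars` (comma-separated JAR paths). The function name, its parameters and the example coordinates and paths are hypothetical and are not taken from GOE or from offload.env.

```python
from typing import List, Optional, Sequence


def build_spark_dependency_args(
    packages: Optional[Sequence[str]] = None,
    jar_paths: Optional[Sequence[str]] = None,
) -> List[str]:
    """Return Spark launcher dependency flags for packages, local JARs, or both.

    Hypothetical helper for illustration only; not part of GOE.
    """
    args: List[str] = []
    if packages:
        # --packages takes comma-separated Maven coordinates (group:artifact:version).
        args += ["--packages", ",".join(packages)]
    if jar_paths:
        # --jars takes a comma-separated list of JAR files to add to the classpath.
        args += ["--jars", ",".join(jar_paths)]
    return args


if __name__ == "__main__":
    # Placeholder coordinates and path, assumed purely for the example.
    print(build_spark_dependency_args(
        packages=["com.example:example-connector_2.12:1.0.0"],
        jar_paths=["/opt/example/jars/example-driver.jar"],
    ))
```

With neither argument supplied the helper returns an empty list, so a setup that hosts no JAR files and names no packages adds nothing to the command line.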

Configuration: We can assume a GCP target in the short term, so we could include those configuration items in the default files. Alternatively, we could cater for it by having multiple make targets.