GoogleCloudPlatform / DataflowJavaSDK

Google Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines.
http://cloud.google.com/dataflow
855 stars 324 forks source link

Upgrade to Apache Beam, version 0.6.0 #558

Closed jasonkuster closed 7 years ago

jasonkuster commented 7 years ago

Lots of manual changes -- hopefully some which will make this process easier in the future.

Verified via mvn clean install, then generating, and then running WordCount in both examples and examples-java8 and StarterPipeline from starter.

R: @davorbonaci @aaltay

jasonkuster commented 7 years ago

Note: As-is, will not pass -- need to update container image to 2.0.0-beta3 (omitting -20170202); 20170202 container is missing changes from Beam 0.6.0. This will be done as part of the upcoming release process.