GoogleCloudPlatform / DataflowJavaSDK

Google Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines.
http://cloud.google.com/dataflow
855 stars 325 forks source link

Researching Build Breakage #615

Closed clanghout closed 6 years ago

clanghout commented 6 years ago

Dear developers of the GoogleCloudPlatform/DataflowJavaSDK project,

We are MSc students from the Delft University of Technology doing research on why builds break in open source projects. To do this, we are analyzing Travis build statistics and build history of DataflowJavaSDK.

By analyzing the build history of DataflowJavaSDK, we found the following:

When looking at DataflowJavaSDK’s build metrics, we could not find clear factors that are highly correlated with this result.

To identify the reasons behind build breakage, we would like to collect developer insights into the reasons why build breakage occurs. If you have five minutes, please answer the survey we have created. Your response will be very useful in our study.

Thank you.