DATAVIEW is a big data workflow management system. It uses Dropbox as the data cloud and Amazon EC2 as the compute cloud. Current research focuses on the security and privacy aspects of DATAVIEW as well as performance and cost optimization for running workflows in clouds.
11
stars
5
forks
source link
Record a tutorial for the word-couting workflow example #7
We will present this tutorial with the following steps:
Show the visualization of the workflow
Based on the workflow visualization, explain the overall structure of the workflow
Explain the code for each task (three tasks)
Explain the code for the workflow
Prepare for running the workflow: parameter setting, moving class files to the right directory
Run the workflow and explain the input and output of the execution