shiyonglu / DATAVIEW

DATAVIEW is a big data workflow management system. It uses Dropbox as the data cloud and Amazon EC2 as the compute cloud. Current research focuses on the security and privacy aspects of DATAVIEW as well as performance and cost optimization for running workflows in clouds.

Record a tutorial for the word-counting workflow example #7

Closed: shiyonglu closed 5 years ago

shiyonglu commented 5 years ago

We will present this tutorial with the following steps:

  1. Show the visualization of the workflow

  2. Based on the workflow visualization, explain the overall structure of the workflow

  3. Explain the code for each task (three tasks)

  4. Explain the code for the workflow

  5. Prepare for running the workflow: set the parameters and move the class files to the right directory

  6. Run the workflow and explain the input and output of the execution
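
For readers following the outline, here is a minimal sketch of what a three-task word-counting pipeline might look like in plain Java. It does not use the DATAVIEW API. The class name `WordCountSketch`, the task names `split`, `count`, and `merge`, and the two-partition layout are all assumptions made for illustration; the actual tasks in the tutorial may be organized differently.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical illustration only: not the DATAVIEW task or workflow classes.
public class WordCountSketch {

    // Task 1 (assumed): split the input lines into two partitions.
    static List<List<String>> split(List<String> lines) {
        int mid = lines.size() / 2;
        List<List<String>> parts = new ArrayList<>();
        parts.add(new ArrayList<>(lines.subList(0, mid)));
        parts.add(new ArrayList<>(lines.subList(mid, lines.size())));
        return parts;
    }

    // Task 2 (assumed): count case-insensitive word occurrences in one partition.
    static Map<String, Integer> count(List<String> lines) {
        Map<String, Integer> counts = new TreeMap<>();
        for (String line : lines) {
            for (String word : line.toLowerCase().split("\\W+")) {
                if (!word.isEmpty()) {
                    counts.merge(word, 1, Integer::sum);
                }
            }
        }
        return counts;
    }

    // Task 3 (assumed): merge the partial counts into one result.
    static Map<String, Integer> merge(List<Map<String, Integer>> partials) {
        Map<String, Integer> total = new TreeMap<>();
        for (Map<String, Integer> partial : partials) {
            partial.forEach((word, c) -> total.merge(word, c, Integer::sum));
        }
        return total;
    }

    // The "workflow": wire the three tasks together on a small in-memory input.
    public static void main(String[] args) {
        List<String> input = List.of("the quick brown fox", "the lazy dog", "The fox");
        List<Map<String, Integer>> partials = new ArrayList<>();
        for (List<String> part : split(input)) {
            partials.add(count(part));
        }
        System.out.println(merge(partials));
    }
}
```

In the real workflow the data would move between tasks through the workflow engine rather than through direct method calls; the sketch only shows the shape of the computation that steps 3 and 4 explain.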

shiyonglu commented 5 years ago

Done. Next time, should we show which directory to move the class files to?