[ ] generate or find a small workload for each of Hadoop and Spark that shows the same CPU characteristics on a single node as it does in cluster mode
[ ] find the region of interest
Minor
[ ] use a programmatic API for perf instead of a script that parses its command-line output
Project goals
Major
Minor