Closed aashish24 closed 8 years ago
Run analysis on 12 TB on one variable ~ 4 TB
Run analysis on 6 TB on one variable ~ 2 TB
Run analysis on 1 TB on one variable ~ ¼ TB
By running the benchmark, we should be able to see how performance degrades as the data size grows while the number of nodes stays the same.
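One way to quantify that degradation is to normalize each run's wall-clock time by its input size and compare it against the smallest run. This is a minimal sketch, assuming wall-clock seconds per run is the metric of interest; the function name `time_per_tb_ratio` and the timings in the usage line are placeholders, not measurements from this benchmark.

```python
def time_per_tb_ratio(runs):
    """Return {size_tb: seconds-per-TB relative to the smallest run}.

    `runs` maps input size in TB to wall-clock seconds (a hypothetical input
    shape; the issue does not say how timings are recorded).
    A ratio of 1.0 means linear scaling; above 1.0 means per-TB cost grew.
    """
    if not runs:
        raise ValueError("no runs given")
    base_size = min(runs)
    base_rate = runs[base_size] / base_size
    return {size: (secs / size) / base_rate for size, secs in sorted(runs.items())}


# Placeholder timings for illustration only, not benchmark results.
example = time_per_tb_ratio({1: 100.0, 6: 900.0, 12: 2400.0})
```

With a fixed node count, ratios that climb with input size would point to a bottleneck beyond plain data volume.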
Decide what values we are interested in looking at (IO/Network/Map/Collect)
Store the log files in Google Drive after converting them to ASCII format
Extract information from log files
Store the output of analysis in CSV
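The extraction and CSV steps could look roughly like the sketch below. The log layout is not given in this issue, so the `<phase>: <seconds>s` line format, the regular expression, and the function names are all assumptions to be adapted to the real logs; only the phase names (IO/Network/Map/Collect) come from the list above.

```python
import csv
import io
import re

# Hypothetical log line format: "<phase>: <seconds>s", e.g. "Map: 12.5s".
# The real log layout is not specified in this issue; adjust the pattern.
LINE_RE = re.compile(r"^(IO|Network|Map|Collect):\s*([0-9]+(?:\.[0-9]+)?)s\s*$")


def extract_timings(lines):
    """Sum the seconds reported per phase across all matching lines."""
    totals = {}
    for line in lines:
        match = LINE_RE.match(line.strip())
        if match:
            phase, seconds = match.group(1), float(match.group(2))
            totals[phase] = totals.get(phase, 0.0) + seconds
    return totals


def write_csv(totals, out):
    """Write a header plus one (phase, seconds) row per phase to `out`."""
    writer = csv.writer(out, lineterminator="\n")
    writer.writerow(["phase", "seconds"])
    for phase in sorted(totals):
        writer.writerow([phase, totals[phase]])
```

For a real run, `lines` would be an open log file and `out` a file opened with `newline=""`; lines that do not match the pattern are skipped silently, which may or may not be what you want for malformed logs.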
@kotfic can we close this one?