Open jldiaz opened 8 years ago
Thanks for reporting this. The current configuration requires at least 3GB of memory per machine (see https://github.com/GoogleCloudPlatform/PerfKitBenchmarker/blob/master/perfkitbenchmarker/data/hadoop/mapred-site.xml#L27). I think this was likely copied from the setup instructions here: https://hadoop.apache.org/docs/current/hadoop-project-dist/hadoop-common/ClusterSetup.html
It shouldn't be hard to make this run on smaller machines (or at the very least to warn the user if they attempt to run it on a machine with less memory). Until this is fixed, running on any machine type with more than 3GB of RAM (e.g. the Standard_D1 type) should work.
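For what it's worth, the "warn the user" part could look roughly like the sketch below. This is not existing PerfKitBenchmarker code: `check_machine_memory` and `MIN_MEMORY_MB` are hypothetical names, and the 3 GB threshold is only assumed from the figure quoted above, so the real value would have to be derived from the Hadoop configuration.

```python
import warnings

# Assumption: 3 GB floor, taken from the figure mentioned in this thread.
# The real limit would have to come from the Hadoop configuration files.
MIN_MEMORY_MB = 3 * 1024


def check_machine_memory(total_memory_mb, min_memory_mb=MIN_MEMORY_MB):
    """Return True if the machine meets the memory floor.

    Emits a warning and returns False when it does not.
    (Hypothetical helper, not part of PerfKitBenchmarker.)
    """
    if total_memory_mb < min_memory_mb:
        warnings.warn(
            "Machine has %d MB of RAM, but the Hadoop configuration expects "
            "at least %d MB; YARN applications may never leave the ACCEPTED "
            "state." % (total_memory_mb, min_memory_mb)
        )
        return False
    return True
```

With these assumptions, `check_machine_memory(1792)` (a 1.75 GB machine) warns and returns `False`, while `check_machine_memory(3584)` (a 3.5 GB machine) returns `True`.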
I ran
When the script finally reached the point at which it invokes `yarn` to run TeraGen, it got stuck for hours waiting for it to finish. I logged into the master machine via ssh and used `yarn` to discover that the TeraGen application was ACCEPTED but still not RUNNING after 11 hours. No idea what the issue could be. Insufficient memory? Which instance type is the minimum required to run this benchmark? By default `pkb` uses Small, which translates into A1-Standard (1 core, 1.75 GB RAM).