SWIMProjectUCB / SWIM

Statistical Workload Injector for MapReduce - Project at UC Berkeley AMP Lab
https://github.com/SWIMProjectUCB/SWIM/wiki

How can I find the "FB-2009" and "FB-2010" data? #12

Open ayat-khairy opened 5 years ago

ayat-khairy commented 5 years ago

FB-2009 comes from historical Hadoop traces on a 600-machine cluster at Facebook. The original trace spans 6 months from May 2009 to October 2009, and contains roughly 1 million jobs.

FB-2010 comes from historical Hadoop traces on the same cluster at Facebook, now grown to 3000 machines. The original trace spans 1.5 months from October 2010 to November 2010, and also contains roughly 1 million jobs.

I am wondering how I can access this data. ... Thanks!