Closed mengxr closed 10 years ago
I got broken pipe error when copying the data to HDFS because of poor connection. It should be better if we copy small data sets before the biggest wikipedia counts.
Sure, sounds good. In the past I launch all of the clusters from a driver node on ec2... but good idea to have this!
I got broken pipe error when copying the data to HDFS because of poor connection. It should be better if we copy small data sets before the biggest wikipedia counts.