Closed deathly809 closed 5 years ago
Some users might need to use replication to ensure no data loss. Also, in theory MR should run faster as you can schedule jobs over multiple nodes since data is replicated across multiple nodes.
Some users might need to use replication to ensure no data loss. Also, in theory MR should run faster as you can schedule jobs over multiple nodes since data is replicated across multiple nodes.