Yelp / mrjob

Run MapReduce jobs on Hadoop or Amazon Web Services
http://packages.python.org/mrjob/
Other
2.61k stars 587 forks source link

support MapR on EMR #1018

Open coyotemarin opened 9 years ago

coyotemarin commented 9 years ago

EMR offers MapR on some AMIs; we should give mrjob users a way to access it.

The EMR docs are a bit circumspect about what you need to pass to the API to get MapR (they talk about the web interface and the command line tool), but it has something to do with the NewSupportedProducts parameter to RunJobFlow.

coyotemarin commented 8 years ago

Multi-step jobs should store their intermediate data in maprfs://..., since there is no HDFS. See #1325 for the analogous issue in the Hadoop runner.