Netflix / PigPen

Map-Reduce for Clojure
Apache License 2.0
565 stars 55 forks source link

EMR Docs #53

Open mbossenbroek opened 10 years ago

mbossenbroek commented 10 years ago

Add docs for how to run e2e on EMR w/ s3

technion commented 7 years ago

@mbossenbroek I'm interested in doing this - does PigPen introduce any complications beyond AWS' documented "submit a pig script" process?

If I can get enough hints to make it work I'd be happy to write a step by step for this issue.

mbossenbroek commented 7 years ago

I don't believe it should be any different. You should be able to build an uberjar, generate a script, and run it. Looking forward to see what you come up with!

technion commented 7 years ago

Hi @mbossenbroek,

Unfortunately I can't file a PR against the wiki, so I'm unsure how you'd like me to best supply these docs.

I've documented PigPen in an EMR environment for our use here, the guide is completely generic and you're free to use it:

https://github.com/technion/bigdatawanna/tree/master/pigpen

mbossenbroek commented 7 years ago

Thanks! I'll incorporate this into the wiki