LinuxFoundationX: LFS103x Introduction to Apache Hadoop (copyright by edX)
Next Steps in Mastering the Hadoop Ecosystem
Even though this course has a lot of material packed into it, we have barely scratched the surface of most Hadoop ecosystem components.
We hope it was enough to convince you that big data in general, and Hadoop in particular, are not scary and can, in fact, be a lot of fun.
There is a lot of value in advancing your understanding of the Hadoop ecosystem and becoming a data professional. Here are a few pointers to get you going:
Documentation published by Hadoop and its ecosystem projects
Since almost all Hadoop ecosystem projects (including Hadoop itself) are developed as Apache Software Foundation projects, you should make yourself comfortable with their websites and the documentation published there. Here are a few for the projects we have seen in this course:
Apache Hadoop Documentation
Apache Spark Documentation
Apache Hive Documentation
Apache Ambari Documentation
Apache Bigtop Documentation
Hadoop: The Definitive Guide, 4th Edition by Tom White
If you plan to get only one book on Hadoop and its ecosystem, get this one. It provides very comprehensive coverage of all the Hadoop components we have talked about, and then some!
Hadoop vendors' training and certification programs
All Hadoop vendors provide training and certification programs. If you are already working with a Hadoop distribution that came from a vendor, make sure to check that vendor's website.
Finally, while continuing on your Hadoop journey, keep in mind that the beauty of open source is not simply that you get to play with all this software for free; most importantly, you get to be part of a very vibrant community. Whenever you feel stuck, have a question, or would like to share your experience, join the project communities and get that conversation going. And, who knows, one of those conversations may eventually turn into your first direct contribution to the project itself (be it code, documentation, or a wiki page). Joining is easy; all it takes is a mailing list subscription:
Join Hadoop community
Join Spark community
Join Hive community
Join Ambari community
Join Bigtop community
So, what are you waiting for? It takes a village to raise the Hadoop elephant, and the only person still missing in that global village we call the Apache Software Foundation is you!