This course is for novice programmers or business people who would like to understand the core tools used to wrangle and analyze big data. With no prior experience, you will have the opportunity to walk through hands-on examples with Hadoop and Spark frameworks, two of the most common in the industry. You will be comfortable explaining the specific components and basic processes of the Hadoop architecture, software stack, and execution environment. In the assignments you will be guided in how data scientists apply the important concepts and techniques such as Map-Reduce that are used to solve fundamental problems in big data. You'll feel empowered to have conversations about big data and the data analysis process.
UC San Diego is an academic powerhouse and economic engine, recognized as one of the top 10 public universities by U.S. News and World Report. Innovation is central to who we are and what we do. Here, students learn that knowledge isn't just acquired in the classroom—life is their laboratory.
- 5 stars45.41%
- 4 stars28.05%
- 3 stars12.38%
- 2 stars6.70%
- 1 star7.43%
Super hands on introduction to key Hadoop components, such as Spark, Map Reduce, Hive, Pig, HBase, HDFS, YARN, Squoop and Flume.
I can't wait to the next course on the specialization.
Very detailed , thorough introduction to a lot of the Hadoop ecosystem. Nice explanation and assignment to get a feel for Spark. At times a bit dry but altogether a well structured and taught course.
A very nice course covering the basics of the Hadoop ecosystem and Apache spark. The lectures are high quality and the presenters do a very good work of explaining the concepts. Thanks
This course gives a nice introduction to Hadoop basics. Unfortunatly, i faced many issues to work with cloudera VM and some commands in tutorials are obsolete. Thank you very much for your efforts.