课程信息
3.9
2,691 个评分
660 个审阅
This course is for novice programmers or business people who would like to understand the core tools used to wrangle and analyze big data. With no prior experience, you will have the opportunity to walk through hands-on examples with Hadoop and Spark frameworks, two of the most common in the industry. You will be comfortable explaining the specific components and basic processes of the Hadoop architecture, software stack, and execution environment. In the assignments you will be guided in how data scientists apply the important concepts and techniques such as Map-Reduce that are used to solve fundamental problems in big data. You'll feel empowered to have conversations about big data and the data analysis process....
Globe

100% 在线课程

立即开始,按照自己的计划学习。
Calendar

可灵活调整截止日期

根据您的日程表重置截止日期。
Clock

Approx. 19 hours to complete

建议:5 weeks of study, 1-2 hours/week...
Comment Dots

English

字幕:English...

您将获得的技能

Python ProgrammingApache HadoopMapreduceApache Spark
Globe

100% 在线课程

立即开始,按照自己的计划学习。
Calendar

可灵活调整截止日期

根据您的日程表重置截止日期。
Clock

Approx. 19 hours to complete

建议:5 weeks of study, 1-2 hours/week...
Comment Dots

English

字幕:English...

教学大纲 - 您将从这门课程中学到什么

Week
1
Clock
完成时间为 2 小时

Hadoop Basics

Welcome to the first module of the Big Data Platform course. This first module will provide insight into Big Data Hype, its technologies opportunities and challenges. We will take a deeper look into the Hadoop stack and tool and technologies associated with Big Data solutions. ...
Reading
7 个视频(共 53 分钟), 4 个阅读材料, 1 个测验
Video7 个视频
The Apache Framework: Basic Modules3分钟
Hadoop Distributed File System (HDFS)5分钟
The Hadoop "Zoo"5分钟
Hadoop Ecosystem Major Components11分钟
Exploring the Cloudera VM: Hands-On Part 116分钟
Exploring the Cloudera VM: Hands-On Part 26分钟
Reading4 个阅读材料
Apache Hadoop Ecosystem10分钟
Lesson 1 Slides (PDF)10分钟
Hardware & Software Requirements10分钟
Lesson 2 Slides - Cloudera VM Tour10分钟
Quiz1 个练习
Basic Hadoop Stack20分钟
Week
2
Clock
完成时间为 3 小时

Introduction to the Hadoop Stack

In this module we will take a detailed look at the Hadoop stack ranging from the basic HDFS components, to application execution frameworks, and languages, services....
Reading
10 个视频(共 70 分钟), 6 个阅读材料, 3 个测验
Video10 个视频
The Hadoop Distributed File System (HDFS) and HDFS28分钟
MapReduce Framework and YARN8分钟
The Hadoop Execution Environment4分钟
YARN, Tez, and Spark11分钟
Hadoop Resource Scheduling6分钟
Hadoop-Based Applications3分钟
Introduction to Apache Pig7分钟
Introduction to Apache HIVE7分钟
Introduction to Apache HBASE7分钟
Reading6 个阅读材料
Hadoop Basics - Lesson 1 Slides10分钟
Lesson 2: Hadoop Execution Environment - Slides10分钟
Lesson 3: Hadoop-Based Applications Overview - All Slides10分钟
Command list for Applications Slides10分钟
Tips to handle service connection errors10分钟
References for Applications10分钟
Quiz3 个练习
Overview of Hadoop Stack10分钟
Hadoop Execution Environment14分钟
Hadoop Applications12分钟
Week
3
Clock
完成时间为 2 小时

Introduction to Hadoop Distributed File System (HDFS)

In this module we will take a detailed look at the Hadoop Distributed File System (HDFS). We will cover the main design goals of HDFS, understand the read/write process to HDFS, the main configuration parameters that can be tuned to control HDFS performance and robustness, and get an overview of the different ways you can access data on HDFS....
Reading
9 个视频(共 58 分钟), 5 个阅读材料, 3 个测验
Video9 个视频
The HDFS Performance Envelope5分钟
Read/Write Processes in HDFS4分钟
HDFS Tuning Parameters6分钟
HDFS Performance and Robustness9分钟
Overview of HDFS Access, APIs, and Applications5分钟
HDFS Commands8分钟
Native Java API for HDFS4分钟
REST API for HDFS8分钟
Reading5 个阅读材料
Lesson 1: Introduction to HDFS - Slides10分钟
HDFS references10分钟
Lesson 2: HDFS Performance and Tuning - Slides10分钟
HDFS Access, APIs10分钟
Lesson 3: HDFS Access, APIs, Applications - Slides10分钟
Quiz3 个练习
HDFS Architecture12分钟
HDFS performance,tuning, and robustness10分钟
Accessing HDFS12分钟
Week
4
Clock
完成时间为 7 小时

Introduction to Map/Reduce

This module will introduce Map/Reduce concepts and practice. You will learn about the big idea of Map/Reduce and you will learn how to design, implement, and execute tasks in the map/reduce framework. You will also learn the trade-offs in map/reduce and how that motivates other tools....
Reading
9 个视频(共 27 分钟), 3 个阅读材料, 3 个测验
Video9 个视频
The Map/Reduce Framework2分钟
A MapReduce Example: Wordcount in detail4分钟
MapReduce: Intro to Examples and Principles2分钟
MapReduce Example: Trending Wordcount1分钟
MapReduce Example: Joining Data4分钟
MapReduce Example: Vector Multiplication2分钟
Computational Costs of Vector Multiplication3分钟
MapReduce Summary2分钟
Reading3 个阅读材料
Lesson 1: Introduction to MapReduce - Slides10分钟
A note on debugging map/reduce programs.10分钟
Lesson 2: MapReduce Examples and Principles - Slides10分钟
Quiz1 个练习
Lesson 1 Review14分钟
3.9
Briefcase

83%

通过此课程获得实实在在的工作福利

热门审阅

创建者 GMFeb 1st 2016

I'm forced to give 5 stars. I don't want to have a certification on a poor quality course (another coursera mistake). This material needs tremendous amount of work to get finished and revised.

创建者 GCOct 25th 2015

Super hands on introduction to key Hadoop components, such as Spark, Map Reduce, Hive, Pig, HBase, HDFS, YARN, Squoop and Flume.\n\nI can't wait to the next course on the specialization.

讲师

Natasha Balac

Director, Predictive Analytics Center of Excellence (PACE)
San Diego Supercomputer Center

Paul Rodriguez

Research Programmer
San Diego Supercomputer Center (SDSC)

Andrea Zonca

HPC Applications Specialist
San Diego Supercomputer Center (SDSC)

关于 University of California San Diego

UC San Diego is an academic powerhouse and economic engine, recognized as one of the top 10 public universities by U.S. News and World Report. Innovation is central to who we are and what we do. Here, students learn that knowledge isn't just acquired in the classroom—life is their laboratory....

常见问题

  • Once you enroll for a Certificate, you’ll have access to all videos, quizzes, and programming assignments (if applicable). Peer review assignments can only be submitted and reviewed once your session has begun. If you choose to explore the course without purchasing, you may not be able to access certain assignments.

  • When you purchase a Certificate you get access to all course materials, including graded assignments. Upon completing the course, your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile. If you only want to read and view the course content, you can audit the course for free.

还有其他问题吗?请访问 学生帮助中心