课程信息
4.5
5,007 ratings
806 reviews
Before you can work with data you have to get some. This course will cover the basic ways that data can be obtained. The course will cover obtaining data from the web, from APIs, from databases and from colleagues in various formats. It will also cover the basics of data cleaning and how to make data “tidy”. Tidy data dramatically speed downstream data analysis tasks. The course will also cover the components of a complete data set including raw data, processing instructions, codebooks, and processed data. The course will cover the basics needed for collecting, cleaning, and sharing data....
Stacks

Course 3 of 10 in the

Globe

100% 在线课程

立即开始,按照自己的计划学习。
Calendar

可灵活调整截止日期

根据您的日程表重置截止日期。
Clock

建议:5 hours/week

完成时间大约为14 小时
Comment Dots

English

字幕:English, French, Chinese (Simplified), Vietnamese, Russian

您将学到的内容有

  • Check
    Apply data cleaning basics to make data "tidy"
  • Check
    Obtain usable data from the web, APIs, and databases
  • Check
    Understand common data storage systems
  • Check
    Use R for text and date manipulation

您将获得的技能

R ProgrammingData CleansingRegular ExpressionData Manipulation
Stacks

Course 3 of 10 in the

Globe

100% 在线课程

立即开始,按照自己的计划学习。
Calendar

可灵活调整截止日期

根据您的日程表重置截止日期。
Clock

建议:5 hours/week

完成时间大约为14 小时
Comment Dots

English

字幕:English, French, Chinese (Simplified), Vietnamese, Russian

教学大纲 - 您将从这门课程中学到什么

1

章节
Clock
完成时间为 2 小时

Week 1

In this first week of the course, we look at finding data and reading different file types....
Reading
9 个视频(共 67 分钟), 4 个阅读材料, 1 个测验
Video9 个视频
Raw and Processed Data7分钟
Components of Tidy Data9分钟
Downloading Files7分钟
Reading Local Files4分钟
Reading Excel Files3分钟
Reading XML12分钟
Reading JSON5分钟
The data.table Package11分钟
Reading4 个阅读材料
Welcome to Week 110分钟
Syllabus10分钟
Pre-Course Survey10分钟
Practical R Exercises in swirl Part 110分钟
Quiz1 个练习
Week 1 Quiz10分钟

2

章节
Clock
完成时间为 1 小时

Week 2

Welcome to Week 2 of Getting and Cleaning Data! The primary goal is to introduce you to the most common data storage systems and the appropriate tools to extract data from web or from databases like MySQL. ...
Reading
5 个视频(共 41 分钟), 1 个测验
Video5 个视频
Reading from HDF56分钟
Reading from The Web6分钟
Reading From APIs7分钟
Reading From Other Sources4分钟
Quiz1 个练习
Week 2 Quiz10分钟

3

章节
Clock
完成时间为 10 小时

Week 3

Welcome to Week 3 of Getting and Cleaning Data! This week the lectures will focus on organizing, merging and managing the data you have collected using the lectures from Weeks 1 and 2. ...
Reading
7 个视频(共 60 分钟), 1 个阅读材料, 4 个测验
Video7 个视频
Summarizing Data11分钟
Creating New Variables10分钟
Reshaping Data9分钟
Managing Data Frames with dplyr - Introduction3分钟
Managing Data Frames with dplyr - Basic Tools12分钟
Merging Data6分钟
Reading1 个阅读材料
Practical R Exercises in swirl Part 210分钟
Quiz1 个练习
Week 3 Quiz10分钟

4

章节
Clock
完成时间为 6 小时

Week 4

Welcome to Week 4 of Getting and Cleaning Data! This week we finish up with lectures on text and date manipulation in R. In this final week we will also focus on peer grading of Course Projects. ...
Reading
5 个视频(共 34 分钟), 2 个阅读材料, 3 个测验
Video5 个视频
Regular Expressions I5分钟
Regular Expressions II8分钟
Working with Dates6分钟
Data Resources3分钟
Reading2 个阅读材料
Practical R Exercises in swirl Part 410分钟
Post-Course Survey10分钟
Quiz1 个练习
Week 4 Quiz10分钟
4.5
Direction Signs

40%

完成这些课程后已开始新的职业生涯
Briefcase

83%

通过此课程获得实实在在的工作福利

热门审阅

创建者 DHFeb 2nd 2016

Easy, mostly instructive Course. The Assignments and quizzes are quite good, and illustrates the lessons very well.\n\nSee the videos for general presentation, but use the energy on the excersizes.

创建者 BEOct 26th 2016

This course is really a challenging and compulsory for any one who wants to be a data scientist or working in any sort of data. It teaches you how to make very palatable data-set fro ma messy data.

讲师

Jeff Leek, PhD

Associate Professor, Biostatistics
Bloomberg School of Public Health

Roger D. Peng, PhD

Associate Professor, Biostatistics
Bloomberg School of Public Health

Brian Caffo, PhD

Professor, Biostatistics
Bloomberg School of Public Health

关于 Johns Hopkins University

The mission of The Johns Hopkins University is to educate its students and cultivate their capacity for life-long learning, to foster independent and original research, and to bring the benefits of discovery to the world....

关于 Data Science 专项课程

Ask the right questions, manipulate data sets, and create visualizations to communicate results. This Specialization covers the concepts and tools you'll need throughout the entire data science pipeline, from asking the right kinds of questions to making inferences and publishing results. In the final Capstone Project, you’ll apply the skills learned by building a data product using real-world data. At completion, students will have a portfolio demonstrating their mastery of the material....
Data Science

常见问题

  • Once you enroll for a Certificate, you’ll have access to all videos, quizzes, and programming assignments (if applicable). Peer review assignments can only be submitted and reviewed once your session has begun. If you choose to explore the course without purchasing, you may not be able to access certain assignments.

  • When you enroll in the course, you get access to all of the courses in the Specialization, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile. If you only want to read and view the course content, you can audit the course for free.

还有其他问题吗?请访问 学生帮助中心