Reading XML - Week 1 | Coursera

Reading XML

Video placeholder

Loading...

Getting and Cleaning Data

Johns Hopkins University

4.5 (8,048 ratings)

|

210K Students Enrolled

Course 3 of 5 in the Data Science: Foundations using R Specialization

Enroll for Free

Before you can work with data you have to get some. This course will cover the basic ways that data can be obtained. The course will cover obtaining data from the web, from APIs, from databases and from colleagues in various formats. It will also cover the basics of data cleaning and how to make data “tidy”. Tidy data dramatically speed downstream data analysis tasks. The course will also cover the components of a complete data set including raw data, processing instructions, codebooks, and processed data. The course will cover the basics needed for collecting, cleaning, and sharing data.

Skills You'll Learn

Data Manipulation, Regular Expression (REGEX), R Programming, Data Cleansing

Reviews

4.5 (8,048 ratings)

5 stars
67.48%
4 stars
23.65%
3 stars
5.86%
2 stars
1.62%
1 star
1.36%

BE

Oct 25, 2016

This course is really a challenging and compulsory for any one who wants to be a data scientist or working in any sort of data. It teaches you how to make very palatable data-set fro ma messy data.

NA

Jun 7, 2020

A very useful course. The audio quality of some lectures (especially those by the main instructor) was not good. This course completes the sister course of R programming and they work together.

From the lesson

Week 1

In this first week of the course, we look at finding data and reading different file types.

Obtaining Data Motivation5:38

Raw and Processed Data7:07

Components of Tidy Data9:25

Downloading Files7:09

Reading Local Files4:55

Reading Excel Files3:50

Reading XML12:39

Reading JSON5:03

The data.table Package11:18

Taught By

Jeff Leek, PhD
Chief Data Officer, Vice President, and J Orin Edson Foundation Chair of Biostatistics in Public Health Sciences
Roger D. Peng, PhD
Professor of Statistics and Data Sciences
Brian Caffo, PhD
Professor, Biostatistics

Try the Course for Free

Explore our Catalog

Join for free and get personalized recommendations, updates and offers.