0:00
A lot of the action in machine learning has focused on which
algorithms are best for
extracting information and using it to predict.
But it's important to step back and look at the entire prediction problem.
This is a little diagram that I made to illustrate
some of the key issues in building a predictor.
So you start off with, suppose I want to
predict for these dots whether they're red or blue.
Well, what you might do is have a big group of dots that you
want to predict about, and then you use
probability and sampling to pick a training set.
The training set will consist of some red dots and some blue
dots, and you'll measure a whole bunch of characteristics of those dots.
Then you'll use those characteristics to build what's called a
prediction function, and the prediction function will take a new dot,
whose color you don't know, but using those characteristics that
you measured will predict whether it's red or whether it's blue.
Then you can go off and try to
evaluate whether that prediction function works well or not.
1:03
A required component of building every machine learning algorithm
is deciding which samples you're going to use to build that algorithm.
But it's sometimes overlooked, because all of the action that you hear about in
machine learning happens down here, when you're
building the actual machine learning function itself.
1:19
One very high-profile example of the ways that this
can cause problems is the recent discussion about Google Flu Trends.
Google Flu Trends tried to use the terms that people were typing into
Google, terms like "I have a cough", to predict how often people would get the flu.
In other words, what was the rate of flu that was going
on in a particular part of the United States at a particular time?
1:41
And they compared their algorithm to the approach taken
by the United States government, where they went out
and actually measured how many people were
getting the flu in different places in the US.
And they found in their original paper that the
Google Flu Trends algorithm was able to very accurately represent
the number of flu cases that would appear in
various places in the US at any given time.
But it was quite a bit faster and quite a
bit less expensive to measure using search terms at Google.
The problem that they didn't realize at the time was that
the search terms that people would use would change over time.
They might use different terms when they were
searching, and that would affect the algorithm's performance.
And also, the way that those terms were actually
being used in the algorithm wasn't very well understood.
And so when the function of a particular search
term in the algorithm changed, it could cause problems.
And this led to highly inaccurate results for the Google
Flu Trends algorithm over time, as people's internet usage changed.
So this gives you an idea that choosing
the right dataset and knowing what the specific
question is are again paramount, just as they have
been in the other classes of the Data Science Specialization.
So here are the components of a predictor.
You need to start off, as always in any data science problem,
with a very specific and well-defined question.
What are you trying to predict, and what are you trying to predict it with?
2:56
Then you go out and you collect the best
input data that you can to be able to predict.
And from that data, you might either use measured
characteristics that you already have, or you might use computations
to build features that you think might be
useful for predicting the outcome that you care about.
At this stage you can actually start to use the machine learning
algorithms you may have read about, such as random forests or decision trees.
And then what you can do is estimate the
parameters of those algorithms, use those parameters to
apply the algorithm to a new dataset, and
then finally evaluate that algorithm on the new data.
So I'm going to show you one quick little
example of how this process works.
This is obviously a trivialized version of what would happen in a
real machine learning algorithm, but it gives you a flavor of what's going on.
So you start off by asking a question.
In general, people usually start with a quite general question.
So here it is: can I automatically detect emails
that are SPAM from those that are not?
SPAM emails are emails that come from
companies, get sent out to thousands of
people at the same time, and that you
might not be interested in.
4:02
So you might want to make your question a little bit more concrete;
you often need to when doing machine learning.
So the question might be: can I use
quantitative characteristics of those emails to classify them as
SPAM, or as what we're going to call HAM, which
is email that people would actually like to receive?
4:19
So once you have your question, then you need to find input data.
In this case, there's actually a bunch of data
that's available and already pre-processed for us in R.
It's in the kernlab package,
K-E-R-N-L-A-B, and it's the spam dataset.
So we can load that dataset into R directly, and it has some information
that's been collected about SPAM and HAM emails already available to us.
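Just as a sketch of what that looks like in R, assuming you have the kernlab package installed:

    library(kernlab)   # install.packages("kernlab") first if needed
    data(spam)         # loads a data frame called spam
    dim(spam)          # 4601 emails, 57 quantitative features plus the type label
    table(spam$type)   # how many emails are labeled "nonspam" vs. "spam"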
Now we might want to keep in mind that this might
not necessarily be the perfect data; in fact, we don't have all
of the emails that have been collected over time, and we
don't have the emails that are being sent to you personally.
So we need to be aware of the potential limitations of this
data when we're using it to build a prediction algorithm.
4:58
Then we want to calculate some features.
So imagine that you have a bunch of emails,
and here's an example email that's been sent to me:
"Dear Jeff, can you send me the address, so I can send you the invitation.
Thanks, Ben."
If we want to build a prediction algorithm,
we need to calculate some characteristics of
these emails that we can use to build it.
And so one example might be to
calculate the frequency with which a particular word appears.
So here, we're looking for the frequency with which the word "you" appears.
And in this case, it appears twice in this email, so 2 out
of 17 words, or about 11% of the words in this email, are "you".
We could calculate that same percentage for every single email that we have, and
now we have a quantitative characteristic that we can try to use to predict.
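Here's a minimal R sketch of that calculation, using the example email quoted above:

    email <- "Dear Jeff, can you send me the address so I can send you the invitation. Thanks, Ben."
    words <- tolower(unlist(strsplit(email, "[^[:alpha:]]+")))  # split on non-letters
    words <- words[words != ""]                                 # drop any empty strings
    mean(words == "you")                                        # 2 of 17 words, about 0.12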
5:43
So the data in the kernlab package that I've shown here are actually
information just like that: for every email, we
have the frequency with which certain words appear.
And so, for example, if "credit" appears very often in the email, or "money" appears
very often in the email, you might imagine that that email might be a SPAM email.
So, as one example of that, we looked at the frequency
of the word "your" in each email.
And so I've got a plot here that's a density plot of that data.
On the x-axis is the frequency
with which "your" appeared in the email,
and on the y-axis is the density, or how
often that frequency appears among the emails.
And so what you can see is that most of the emails that are SPAM, those are the
ones that are in red, tend to have
more appearances of the word "your".
Whereas the emails that are HAM, the
ones that we actually want to receive, have a much higher peak
right over here down near 0, so there are very few
HAM emails in which "your" appears a large number of times.
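A plot like this one can be generated with something along these lines, since the column spam$your holds the frequency of "your" for each email:

    # density of the frequency of "your": HAM (nonspam) in blue, SPAM in red
    plot(density(spam$your[spam$type == "nonspam"]),
         col = "blue", main = "", xlab = "Frequency of 'your' in email")
    lines(density(spam$your[spam$type == "spam"]), col = "red")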
6:49
So we can build an algorithm; in this case, let's build a very, very simple one.
We can estimate an algorithm where we just want to find a cutoff, a constant C, where
if the frequency of "your" is above C then
we predict SPAM, and otherwise we predict that it's HAM.
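Written out in R, that rule is just a comparison against the cutoff; predictType here is a hypothetical helper name, not something from the course:

    # predict SPAM when the frequency of "your" exceeds the cutoff C
    predictType <- function(freqYour, C) {
      ifelse(freqYour > C, "spam", "nonspam")
    }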
7:05
So going back to our data, we can try to figure out what
that best cutoff is, and here's an example of a cutoff that you could
choose: if the frequency is above 0.5, then we
say that it's SPAM, and if it's below 0.5, we say that it's HAM.
And we think this might work, because you can see that
the large spike of blue HAM messages is below that cutoff,
whereas one of the big spikes of the SPAM messages is above that cutoff.
So you might imagine that will catch quite a bit of that SPAM.
So then what we do is evaluate that.
What we would do is calculate, for
example, predictions for each of the different emails.
We take a prediction that says, if the frequency of "your" is
above 0.5, then you're spam, and if it's below, then you're nonspam.
And then we make a table of those predictions and divide
it by the number of observations that we have.
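Concretely, that evaluation looks something like this in R:

    prediction <- ifelse(spam$your > 0.5, "spam", "nonspam")
    # fraction of all emails in each (prediction, truth) cell;
    # the diagonal holds the correct calls
    table(prediction, spam$type) / length(spam$type)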
And so what we see is that about 45% of
the time, 46% of the time, the email is nonspam and we get it right.
About 29% of the time, it's spam and we get it right.
So in total, we get it right about 45% plus 29%, or about 75% of the time.
So our prediction algorithm is about 75% accurate in this particular case.
So that's how we would evaluate the algorithm.
This is, of course, on the same dataset where we actually built
the prediction function, and as we will see in later lectures,
this will be an optimistic estimate of the overall error rate.
So that's an overview of the basic steps in building a predictive algorithm.