Statistical Data Visualization with Seaborn From UST

4.6 (152 ratings)
Offered by Coursera Project Network
6,932 already enrolled
In this free Guided Project, you will:

Produce and customize various chart types with Seaborn

Apply feature selection and feature extraction methods with scikit-learn

Build a boosted decision tree classifier with XGBoost

Showcase this hands-on experience in an interview

1.5 hours
Intermediate
No download needed
Split-screen video
English
Desktop only

Welcome to this Guided Project on Statistical Data Visualization with Seaborn, From UST. For more than 20 years, UST has worked side by side with the world’s best companies to make a real impact through transformation. Powered by technology, inspired by people and led by their purpose, they partner with clients from design to operation. With this Guided Project from UST, you can quickly build in-demand job skills and expand your career opportunities in the Data Science field.

Producing visualizations is an important first step in exploring and analyzing real-world data sets. Visualization is therefore an indispensable method in any data scientist's toolbox, as well as a powerful tool for identifying problems in analyses and for illustrating results. In this project, we will use the statistical data visualization library Seaborn to discover and explore the relationships in the Breast Cancer Wisconsin (Diagnostic) data set.

Using the exploratory data analysis (EDA) results from the Breast Cancer Diagnosis – Exploratory Data Analysis Guided Project, you will practice dropping correlated features and apply several feature selection and feature extraction methods: feature selection with correlation, univariate feature selection, recursive feature elimination, principal component analysis (PCA), and tree-based feature selection. Lastly, we will build a boosted decision tree classifier with XGBoost to classify tumors as either malignant or benign. By the end of this Guided Project, you should feel more confident about working with data and creating visualizations for data analysis, and you will have practiced several methods that apply to a Data Scientist’s role. Let's get started!
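To give a feel for the kind of exploration described above, here is a minimal sketch and not the project's actual notebook. It assumes the copy of the Breast Cancer Wisconsin (Diagnostic) data that ships with scikit-learn. The choice of plots (a correlation heatmap and a violin plot), the restriction to the ten "mean" features, and the output file name are illustrative assumptions.

```python
import matplotlib
matplotlib.use("Agg")  # headless backend so the script runs without a display
import matplotlib.pyplot as plt
import seaborn as sns
from sklearn.datasets import load_breast_cancer

# Assumption: use scikit-learn's bundled copy of the data set.
data = load_breast_cancer(as_frame=True)
df = data.frame.copy()  # 30 numeric features plus a "target" column

# In scikit-learn's encoding, 0 is malignant and 1 is benign.
df["diagnosis"] = df["target"].map({0: "malignant", 1: "benign"})

# Illustrative subset: the ten features whose names start with "mean ".
mean_cols = [c for c in data.feature_names if c.startswith("mean ")]
corr = df[mean_cols].corr()

fig, axes = plt.subplots(1, 2, figsize=(14, 6))

# Heatmap of pairwise correlations; strongly correlated pairs are
# candidates for dropping before modelling.
sns.heatmap(corr, annot=True, fmt=".2f", cmap="coolwarm", ax=axes[0])
axes[0].set_title("Correlation of mean features")

# Distribution of one feature split by diagnosis.
sns.violinplot(data=df, x="diagnosis", y="mean radius", ax=axes[1])
axes[1].set_title("Mean radius by diagnosis")

fig.tight_layout()
fig.savefig("breast_cancer_eda.png")  # hypothetical output file name
```

In the heatmap, pairs such as mean radius and mean perimeter show up as almost perfectly correlated, which is the kind of redundancy that motivates dropping correlated features.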

Prerequisites

Some experience with basic Python programming and a general understanding of machine learning.

Skills you will develop

  • Data Science
  • Machine Learning
  • Python Programming
  • Seaborn
  • Data Visualization (DataViz)

Learn step by step

In a video that plays in a split screen alongside your workspace, your instructor will walk you through each of these steps:

  1. Project Overview

  2. Importing Libraries and Data

  3. Dropping Correlated Columns from Feature List

  4. Classification using XGBoost (minimal feature selection)

  5. Univariate Feature Selection

  6. Recursive Feature Elimination with Cross-Validation

  7. Plot CV Scores vs Number of Features Selected

  8. Feature Extraction using Principal Component Analysis
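The steps listed above can be sketched roughly as follows. This is an illustration under stated assumptions, not the instructor's code: the 0.9 correlation threshold, `k=5`, the random forest used inside RFECV, the two PCA components, and the train/test split are all arbitrary choices made here. To keep the example dependent on scikit-learn alone, scikit-learn's `GradientBoostingClassifier` stands in for XGBoost; `xgboost.XGBClassifier` follows the same `fit`/`predict` interface and could be swapped in.

```python
import numpy as np
from sklearn.datasets import load_breast_cancer
from sklearn.decomposition import PCA
from sklearn.ensemble import GradientBoostingClassifier, RandomForestClassifier
from sklearn.feature_selection import RFECV, SelectKBest, f_classif
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler

# Step 2: load the data (assumption: scikit-learn's bundled copy).
X, y = load_breast_cancer(return_X_y=True, as_frame=True)

# Step 3: for every pair with |correlation| > 0.9 (assumed threshold),
# drop the feature that appears later in the column order.
corr = X.corr().abs()
upper = corr.where(np.triu(np.ones(corr.shape, dtype=bool), k=1))
to_drop = [col for col in upper.columns if (upper[col] > 0.9).any()]
X_reduced = X.drop(columns=to_drop)

X_train, X_test, y_train, y_test = train_test_split(
    X_reduced, y, test_size=0.3, random_state=42, stratify=y
)

# Step 4: boosted-tree baseline (stand-in for XGBoost, see note above).
clf = GradientBoostingClassifier(random_state=42)
clf.fit(X_train, y_train)
baseline_acc = accuracy_score(y_test, clf.predict(X_test))

# Step 5: univariate selection, keeping the 5 best features by ANOVA F-score.
selector = SelectKBest(score_func=f_classif, k=5).fit(X_train, y_train)
top5 = list(X_train.columns[selector.get_support()])

# Step 6: recursive feature elimination with 5-fold cross-validation.
rfecv = RFECV(
    estimator=RandomForestClassifier(n_estimators=50, random_state=42),
    step=1,
    cv=5,
    scoring="accuracy",
)
rfecv.fit(X_train, y_train)
rfe_features = list(X_train.columns[rfecv.support_])

# Step 7: in recent scikit-learn versions the per-size CV scores are in
# rfecv.cv_results_["mean_test_score"] and can be plotted against the
# number of features selected.

# Step 8: PCA on standardized features (PCA is sensitive to feature scale).
scaler = StandardScaler().fit(X_train)
pca = PCA(n_components=2).fit(scaler.transform(X_train))
explained = pca.explained_variance_ratio_

print(f"dropped {len(to_drop)} correlated features")
print(f"baseline accuracy: {baseline_acc:.3f}")
print(f"top 5 univariate features: {top5}")
print(f"RFECV kept {rfecv.n_features_} features")
print(f"variance explained by 2 components: {explained.sum():.3f}")
```

The scaler, selectors, and PCA are all fitted on the training split only, so no information from the test split leaks into the choice of features.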

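How Guided Projects work

Your workspace is a cloud desktop right in your browser, with no download required

In a split-screen video, your instructor guides you step by step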
