课程信息
4.0
107 个评分
26 个审阅
100% 在线

100% 在线

立即开始,按照自己的计划学习。
可灵活调整截止日期

可灵活调整截止日期

根据您的日程表重置截止日期。
中级

中级

完成时间(小时)

完成时间大约为14 小时

建议:4 weeks of study, 3-6 hours/week...
可选语言

英语(English)

字幕:英语(English)

您将获得的技能

Python ProgrammingStatistical AnalysisSentiment AnalysisR Programming
100% 在线

100% 在线

立即开始,按照自己的计划学习。
可灵活调整截止日期

可灵活调整截止日期

根据您的日程表重置截止日期。
中级

中级

完成时间(小时)

完成时间大约为14 小时

建议:4 weeks of study, 3-6 hours/week...
可选语言

英语(English)

字幕:英语(English)

教学大纲 - 您将从这门课程中学到什么

1
完成时间(小时)
完成时间为 3 小时

Introduction to Data Analytics

In this first unit of the course, several concepts related to social media data and data analytics are introduced. We start by first discussing two kinds of data - structured and unstructured. Then look at how structured data, the primary focus of this course, is analyzed and what one could gain by doing such analysis. Finally, we briefly cover some of the visualizations for exploring and presenting data.Make sure to go through the material for this unit in the sequence it's provided. First, watch the four short videos, then take the practice test, followed by the two quizzes. Finally, read the documents about installation and configuration of Python and R. This is very important - before proceeding to the next units, make sure you have installed necessary tools, and also learned how to install new packages/libraries for them. The course expects students to have programming experience in Python and R....
Reading
4 个视频 (总计 33 分钟), 4 个阅读材料, 2 个测验
Video4 个视频
Video-2: Structured vs. Unstructured Data10分钟
Video-3: Analyzing Structured Data10分钟
Video-4: Visualization of Data8分钟
Reading4 个阅读材料
Anaconda Installation20分钟
Python installation, configuration, and usage30分钟
R installation30分钟
R/RStudio Setup Guide (on Windows)20分钟
Quiz2 个练习
Quiz-115分钟
Quiz-215分钟
2
完成时间(小时)
完成时间为 4 小时

Collecting and Extracting Social Media Data

In this unit we will see how to collect data from Twitter and YouTube. The unit will start with an introduction to Python programming. Then we will use a Python script, with a little editing, to extract data from Twitter. A similar exercise will then be done with YouTube. In both the cases, we will also see how to create developer accounts and what information to obtain to use the data collection APIs. Once again, make sure to go item-by-item in the order provided. Before beginning this unit, ensure that you have all the right tools (Python, R, Anaconda) ready and configured. The lessons depend on them and also your ability to install required packages....
Reading
4 个视频 (总计 47 分钟), 6 个阅读材料, 3 个测验
Video4 个视频
Video-2: Introduction to Python Programming16分钟
Video-3: Using Python to Extract Data from Twitter15分钟
Video-4: Using Python to Extract Data from YouTube11分钟
Reading6 个阅读材料
Errata: please read this first1分钟
Python Packages Installation5分钟
(Optional) Introduction to Python for Econometrics, Statistics and Data Analysis30分钟
Script: twitter_search.py0
Twitter libraries10分钟
Script: youtube_search.py0
Quiz2 个练习
Python Programming Exercise2分钟
YouTube data download using Python6分钟
3
完成时间(小时)
完成时间为 4 小时

Data Analysis, Visualization, and Exploration

In this unit, we will focus on analyzing and visualizing the data from various social media services. We will first use the data collected before from YouTube to do various statistics analyses such as correlation and regression. We will then introduce R - a platform for doing statistical analysis. Using R, then we will analyze a much larger dataset obtained from Yelp. Make sure you have covered the material in the previous units before proceeding with this. That means, having all the tools (Anaconda, Python, and R) as well as various packages installed. We will also need new packages this time, so make sure you know how to install them to your Python or R. If needed, please review some basic concepts in statistics - specifically, correlation and regression - before or during working on this unit....
Reading
4 个视频 (总计 87 分钟), 8 个阅读材料, 2 个测验
Video4 个视频
Video-2: Analyzing Social Media Data Using Python26分钟
Video-3: Introduction to R26分钟
Video-4: Social Media Data Analysis with R32分钟
Reading8 个阅读材料
Script: twitter_process.py0
Data: iqsize.csv0
R Installation Guide10分钟
Installing R Packages5分钟
Statistical Analysis with R10分钟
Read this first2分钟
Scripts for converting json to csv2分钟
Data Visualization with ggplot2 (R) - Cheat Sheet10分钟
Quiz1 个练习
Statistical Analysis with Twitter Data6分钟
4
完成时间(小时)
完成时间为 3 小时

Case Studies

In the final unit of this course, we will work on two case studies - both using Twitter and focusing on unstructured data (in this case, text). The first case study will involve doing sentiment analysis with Python. The second case study will take us through basic text mining application using R. We wrap up the unit with a conclusion of what we did in this course and where to go next for further learning and exploration....
Reading
4 个视频 (总计 47 分钟), 4 个阅读材料, 2 个测验
Video4 个视频
Video-2: Sentiment Analysis with Twitter Data21分钟
Video-3: Text Mining of Twitter Data15分钟
Video-4: Conclusion6分钟
Reading4 个阅读材料
Script: twitter_sentiments.py0
NLTK10分钟
Script: text_mining_twitter.r0
An Introduction to Network Analysis with R and statnet10分钟
Quiz1 个练习
Sentiment Analysis with Twitter6分钟

讲师

Avatar

Chirag Shah

Associate Professor
Information and Computer Science

关于 Rutgers the State University of New Jersey

常见问题

  • 注册以便获得证书后,您将有权访问所有视频、测验和编程作业(如果适用)。只有在您的班次开课之后,才可以提交和审阅同学互评作业。如果您选择在不购买的情况下浏览课程,可能无法访问某些作业。

  • 您购买证书后,将有权访问所有课程材料,包括评分作业。完成课程后,您的电子课程证书将添加到您的成就页中,您可以通过该页打印您的课程证书或将其添加到您的领英档案中。如果您只想阅读和查看课程内容,可以免费旁听课程。

还有其他问题吗?请访问 学生帮助中心