Quantitative Text Analysis and Measures of Readability in R

提供方
Coursera Project Network
在此指导项目中,您将:

Estimate the readability of a text document or corpus of documents.

Plot the variation in readability levels in a text corpus over time.

Clock1 hour
Beginner初级
Cloud无需下载
Video分屏视频
Comment Dots英语(English)
Laptop仅限桌面

By the end of this project, you will be able to load textual data into R and turn it into a corpus object. You will also understand the concept of measures of readability in textual analysis. You will know how to estimate the level of readability of a text document or corpus of documents using a number of different readability metrics and how to plot the variation in readability levels in a text document corpus over time at the document and paragraph level. This project is aimed at beginners who have a basic familiarity with the statistical programming language R and the RStudio environment, or people with a small amount of experience who would like to learn how to measure the readability of textual data.

您要培养的技能

  • Text Analysis
  • Data Wrangling
  • Data Visualization (DataViz)
  • Text Corpus
  • Readability

分步进行学习

在与您的工作区一起在分屏中播放的视频中,您的授课教师将指导您完成每个步骤:

  1. Load textual data into R and turn it into a corpus object. You will also understand the concept of measures of readability in textual analysis.

  2. Estimate the level of readability of a text document or corpus of documents using a number of different readability metrics

  3. Prepare the textual data for plotting by extracting key information from text document filenames and combining these with readability data in a dataframe.

  4. Plot the variation in readability levels in a text document corpus over time.

  5. Reshape the data to paragraph level and plot the distribution of readability over time by paragraph.

指导项目工作原理

您的工作空间就是浏览器中的云桌面,无需下载

在分屏视频中,您的授课教师会为您提供分步指导

常见问题

常见问题

还有其他问题吗?请访问 学生帮助中心