Introduction to Text Classification in R with quanteda

提供方
在此指导项目中,您将:

Import text documents, reshape texts from documents to paragraphs, and turn your texts into a machine readable format.

Classify presidential concession speeches by political party using a Naive Bayes algorithm and assess the accuracy of the predictions.   

2 hours
初级
无需下载
分屏视频
英语(English)
仅限桌面

In this guided project you will learn how to import textual data stored in raw text files into R, turn these files into a corpus (a collection of textual documents), reshape them into paragraphs from documents and tokenize the text all using the R software package quanteda. You will then learn how to classify the texts using the Naive Bayes algorithm. This guided project is for beginners interested in quantitative text analysis in R. It assumes no knowledge of textual analysis and focuses on exploring textual data (US Presidential Concession Speeches). Users should have a basic understanding of the statistical programming language R.

您要培养的技能

  • Ordered Pair

  • Text Analysis

  • Algorithms

  • Statistical Programming Languages

  • Computer Programming

分步进行学习

在与您的工作区一起在分屏中播放的视频中,您的授课教师将指导您完成每个步骤:

  1. Load text documents into R studio, convert a number of text documents into a corpus, and extract data from text document file names and add them to a new column in a dataframe. 

  2. Reshape the dataset into paragraphs from documents and check for balance in your labels. 

  3. Split up a text document corpus into tokens, or individual words and punctuations. Then clean the data by removing specific words and spellings.

  4. Create a Document Feature Matrix, divide it into train and test sets and run a Naive Bayes model. Then examine the model’s prediction accuracy and learn about accuracy, precision, and recall.   

  5. Run Naive Bayes models for a second and third time. Then examine the models' predictions and compare the model outputs with results from the previous task.

指导项目工作原理

您的工作空间就是浏览器中的云桌面,无需下载

在分屏视频中,您的授课教师会为您提供分步指导

常见问题

购买指导项目后,您将获得完成指导项目所需的一切,包括通过 Web 浏览器访问云桌面工作空间,工作空间中包含您需要了解的文件和软件,以及特定领域的专家提供的分步视频说明。

由于您的工作空间包含适合笔记本电脑或台式计算机使用的云桌面,因此指导项目不在移动设备上提供。

指导项目授课教师是特定领域的专家,他们在项目的技能、工具或领域方面经验丰富,并且热衷于分享自己的知识以影响全球数百万的学生。

您可以从指导项目中下载并保留您创建的任何文件。为此,您可以在访问云桌面时使用‘文件浏览器’功能。

指导项目不符合退款条件。请查看我们完整的退款政策

指导项目不提供助学金。

指导项目不支持旁听。

您可在页面顶部点按此指导项目的经验级别,查看任何知识先决条件。对于指导项目的每个级别,您的授课教师会逐步为您提供指导。

是,您可以在浏览器的云桌面中获得完成指导项目所需的一切。

您可以直接在浏览器中于分屏环境下完成任务,以此从做中学。在屏幕的左侧,您将在工作空间中完成任务。在屏幕的右侧,您将看到有授课教师逐步指导您完成项目。