Creating a Wordcloud using NLP and TF-IDF in Python

提供方
在此指导项目中,您将:

Learn how to clean a dataset by removing encodings and unwanted words/characters

Learn how to lemmatize a text and fit a TF-IDF model

Learn how to create a wordcloud using TF-IDF scores

1.5 hours
初级
无需下载
分屏视频
英语(English)
仅限桌面

By the end of this project, you will learn how to create a professional looking wordcloud from a text dataset in Python. You will use an open source dataset containing Christmas recipes and will create a wordcloud of the most important ingredients used in these recipes. I will teach you how load a JSON dataset, clean the dataset by removing encodings and unwanted characters, and lemmatize your dataset. I will also teach you how to calculate TF-IDF weights of words in your dataset and use these weights to create a wordcloud. You will create a ready-to-use Jupyter notebook for creating a wordcloud on any text dataset. Lemmatization is a process of removing inflectional endings only and to return the base or dictionary form of a word, which is known as the lemma. TF-IDF stands for term frequency-inverse document frequency. TF-IDF gives a weight to each word which tells how important that term is. Using both lemmatization and TF-IDF, one can find the important words in the text dataset and use these important words to create the wordcloud. For example, these datasets could be customer complaints and the business can focus on the important issues that the customers are facing. Wordcloud is a powerful resource which can be used in reports and presentations. Note: This course works best for learners who are based in the North America region. We’re currently working on providing the same experience in other regions.

您要培养的技能

  • Natural Language Toolkit (NLTK)

  • Python Programming

  • Term Frequency Inverse Document Frequency (TF-IDF)

  • Wordnet

分步进行学习

在与您的工作区一起在分屏中播放的视频中,您的授课教师将指导您完成每个步骤:

  1. Load a JSON dataset in Python

  2. Clean the dataset

  3. Remove encodings

  4. Lemmatize the text

  5. Fit TF-IDF model

  6. Create a Wordcloud

指导项目工作原理

您的工作空间就是浏览器中的云桌面,无需下载

在分屏视频中,您的授课教师会为您提供分步指导

常见问题

购买指导项目后,您将获得完成指导项目所需的一切,包括通过 Web 浏览器访问云桌面工作空间,工作空间中包含您需要了解的文件和软件,以及特定领域的专家提供的分步视频说明。

由于您的工作空间包含适合笔记本电脑或台式计算机使用的云桌面,因此指导项目不在移动设备上提供。

指导项目授课教师是特定领域的专家,他们在项目的技能、工具或领域方面经验丰富,并且热衷于分享自己的知识以影响全球数百万的学生。

您可以从指导项目中下载并保留您创建的任何文件。为此,您可以在访问云桌面时使用‘文件浏览器’功能。

指导项目不符合退款条件。请查看我们完整的退款政策

指导项目不提供助学金。

指导项目不支持旁听。

您可在页面顶部点按此指导项目的经验级别,查看任何知识先决条件。对于指导项目的每个级别,您的授课教师会逐步为您提供指导。

是,您可以在浏览器的云桌面中获得完成指导项目所需的一切。

您可以直接在浏览器中于分屏环境下完成任务,以此从做中学。在屏幕的左侧,您将在工作空间中完成任务。在屏幕的右侧,您将看到有授课教师逐步指导您完成项目。