Real-time OCR and Text Detection with Tensorflow, OpenCV and Tesseract

48 个评分
Coursera Project Network
1,655 人已注册

Train Tensorflow to recognize a Region of Interest (ROI) in an image or frame of a video.

Extract and enhance relevant image segments with OpenCV .

Use Tesseract to extract, export text data for use in real-time.

Clock2 hours
Comment Dots英语(English)

In this 1-hour long project-based course, you will learn how to collect and label images and use them to train a Tensorflow CNN (convolutional neural network) model to recognize relevant areas of (typeface) text in any image, video frame or frame from webcam video. You will learn how to extract image segments that your detector has identified as containing text and enhance them using various image filters from the OpenCV module. Then you will learn how to pass the result image to Google's open-source OCR (Optical Character Recognition) software using the pytesseract python library and read the text to whatever form of output you like. All of this will be done on Windows, but can be accomplished with very little alteration on Linux as well. We will be using the IDLE development environment to write a single script to scan our video, webcam input, or array of images for text and read that text into our output. Tensorflow, the Tensorflow Object Detection API, Tesseract, the pytesseract library, labelImg for image annotation, OpenCV, and all other required software has already been installed for you in your Rhyme desktop. Note: This course works best for learners who are based in the North America region. We’re currently working on providing the same experience in other regions.


TensorflowDeep Learning in PythonObject DetectionOptical Character RecognitionComputer Vision



  1. Set up a new Real Time Text Detection script

  2. Collect and Label Images for recognition of Region of Interest (ROI)

  3. Train Tensorflow to recognize Region of Interest (ROI)

  4. Capture webcam video stream, frames from a video file, or a static image

  5. Extract and enhance relevant image segments with OpenCV

  6. Use Tesseract to extract, export text data for use






  • 购买指导项目后,您将获得完成指导项目所需的一切,包括通过 Web 浏览器访问云桌面工作空间,工作空间中包含您需要了解的文件和软件,以及特定领域的专家提供的分步视频说明。

  • 由于您的工作空间包含适合笔记本电脑或台式计算机使用的云桌面,因此指导项目不在移动设备上提供。

  • 指导项目讲师是特定领域的专家,他们在项目的技能、工具或领域方面经验丰富,并且热衷于分享自己的知识以影响全球数百万的学生。

  • 您可以从指导项目中下载并保留您创建的任何文件。为此,您可以在访问云桌面时使用‘文件浏览器’功能。

  • 指导项目不符合退款条件。请查看我们完整的退款政策

  • 指导项目不提供助学金。

  • 指导项目不支持旁听。

  • 您可在页面顶部点按此指导项目的经验级别,查看任何知识先决条件。对于指导项目的每个级别,您的讲师会逐步为您提供指导。

  • 是,您可以在浏览器的云桌面中获得完成指导项目所需的一切。

  • 您可以直接在浏览器中于分屏环境下完成任务,以此从做中学。在屏幕的左侧,您将在工作空间中完成任务。在屏幕的右侧,您将看到有讲师逐步指导您完成项目。

还有其他问题吗?请访问 学生帮助中心