Monday, February 28, 2022

New tutorial: “Multi-Column Table OCR”

Hi,

This week you'll learn about Multi-Column Table OCR.

Since its introduction, Optical Character Recognition (OCR) has taken the world by storm. OCR is presently employed in a variety of real-world applications, ranging from scanning license plates to automating checkbook transactions.

One of the main use cases of text recognition is to read tabular data from images. It's easy to read text data from images, but to read it in the exact orientation it is present in is a challenging problem.

Today, we'll learn to work with Multiple Column Table Optical Character Recognition.

The big picture: We organize our world through structured tabular data. Our eyes can easily detect those from images. But making a computer detect tabular-oriented data will also involve estimating the region of interest (ROI) and keeping the orientation intact.

How it works: Using OpenCV's vast array of tools, we'll first estimate the general region of the table in the input image. After that, we will use PyTesseract to extract the text from the ROI and finally apply an algorithm known as Hierarchical Agglomerative Clustering (HAC) to get our output in its original intended form.

Our thoughts: The HAC algorithm is the integral puzzle piece that connects text recognition with traditional structural data.

Yes, but: The HAC algorithm isn't foolproof since it doesn't learn from data.

Stay smart: Do not stop here. There can be multiple solutions to a single problem. We recommend you go through computer vision and deep learning topics and craft a solution to solve tabular data text detection.

Click here to read the full tutorial

PyImageSearch University

This lesson is part of PyImageSearch University, our flagship program to help you master computer vision, deep learning, and OpenCV. PyImageSearch University is updated each week with new lessons.

Don't spend your time trying to piece together code when you could get pristine code right now. Want the full, working code of every blog post plus a Colab Notebook and video walkthrough of the code?

I want the code

Having problems with your local development environment or IDE? Fortunately, our pre-configured Colab Notebooks allow you to run code the moment you join PyImageSearch University. But, of course, you don't want to be a sys-admin, so don't waste time messing with your development environment.

Want to Master Computer Vision and Deep Learning?

Do you think mastering computer vision and deep learning has to be time-consuming, overwhelming, and complicated? Or has to involve complex mathematics and equations? Or requires a degree in computer science?

That's not the case. All you need to master computer vision and deep learning is for someone to explain things to you in simple, intuitive terms. And that's exactly what we do. Our mission is to change education and how complex Artificial Intelligence topics are taught.

Inside PyImageSearch University, you'll find:

Code that works for almost any OpenCV or TensorFlow CV problem
35+ courses on the hottest computer vision, deep learning, and OpenCV topics
35+ Certificates of Completion (one for each course)
39+ hours of on-demand video
Pre-configured Jupyter Notebooks running in Google Colab
Run all code examples in your web browser — works on Windows, macOS, and Linux (no dev environment configuration required!)
Access to centralized code repos for all 500+ tutorials on the PyImageSearch blog
Easy one-click downloads for code, datasets, pre-trained models, etc.
Access on mobile, laptop, desktop, etc.
New courses released regularly and new tutorials weekly, ensuring you can keep up with state-of-the-art techniques

Click here to join PyImageSearch University

The PyImageSearch Team

To ensure you keep getting these emails, please add adrian@pyimagesearch.com to your address book or whitelist us. Want out of the loop? Unsubscribe.

Our postal address: 317 Commercial St NE, Ste A, PMB 530, Albuquerque, NM 87102

Trendlist

Monday, February 28, 2022

New tutorial: “Multi-Column Table OCR”

PyImageSearch University

Want to Master Computer Vision and Deep Learning?

No comments:

Post a Comment

Generate a catchy title for a collection of newfangled music by making it your own

Report Abuse

Labels