Hi,

This week you'll learn about Multi-Column Table OCR.

Image

Since its introduction, Optical Character Recognition (OCR) has taken the world by storm. OCR is presently employed in a variety of real-world applications, ranging from scanning license plates to automating checkbook transactions.

One of the main use cases of text recognition is to read tabular data from images. It's easy to read text data from images, but to read it in the exact orientation it is present in is a challenging problem.

Today, we'll learn to work with Multiple Column Table Optical Character Recognition

The big picture: We organize our world through structured tabular data. Our eyes can easily detect those from images. But making a computer detect tabular-oriented data will also involve estimating the region of interest (ROI) and keeping the orientation intact.

How it works: Using OpenCV's vast array of tools, we'll first estimate the general region of the table in the input image. After that, we will use PyTesseract to extract the text from the ROI and finally apply an algorithm known as Hierarchical Agglomerative Clustering (HAC) to get our output in its original intended form. 

Our thoughts: The HAC algorithm is the integral puzzle piece that connects text recognition with traditional structural data. 

Yes, but: The HAC algorithm isn't foolproof since it doesn't learn from data. 

Stay smart: Do not stop here. There can be multiple solutions to a single problem. We recommend you go through computer vision and deep learning topics and craft a solution to solve tabular data text detection.

Click here to read the full tutorial

PyImageSearch University

This lesson is part of PyImageSearch University, our flagship program to help you master computer vision, deep learning, and OpenCV.  PyImageSearch University is updated each week with new lessons.

Don't spend your time trying to piece together code when you could get pristine code right now.  Want the full, working code of every blog post plus a Colab Notebook and video walkthrough of the code?

I want the code

Having problems with your local development environment or IDE?  Fortunately, our pre-configured Colab Notebooks allow you to run code the moment you join PyImageSearch University.  But, of course, you don't want to be a sys-admin, so don't waste time messing with your development environment.

Want to Master Computer Vision and Deep Learning?

Do you think mastering computer vision and deep learning has to be time-consuming, overwhelming, and complicated? Or has to involve complex mathematics and equations? Or requires a degree in computer science?

That's not the case. All you need to master computer vision and deep learning is for someone to explain things to you in simple, intuitive terms. And that's exactly what we do. Our mission is to change education and how complex Artificial Intelligence topics are taught.

Inside PyImageSearch University, you'll find:

  • Code that works for almost any OpenCV or TensorFlow CV problem
  • 35+ courses on the hottest computer vision, deep learning, and OpenCV topics
  • 35+ Certificates of Completion (one for each course)
  • 39+ hours of on-demand video
  • Pre-configured Jupyter Notebooks running in Google Colab
  • Run all code examples in your web browser — works on Windows, macOS, and Linux (no dev environment configuration required!)
  • Access to centralized code repos for all 500+ tutorials on the PyImageSearch blog
  • Easy one-click downloads for code, datasets, pre-trained models, etc.
  • Access on mobile, laptop, desktop, etc.
  • New courses released regularly and new tutorials weekly, ensuring you can keep up with state-of-the-art techniques

Click here to join PyImageSearch University


The PyImageSearch Team