Hello trend,
This is Satya Mallick from LearnOpenCV.com.
We have a new video for our OCR Nerds!
Today, We will explore the new state-of-the-art model where you'll learn:
✅ What is the architecture of TrOCR?
✅ What models does the TrOCR family include?
✅ How were the TrOCR models pretrained?
✅ How to run inference using TrOCR and Hugging Face?
So without further ado, let's jump into the tutorial
TrOCR – Getting Started with Transformer Based OCR |
Accompanying code for the blog post can be found here:
Download Code |
If you want to learn more about Transformers, Text Detection, and OCR, I'm sharing a list of resources that might be helpful for you:
- Optical Character Recognition using PaddleOCR
- Automatic License Plate Recognition using Deep Learning
- Understanding the Attention Mechanism in Transformers
- Implementing Vision Transformers in PyTorch
Build Computer Vision & Deep Learning Applications
Do you want to build similar systems and applications using advanced Computer Vision and Deep Learning techniques, and understand deployment using cloud-based services?
Check out the CVDL Applications course at OpenCV University.
The courses are closed for enrollment currently. I would urge you to sign up for the waitlist so that you stand a chance to enroll in the next cohort. Generally, you also get a 20% discount when you sign up for the waitlist.
Apply for Enrollment |
Cheers,
Satya
Courses / YouTube / Facebook / LinkedIn / Twitter / Instagram
No comments:
Post a Comment
Note: Only a member of this blog may post a comment.