Hello trend,
This is Satya Mallick from LearnOpenCV.com.
A good data scientist is a good data janitor.
Without a dataset you have prepared with love and care, your machine learning model is a GIGO system - Garbage In, Garbage Out.
For a vast majority of problems, there is nothing more important than the quality of your data.
In today's video, we will learn how to systematically analyze a dataset BEFORE you start training.
If you are interested in object detection, the following material will help you get started.
- What is YOLOv5 Object Detector [Video]
- How to prepare Datasets for training YOLOv5 Object Detection [Video]
- Object Detection using YOLOv5 and OpenCV DNN in C++ and Python [Code]
- Train an Object Detection using YOLOv5 on Custom Dataset [Code]
An Expert at Work
When a model accuracy is bad, a novice tries a new architecture while an expert inspects the data.
Here is a true story.
A few years back, Davis King released a new face recognition model for Dlib.
His goal was not to build the best face recognition model but to show an application of deep metric learning that he had recently implemented in Dlib.
So, he used a popular model called Resnet as the backbone. Nothing fancy.
But his model's performance on standard benchmarks was comparable to other state-of-the-art models!
How?
Because the amount of effort he put into cleaning his dataset. Here is an excerpt from his blog post.
In contrast, he spent zero effort in coming up with a new neural network architecture.
A good data scientist is a good data janitor.
Cheers!
Satya
Courses / YouTube / Facebook / LinkedIn / Twitter / Instagram
No comments:
Post a Comment
Note: Only a member of this blog may post a comment.