Getting Coordinates and Cropping an Image with OpenCV

By Firhan Maulana Rusli | June 6, 2020 | December 31, 2023

OpenCV is a popular open source library for computer vision and machine learning. It was built to provide a common infrastructure for computer vision applications and to accelerate the use of machine perception in commercial products.

OpenCV uses machine learning to detect objects and faces in a picture. To detect a face, the algorithm scans the picture starting from the top left and moving to the bottom right.
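To make the scanning order concrete, here is a minimal sketch of a sliding window in plain Python. It is not OpenCV's actual implementation; the function name `sliding_windows` and the image, window, and step sizes are assumptions chosen only for illustration.

```python
def sliding_windows(img_w, img_h, win_w, win_h, step):
    """Yield the (x, y) top-left corner of each window position.

    Positions are produced row by row, starting at the top left of the
    image and ending at the bottom right. All names are hypothetical.
    """
    for y in range(0, img_h - win_h + 1, step):
        for x in range(0, img_w - win_w + 1, step):
            yield (x, y)


# Hypothetical sizes: a 100x80 image scanned with a 40x40 window.
positions = list(sliding_windows(img_w=100, img_h=80, win_w=40, win_h=40, step=20))
print(positions[0], positions[-1], len(positions))
```

A real detector evaluates the classifier at each of these positions and repeats the scan at several scales.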

Why do we use OpenCV to get the coordinates?

Just as there are many roads to Rome, there are many ways to get the coordinates. We could use contour detection or perhaps R-CNN. For this case, however, we will use OpenCV and Haar cascades.

First of all, we need to download the Haar cascade XML file. Here is the link:

https://github.com/shantnu/FaceDetect/blob/master/haarcascade_frontalface_default.xml

After that, we need to import some libraries:

import cv2
import matplotlib.pyplot as plt
from PIL import Image
%matplotlib inline

Then set the path to the XML file:

cascPath = "haarcascade_frontalface_default.xml"

Then create the Haar cascade classifier:

faceCascade = cv2.CascadeClassifier(cascPath)

After that, load the image that we want to use:

path = "/content/ktp3.png"
image = cv2.imread(path)
image_crop = Image.open(path)
gray = cv2.cvtColor(image, cv2.COLOR_BGR2GRAY)

After that, detect the face in the image:

faces = faceCascade.detectMultiScale(
    gray,
    scaleFactor=1.1,
    minNeighbors=2,
    minSize=(40, 60),
    flags = cv2.CASCADE_SCALE_IMAGE
)

print("Found {0} faces!".format(len(faces)))

Here is an explanation of the parameters:

Parameters: cascade – Haar classifier cascade (OpenCV 1.x API only). It can be loaded from XML or YAML file using Load(). When the cascade is not needed anymore, release it using cvReleaseHaarClassifierCascade(&cascade).
image – Matrix of the type CV_8U containing an image where objects are detected.
objects – Vector of rectangles where each rectangle contains the detected object.
scaleFactor – Parameter specifying how much the image size is reduced at each image scale.
minNeighbors – Parameter specifying how many neighbors each candidate rectangle should have to retain it.
flags – Parameter with the same meaning for an old cascade as in the function cvHaarDetectObjects. It is not used for a new cascade.
minSize – Minimum possible object size. Objects smaller than that are ignored.
maxSize – Maximum possible object size. Objects larger than that are ignored.
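As a rough illustration of how `scaleFactor`, `minSize`, and `maxSize` interact, the sketch below lists the window sizes a multi-scale search would visit. It is a simplification and not OpenCV's code: the function name `window_sizes`, the base window of 24 pixels, and the size limits are all assumptions made for this example.

```python
def window_sizes(base=24, scale_factor=1.1, min_size=40, max_size=200):
    """Return the square window sizes a multi-scale search would try.

    The window grows by scale_factor at each step; sizes outside
    [min_size, max_size] are skipped. All values are hypothetical.
    """
    sizes = []
    size = float(base)
    while size <= max_size:
        if size >= min_size:
            sizes.append(round(size))
        size *= scale_factor
    return sizes


fine = window_sizes(scale_factor=1.1)    # small steps, many scales
coarse = window_sizes(scale_factor=1.5)  # large steps, few scales
print(len(fine), len(coarse))
```

A `scaleFactor` closer to 1 means more scales are checked, which tends to find more faces but takes longer; a larger value is faster but can skip over the scale at which a face would have matched.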


Then draw a rectangle around each detected face and display the image:

for (x, y, w, h) in faces:
    cv2.rectangle(image, (x, y), (x+w, y+h), (0, 255, 0), 2)

# OpenCV stores pixels as BGR, while matplotlib expects RGB
plt.imshow(cv2.cvtColor(image, cv2.COLOR_BGR2RGB))

The result will look like this:

To crop the image, we just need the coordinates returned by the detector:

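# x, y, w, h still hold the last face from the loop above.
# PIL's crop() takes a (left, upper, right, lower) box.
im_crop = image_crop.crop((x, y, x + w, y + h))
plt.imshow(im_crop)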
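When several faces are detected, or none at all, relying on the loop variables can be fragile. The following is a minimal sketch, in plain Python, of one way to pick a face and build a crop box that stays inside the image. The helper names `largest_face` and `crop_box`, the `margin` parameter, and the sample rectangles are hypothetical and not part of OpenCV or PIL.

```python
def largest_face(faces):
    """Return the (x, y, w, h) rectangle with the biggest area, or None."""
    faces = list(faces)  # also accepts the array returned by the detector
    if not faces:
        return None
    return max(faces, key=lambda f: f[2] * f[3])


def crop_box(face, img_w, img_h, margin=0):
    """Convert (x, y, w, h) to a (left, upper, right, lower) box.

    An optional margin is added on every side, and the box is clamped
    so that it never extends past the image borders.
    """
    x, y, w, h = (int(v) for v in face)
    left = max(x - margin, 0)
    upper = max(y - margin, 0)
    right = min(x + w + margin, img_w)
    lower = min(y + h + margin, img_h)
    return (left, upper, right, lower)


# Hypothetical detections on a 300x200 image.
sample_faces = [(30, 40, 50, 50), (200, 10, 120, 120)]
face = largest_face(sample_faces)
box = crop_box(face, img_w=300, img_h=200, margin=20)
print(box)
```

The resulting tuple can be passed directly to `image_crop.crop(box)`, since PIL expects the box in (left, upper, right, lower) order.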

The result would be like this:

Here are some sources that might help:

Firhan Maulana Rusli

LinkedIn: www.linkedin.com/in/firhan-rusli
