Document Scanner From Photo

Created a basic document scanner using OpenCV that detects a 4-sided document in an image and straightens it out for a full frontal view.

I created a basic document scanner using OpenCV that detects a 4-sided document in an image and straightens it out for a full frontal view. This is based on the excellent How to Build a Kick-Ass Mobile Document Scanner in Just 5 Minutes tutorial from pyimagesearch.

The basic steps are:

load the image and resize it (apparently the edge detection does not work well on large images)

detect edges

find all contours and return the top 5 largest of these contours
convert the contours to polygon approximations (e.g. if the contours are a list of many points tracing out a n-sided polygon, then just turn it into n points that represent a similar approximated polygon – this is done using the Ramer-Douglas-Peucker algorithm)
store the 4 points of the largest contour that can be approximated as a 4-sided polygon

calculate the width and height of the output image based on the 4-sided polygon
perspective warp image by corner-pinning the 4 points on the polygon to 4 corners of output image – this will straighten out the sides to a frontal view
apply thresholding for a photocopy look

Seems like this method works well only with text documents on white paper. I tried a bunch of other examples that are more complicated and the script did not manage to pick out the boundaries.

About Me

KS Lee

I am an independent and self-driven engineer with a strong background in Computer Science and a flair for artistic design, enabling me to create visually-appealing assets and improve the aesthetics and usability of applications. My interests lie in all things visual and interactive, driving my passion for merging technology with artistry.

This website is a collection of AI-related stuff that I have worked on: deep learning, machine learning, reinforcement learning, computer vision and many others. It also includes classical AI-like image & video processing techniques that I have implemented.

This is still a work-in-progress, please bear with me as I post my stuff one by one…

Content-Aware Image Resizing Using Seam Carving

Auto-Generation of Looping Cinemagraphs