DocuLens: A Document Scanner

A simple document scanning pipeline implemented in C++ with OpenCV. The program detects the largest 4-point contour in each frame of a video or webcam feed, applies a perspective transform, and shows a flattened, top-down “scan” of the document.

Steps

Preprocessing (grayscale -> blur -> Canny -> dilate -> erode)
Contour detection and 4-point polygon approximation
Automatic ordering of detected points
Perspective warp to get a top-down scan
2×2 stacked debug view (original, threshold, contour view, warped result)
Live processing from video file or webcam

Requirements

C++17
OpenCV 4.x
CMake >= 3.10

Install OpenCV (on Linux):

sudo apt install libopencv-dev

Build

Run build.sh to build the project.

Run

./build/document_scanner ./assets/testvideo.mp4

Output Windows

The program displays two windows:

Work Flow (2×2 grid)

Original frame
Thresholded frame
Contours
Warped (scanned) document

Result

Clean final warp of the detected document

Press q to quit.

Note

No ML, just pure classical OpenCV contour detection. Hence, works best with well-lit videos where the document edge is clear.

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
assets		assets
include		include
legacy		legacy
src		src
.gitignore		.gitignore
CMakeLists.txt		CMakeLists.txt
README.md		README.md
build.sh		build.sh
compile_commands.json		compile_commands.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

DocuLens: A Document Scanner

Steps

Requirements

Build

Run

Output Windows

About

Uh oh!

Releases

Packages

Languages

maskedsyntax/doculens

Folders and files

Latest commit

History

Repository files navigation

DocuLens: A Document Scanner

Steps

Requirements

Build

Run

Output Windows

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages