DeepVision — Face + Voice Authentication Suite

DeepVision is a finished, end‑to‑end biometric authentication project that combines face recognition, voice verification, and real‑time attention monitoring into a single interactive system. The goal is to provide a practical, demo‑ready pipeline that can be run locally or backed by a cloud database.

Highlights

- Multi‑modal authentication: face recognition + voice verification for stronger identity checks.
- Real‑time monitoring: eye aspect ratio (EAR) and head‑pitch cues for attention/drowsiness signals.
- Gradio UI: a ready‑to‑run interface for demos and validation.
- Two storage backends:
  - Local SQLite for offline/demo usage.
  - Azure SQL for cloud deployments.
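For readers unfamiliar with the eye aspect ratio mentioned under real-time monitoring, the sketch below shows the commonly used six-landmark formula, EAR = (|p2 - p6| + |p3 - p5|) / (2 |p1 - p4|). It is an illustration only: the landmark ordering, the `0.2` threshold, and the 15-frame window are assumptions and are not taken from this repository's code. Head-pitch estimation is not covered here.

```python
import math

def eye_aspect_ratio(eye):
    """Compute EAR from six (x, y) landmarks ordered p1..p6.

    p1 and p4 are assumed to be the horizontal eye corners;
    p2/p6 and p3/p5 are the upper/lower eyelid pairs.
    """
    p1, p2, p3, p4, p5, p6 = eye
    vertical = math.dist(p2, p6) + math.dist(p3, p5)
    horizontal = math.dist(p1, p4)
    return vertical / (2.0 * horizontal)

def is_drowsy(ear_values, threshold=0.2, min_frames=15):
    """Return True if EAR stays below `threshold` for `min_frames` consecutive frames.

    Both defaults are hypothetical and would need tuning per camera and user.
    """
    run = 0
    for ear in ear_values:
        run = run + 1 if ear < threshold else 0
        if run >= min_frames:
            return True
    return False

# Synthetic landmarks: a wide-open eye and a nearly closed one.
open_eye = [(0, 0), (1, 1), (2, 1), (3, 0), (2, -1), (1, -1)]
closed_eye = [(0, 0), (1, 0.1), (2, 0.1), (3, 0), (2, -0.1), (1, -0.1)]
print(eye_aspect_ratio(open_eye), eye_aspect_ratio(closed_eye))
```

Requiring several consecutive low-EAR frames, rather than a single one, keeps ordinary blinks from being flagged as drowsiness.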

Project structure

.
├── mygradio2.py           # Main application (SQLite-backed)
├── mygradio.py            # Azure SQL variant (pyodbc)
├── Models/                # Voice verification models (.h5)
├── persons/               # Reference face images (name-based)
├── voices/                # Reference voice samples (per-person folders)
├── Database/              # SQLite database (faces.db)
├── test.py                # Voice model evaluation/threshold script
└── requirements*.txt      # Python dependencies

How the system works (at a glance)

- Face detection + embedding: MTCNN detects faces and FaceNet generates embeddings.
- Face match: embeddings are compared against the database via cosine similarity.
- Voice verification: a Siamese model produces voice embeddings for similarity checks.
- Attention monitoring: eye aspect ratio and head pitch provide basic drowsiness cues.
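The face-match step can be pictured with a small, dependency-free sketch: a query embedding is scored against every stored embedding with cosine similarity, and the best score is accepted only above a cutoff. The function names, the in-memory `known` dictionary, and the `0.7` threshold are all hypothetical; the application itself reads embeddings from its database and may use a different cutoff. The same comparison would apply to voice embeddings from the Siamese model.

```python
import math

def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)

def best_match(query, known, threshold=0.7):
    """Return (name, score) for the most similar stored embedding.

    The name is None when the best score falls below `threshold`
    (an assumed value, not the project's actual setting).
    """
    best_name, best_score = None, -1.0
    for name, embedding in known.items():
        score = cosine_similarity(query, embedding)
        if score > best_score:
            best_name, best_score = name, score
    if best_score < threshold:
        return None, best_score
    return best_name, best_score

# Toy 3-dimensional embeddings; real FaceNet embeddings are much longer.
known = {"alice": [1.0, 0.0, 0.0], "bob": [0.0, 1.0, 0.0]}
print(best_match([0.9, 0.1, 0.0], known))  # close to "alice"
print(best_match([0.0, 0.0, 1.0], known))  # orthogonal to both, so no match
```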

Setup (local, finished workflow)

Install dependencies
```
pip install -r requirements.txt
```
Add reference data
- Put face images in persons/ named like person_name.jpg.
- Put voice samples in voices/person_name/*.wav.
Run the main app
```
python mygradio2.py
```

This launches the full Gradio UI and automatically seeds the local database (Database/faces.db).
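To make the expected data layout concrete, here is a rough sketch of what a seeding pass over `persons/` and `voices/` could look like. The `people` table, its columns, and the `seed_database` function are hypothetical; the real schema in `mygradio2.py` is not shown in this README and presumably stores embeddings rather than file paths.

```python
import sqlite3
from pathlib import Path

def seed_database(db_path, persons_dir="persons", voices_dir="voices"):
    """Register each persons/<name>.jpg and count its voices/<name>/*.wav samples.

    Illustrative only: the table layout below is an assumption.
    """
    rows = []
    for image in sorted(Path(persons_dir).glob("*.jpg")):
        name = image.stem  # the file name (minus extension) is the person's name
        voice_dir = Path(voices_dir) / name
        samples = len(list(voice_dir.glob("*.wav"))) if voice_dir.is_dir() else 0
        rows.append((name, str(image), samples))

    conn = sqlite3.connect(db_path)
    try:
        with conn:  # commits on success, rolls back on error
            conn.execute(
                "CREATE TABLE IF NOT EXISTS people ("
                "name TEXT PRIMARY KEY, "
                "face_path TEXT NOT NULL, "
                "voice_samples INTEGER NOT NULL)"
            )
            conn.executemany(
                "INSERT OR REPLACE INTO people VALUES (?, ?, ?)", rows
            )
    finally:
        conn.close()
    return rows
```

Under these assumptions, calling `seed_database("Database/faces.db")` from the repository root would list every person with a face image and show which ones are still missing voice samples (a count of 0).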

Azure SQL deployment

Use mygradio.py when you want a cloud-backed deployment:

python mygradio.py

Note: mygradio.py connects through pyodbc, so you need valid Azure SQL credentials and an ODBC driver for SQL Server installed on the machine that runs it.
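One way to keep credentials out of the source is to assemble the ODBC connection string from environment variables. The sketch below is an assumption about how that could be done, not a description of what `mygradio.py` does: the `AZURE_SQL_*` variable names and the `build_connection_string` helper are hypothetical, and the default driver name should be replaced with whichever SQL Server ODBC driver is installed.

```python
import os

def build_connection_string(env=os.environ):
    """Assemble an ODBC connection string for Azure SQL from environment variables.

    The variable names are hypothetical. Values containing ';' or '}' would
    need extra escaping that this sketch does not perform.
    """
    required = [
        "AZURE_SQL_SERVER",
        "AZURE_SQL_DATABASE",
        "AZURE_SQL_USER",
        "AZURE_SQL_PASSWORD",
    ]
    missing = [key for key in required if not env.get(key)]
    if missing:
        raise RuntimeError("Missing Azure SQL settings: " + ", ".join(missing))

    driver = env.get("AZURE_SQL_DRIVER", "ODBC Driver 18 for SQL Server")
    return (
        f"DRIVER={{{driver}}};"
        f"SERVER=tcp:{env['AZURE_SQL_SERVER']},1433;"
        f"DATABASE={env['AZURE_SQL_DATABASE']};"
        f"UID={env['AZURE_SQL_USER']};"
        f"PWD={env['AZURE_SQL_PASSWORD']};"
        "Encrypt=yes;TrustServerCertificate=no;Connection Timeout=30;"
    )

# With pyodbc installed, the string would be passed straight to connect():
#   import pyodbc
#   conn = pyodbc.connect(build_connection_string())
```

Failing early with the list of missing settings tends to be easier to debug than a driver-level login error.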

Model notes

The system expects a trained voice model at:
```
Models/Voice_verification_model5.h5
```
Additional model versions are stored in Models/ for experimentation.
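Because the application depends on one specific file, a quick preflight check can save a confusing startup failure. The helper below is a hypothetical sketch (`check_voice_model` does not exist in the repository): it confirms the expected model is present and, if not, reports which `.h5` files it did find in `Models/`.

```python
from pathlib import Path

# Path stated in this README, relative to the repository root.
EXPECTED_MODEL = Path("Models") / "Voice_verification_model5.h5"

def check_voice_model(root="."):
    """Return the expected model path, or raise listing the .h5 files that are present."""
    root = Path(root)
    expected = root / EXPECTED_MODEL
    if expected.is_file():
        return expected

    models_dir = root / "Models"
    available = (
        sorted(p.name for p in models_dir.glob("*.h5")) if models_dir.is_dir() else []
    )
    raise FileNotFoundError(
        f"{expected} not found; .h5 files in Models/: {available or 'none'}"
    )
```

To experiment with another version, one option is to copy or rename it to the expected file name; whether the application exposes a setting for the model path is not stated here.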

Evaluation & validation

Use test.py to evaluate the voice model, compute a similarity threshold, and export misclassifications:

python test.py
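As a rough illustration of what "compute a similarity threshold" can mean, the sketch below sweeps candidate cutoffs over a set of labelled similarity scores, keeps the one with the highest accuracy, and writes the misclassified pairs to a CSV file. It does not reproduce `test.py`: the function names, the accuracy criterion, the CSV columns, and the toy scores are all assumptions.

```python
import csv

def best_threshold(scores, labels):
    """Pick the cutoff on similarity scores that maximizes accuracy.

    labels: 1 for same-speaker pairs, 0 for different-speaker pairs.
    A pair is predicted "same" when score >= threshold.
    """
    best_t, best_acc = None, -1.0
    for t in sorted(set(scores)):  # every observed score is a candidate cutoff
        correct = sum((s >= t) == bool(y) for s, y in zip(scores, labels))
        acc = correct / len(scores)
        if acc > best_acc:
            best_t, best_acc = t, acc
    return best_t, best_acc

def export_misclassified(path, pairs, scores, labels, threshold):
    """Write every pair whose prediction disagrees with its label; return the count."""
    wrong = [
        (pair, s, y)
        for pair, s, y in zip(pairs, scores, labels)
        if (s >= threshold) != bool(y)
    ]
    with open(path, "w", newline="") as f:
        writer = csv.writer(f)
        writer.writerow(["pair", "score", "label"])
        writer.writerows(wrong)
    return len(wrong)

# Made-up scores for six voice pairs.
scores = [0.91, 0.85, 0.40, 0.30, 0.75, 0.60]
labels = [1, 1, 0, 0, 1, 0]
threshold, accuracy = best_threshold(scores, labels)
print(threshold, accuracy)  # 0.75 1.0
```

Accuracy is only one possible criterion; for authentication it may be preferable to pick the threshold from a target false-accept rate instead.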

Status

✅ Complete — This repository represents a finished, working project with a runnable UI, local/cloud database support, and evaluation utilities.

If you are new to the code, start with mygradio2.py to see the full, working pipeline from capture → recognition → verification → monitoring.
