Active Learning for Quality Engineering

This repo contains the code for replicating the results from the paper "Active Learning for Industrial Applications" published in Quality Engineering.

The images used in this paper are from the MVTec datasets and they can be requested here:

Active Learning for Industrial Applications

This repository contains the code for replicating results from the paper Active Learning for Industrial Applications, published in Quality Engineering. The project explores active learning strategies to improve data efficiency in real-world industrial applications, reducing labeling costs while maintaining high model performance.

📌 Overview

Supervised learning models often require large labeled datasets, which can be costly and time-consuming to obtain. Active learning addresses this by selecting the most informative samples for annotation, improving model efficiency with fewer labeled examples. Here we show how to apply active learning methods to industrial quality control and fault detection tasks, demonstrating how active learning improves labeling efficiency in manufacturing settings.

🛠 Setup Instructions

Clone the repository:

git clone https://github.com/yourusername/active-learning-industrial.git
cd active-learning-industrial

Install dependencies:
```
pip install -r requirements.txt
```
Run the example notebook:
```
jupyter notebook active_learning.ipynb
```

📂 Code Structure

requirements.txt: file that contains a list of all the packages installed in the environment used, along with their version numbers.
feature_extraction.ipynb: notebook showing how to obtain preprocessed features from the images using a pre-trained ResNet-18.
sampling_strategies.py: functions for performing random sampling and margin sampling.
active_learning.ipynb: notebook showing how the implementation of the active learning strategy.

📊 Visualizing Active Learning

Below is an illustration of clustering-based sampling in action:

The black points indicate samples with the most representative data points of each clusters (subpopulation), which are selected for annotation in active learning.

Comparison of Active Learning Strategies

From the paper, we analyze different active learning strategies in an industrial setting:

📜 Citation

If you use this code, please cite:

Cacciarelli, D., & Kulahci, M. (2025). Active Learning for Industrial Applications. Quality Engineering.

Name		Name	Last commit message	Last commit date
Latest commit History 25 Commits
LICENSE		LICENSE
README.md		README.md
active_learning.ipynb		active_learning.ipynb
clusters.png		clusters.png
feature_extraction.ipynb		feature_extraction.ipynb
parts.png		parts.png
preprocessing.py		preprocessing.py
requirements.txt		requirements.txt
results.png		results.png
sampling_strategies.py		sampling_strategies.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Active Learning for Quality Engineering

Active Learning for Industrial Applications

📌 Overview

🛠 Setup Instructions

📂 Code Structure

📊 Visualizing Active Learning

Comparison of Active Learning Strategies

📜 Citation

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

dcacciarelli/active-learning-quality-engineering

Folders and files

Latest commit

History

Repository files navigation

Active Learning for Quality Engineering

Active Learning for Industrial Applications

📌 Overview

🛠 Setup Instructions

📂 Code Structure

📊 Visualizing Active Learning

Comparison of Active Learning Strategies

📜 Citation

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages