
MLOps Project Template

A comprehensive template for building production-ready MLOps projects with modern best practices, containerization, and cloud deployment.

Overview

This template provides a complete structure for building end-to-end machine learning operations (MLOps) projects, featuring:

  • Modern Python tooling with uv for fast dependency management
  • Microservices architecture with FastAPI and containerized services
  • Database integrations with PostgreSQL and ChromaDB vector database
  • Cloud-native deployment to Google Cloud Platform (GCP) with Kubernetes
  • Infrastructure as Code using Pulumi
  • CI/CD pipeline with GitHub Actions
  • Best practices for secrets management, testing, and code quality

Architecture

┌─────────────────────────────────────────────────────────────┐
│                         Frontend                             │
│                    (React Application)                       │
└────────────────────────┬────────────────────────────────────┘
                         │
                         ▼
┌─────────────────────────────────────────────────────────────┐
│                      API Service                             │
│            (FastAPI with Modular Routers)                    │
│  ┌──────────┐  ┌──────────┐  ┌──────────┐                  │
│  │   Data   │  │   RAG    │  │  Model   │                  │
│  │  Router  │  │  Router  │  │  Router  │                  │
│  └──────────┘  └──────────┘  └──────────┘                  │
└───┬─────────────────┬─────────────────┬────────────────────┘
    │                 │                 │
    ▼                 ▼                 ▼
┌────────────┐  ┌────────────┐  ┌────────────┐
│ PostgreSQL │  │  ChromaDB  │  │    GCS     │
│     DB     │  │ Vector DB  │  │  Buckets   │
└────────────┘  └────────────┘  └────────────┘
    │                                   │
    ▼                                   ▼
┌─────────────────────────────────────────────┐
│         Standalone Services                 │
│  ┌──────────────┐  ┌──────────────┐        │
│  │     Data     │  │     Data     │        │
│  │  Collector   │  │  Processor   │        │
│  └──────────────┘  └──────────────┘        │
│  ┌──────────────┐  ┌──────────────┐        │
│  │    Model     │  │    Model     │        │
│  │   Training   │  │    Deploy    │        │
│  └──────────────┘  └──────────────┘        │
│  ┌──────────────┐                          │
│  │ ML Workflow  │                          │
│  │ (Vertex AI)  │                          │
│  └──────────────┘                          │
└─────────────────────────────────────────────┘

Project Structure

.
├── .github/
│   └── workflows/
│       └── ci.yml                    # GitHub Actions CI/CD pipeline
├── src/
│   ├── api-service/                  # Main FastAPI application
│   │   ├── routers/                  # API route handlers
│   │   │   ├── data.py              # Data operations endpoints
│   │   │   ├── rag.py               # RAG and vector search
│   │   │   └── model.py             # Model inference endpoints
│   │   ├── services/                # Business logic layer
│   │   ├── models/                  # Pydantic models
│   │   ├── migrations/              # Database migrations
│   │   ├── tests/                   # Unit tests
│   │   ├── main.py                  # FastAPI app entry point
│   │   ├── Dockerfile               # Container configuration
│   │   └── pyproject.toml           # Dependencies
│   ├── data-collector/              # Data ingestion service
│   ├── data-processor/              # Data preprocessing service
│   ├── model-training/              # ML training service
│   ├── model-deploy/                # Model serving service
│   ├── ml-workflow/                 # Vertex AI orchestration
│   └── frontend-react/              # React frontend
├── infrastructure/                   # Pulumi IaC for GCP
│   ├── __main__.py                  # Infrastructure definition
│   ├── Pulumi.yaml                  # Pulumi configuration
│   └── requirements.txt             # Pulumi dependencies
├── tests/                           # Integration tests
├── docker-compose.yml               # Local development (PostgreSQL, ChromaDB, API)
├── .env.example                     # Environment variables template
├── .gitignore                       # Git ignore rules
├── pyproject.toml                   # Root Python configuration
├── Makefile                         # Helper commands
└── README.md                        # This file

Quick Start

Prerequisites

  • Python 3.11+
  • uv - Fast Python package installer
  • Docker & Docker Compose - For containerization
  • Node.js 20+ - For React frontend (optional)
  • Google Cloud SDK - For GCP deployment (optional)
  • Pulumi CLI - For infrastructure management (optional)

1. Clone and Setup

# Clone the repository
git clone <your-repo-url>
cd project-setup-template

# Copy environment variables template
cp .env.example .env

# Edit .env with your configuration
nano .env

2. Install Dependencies

# Install Python dependencies with uv
make install

# Or manually
uv sync

3. Start Local Development Environment

# Start PostgreSQL, ChromaDB, and API service
make up

# Verify services are running
make ps

This will start:

  • PostgreSQL on port 5432
  • ChromaDB on port 8001
  • API Service on port 8000

Access the API:

  • API: http://localhost:8000
  • Interactive docs (Swagger UI): http://localhost:8000/docs

4. Run Tests

# From project root
make test

# With coverage
make test-cov

Development Workflow

Local Development

  1. Start all services:

    make up

    This starts PostgreSQL, ChromaDB, and the API service.

  2. View logs:

    make logs              # All services
    make logs-postgres     # PostgreSQL only
    make logs-chroma       # ChromaDB only
  3. Run tests:

    make test
  4. Code formatting:

    make format
    make lint

Building Docker Images

# Build API service
make build-api

# Build all services
make build-all

Services

API Service

FastAPI-based REST API serving as the main HTTP gateway.

  • Routers: Modular endpoints for data, RAG, and models
  • Port: 8000
  • Documentation: /docs (Swagger UI)
  • See: src/api-service/README.md
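
A minimal sketch of how the modular-router layout can look (module names follow the project structure above; the app and endpoint details are illustrative, not the template's actual code):

# src/api-service/main.py -- illustrative sketch only
from fastapi import FastAPI

from routers import data, rag, model

app = FastAPI(title="MLOps API Service")

# Mount one router per domain so each area stays independently testable.
app.include_router(data.router, prefix="/data", tags=["data"])
app.include_router(rag.router, prefix="/rag", tags=["rag"])
app.include_router(model.router, prefix="/model", tags=["model"])

@app.get("/health")
def health() -> dict:
    """Liveness probe for Docker/Kubernetes health checks."""
    return {"status": "ok"}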

Data Collector

Standalone service for collecting data from various sources.

Data Processor

Data preprocessing and feature engineering service.

Model Training

ML model training service with support for multiple frameworks.

Model Deploy

Model serving for inference.

ML Workflow

Pipeline orchestration using Vertex AI Pipelines.
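
A minimal sketch of defining and submitting a pipeline with the Kubeflow Pipelines (kfp) SDK and google-cloud-aiplatform, assuming that is the tooling used here; the component logic, project ID, region, and bucket paths are placeholders:

# Illustrative sketch: component logic and GCP identifiers are placeholders.
from kfp import dsl, compiler
from google.cloud import aiplatform

@dsl.component(base_image="python:3.11")
def train_model(epochs: int) -> str:
    # Real training logic would live here; return a model artifact URI.
    return f"gs://your-bucket/models/model-{epochs}"

@dsl.pipeline(name="example-training-pipeline")
def training_pipeline(epochs: int = 5):
    train_model(epochs=epochs)

if __name__ == "__main__":
    compiler.Compiler().compile(training_pipeline, "pipeline.yaml")
    aiplatform.init(project="YOUR_PROJECT_ID", location="us-central1")
    job = aiplatform.PipelineJob(
        display_name="example-training-pipeline",
        template_path="pipeline.yaml",
        pipeline_root="gs://your-bucket/pipeline-root",
    )
    job.run()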

Frontend React

React-based UI for interacting with the ML system.

Databases

PostgreSQL

Relational database for structured data.

  • Port: 5432
  • Default credentials: See .env.example
  • Migrations: Use Alembic in api-service/migrations/
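
A minimal sketch of connecting from Python, assuming SQLAlchemy is used on top of DATABASE_URL; the pool settings are illustrative:

# Illustrative sketch: assumes SQLAlchemy; DATABASE_URL comes from your .env file.
import os

from sqlalchemy import create_engine, text

engine = create_engine(
    os.environ["DATABASE_URL"],   # e.g. postgresql+psycopg2://user:pass@localhost:5432/app
    pool_size=5,                  # connection pooling (see Best Practices below)
    max_overflow=10,
    pool_pre_ping=True,           # drop dead connections before reuse
)

with engine.connect() as conn:
    print(conn.execute(text("SELECT 1")).scalar())

Schema changes themselves go through Alembic (alembic revision --autogenerate, then alembic upgrade head).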

ChromaDB

Vector database for embeddings and similarity search.
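
A minimal sketch using the chromadb Python client against the local container on port 8001; the collection name and documents are illustrative:

# Illustrative sketch: connects to the ChromaDB container started by `make up`.
import chromadb

client = chromadb.HttpClient(host="localhost", port=8001)

# Create (or fetch) a collection and add a few documents with ids.
collection = client.get_or_create_collection(name="example_docs")
collection.add(
    ids=["doc-1", "doc-2"],
    documents=["MLOps templates speed up project setup.", "ChromaDB stores embeddings."],
)

# Similarity search over the stored documents.
results = collection.query(query_texts=["What stores embeddings?"], n_results=1)
print(results["documents"])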

Deployment

Google Cloud Platform (GCP)

This template is designed for deployment to GCP using:

  • GKE (Google Kubernetes Engine) for container orchestration
  • GCR (Google Container Registry) for Docker images
  • GCS (Google Cloud Storage) for data and models
  • Vertex AI for ML workflows
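
For the GCS piece above, a minimal sketch of uploading a trained model artifact with the google-cloud-storage client; the project ID, bucket, and object names are placeholders:

# Illustrative sketch: bucket and object names are placeholders.
from google.cloud import storage

client = storage.Client(project="YOUR_PROJECT_ID")
bucket = client.bucket("your-models-bucket")
blob = bucket.blob("models/latest/model.pkl")
blob.upload_from_filename("model.pkl")
print(f"Uploaded to gs://{bucket.name}/{blob.name}")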

Infrastructure Setup

  1. Install Pulumi:

    curl -fsSL https://get.pulumi.com | sh
  2. Configure GCP:

    gcloud auth login
    gcloud config set project YOUR_PROJECT_ID
  3. Deploy infrastructure:

    cd infrastructure
    pip install -r requirements.txt
    pulumi stack init dev
    pulumi config set gcp:project YOUR_PROJECT_ID
    pulumi up

See infrastructure/README.md for detailed instructions.
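
As a rough sketch of what infrastructure/__main__.py can contain (assuming the pulumi and pulumi_gcp packages; resource names, region, and sizing are placeholders, not the template's actual definitions):

# Illustrative Pulumi sketch: names, region, and cluster sizing are placeholders.
import pulumi
import pulumi_gcp as gcp

# GCS bucket for datasets and model artifacts.
artifacts = gcp.storage.Bucket("ml-artifacts", location="US")

# Small GKE cluster for the containerized services.
cluster = gcp.container.Cluster(
    "mlops-cluster",
    location="us-central1",
    initial_node_count=2,
)

pulumi.export("bucket_name", artifacts.name)
pulumi.export("cluster_endpoint", cluster.endpoint)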

CI/CD Pipeline

GitHub Actions workflow (.github/workflows/ci.yml) handles:

  1. Linting: Ruff and Black checks
  2. Testing: Pytest with coverage
  3. Building: Docker image builds
  4. Deployment: Push to GCR (when configured)

Required Secrets:

  • GCP_PROJECT_ID: Your GCP project ID
  • GCP_SA_KEY: Service account key JSON

Environment Variables

All configuration is managed through environment variables. Copy .env.example to .env and fill in your values:

cp .env.example .env

Key variables:

  • DATABASE_URL: PostgreSQL connection string
  • CHROMA_HOST: ChromaDB host
  • GCP_PROJECT_ID: Google Cloud project
  • GOOGLE_APPLICATION_CREDENTIALS: Path to service account key

Important: Never commit .env or service account keys to Git!
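
A minimal sketch of reading these variables at startup with the standard library; the defaults are for local development only:

# Illustrative sketch: reads the variables listed above.
import os

DATABASE_URL = os.environ["DATABASE_URL"]                       # fail fast if missing
CHROMA_HOST = os.getenv("CHROMA_HOST", "localhost")
GCP_PROJECT_ID = os.getenv("GCP_PROJECT_ID")
CREDENTIALS_PATH = os.getenv("GOOGLE_APPLICATION_CREDENTIALS")  # SA key path, kept outside the repo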

Secrets Management

Local Development

  • Use .env file (gitignored)
  • Place GCP service account keys outside this repository

Production

  • Use Docker secrets (see docker-compose.prod.yml)
  • Use GCP Secret Manager
  • Use Kubernetes Secrets

Testing

Unit Tests

Located in each service's tests/ directory:

cd src/api-service
uv run pytest
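
A minimal sketch of what a unit test in src/api-service/tests/ can look like, assuming the app exposes a /health endpoint and that main:app is importable from the test environment (both are assumptions, not guaranteed by the template):

# src/api-service/tests/test_health.py -- illustrative; endpoint and import path are assumptions.
from fastapi.testclient import TestClient

from main import app

client = TestClient(app)

def test_health_returns_ok():
    response = client.get("/health")
    assert response.status_code == 200
    assert response.json() == {"status": "ok"}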

Integration Tests

Located in tests/ at project root:

pytest tests/

Coverage

make test-cov
open htmlcov/index.html

Code Quality

Linting

make lint          # Check for issues
make lint-fix      # Auto-fix issues

Formatting

make format        # Format code
make format-check  # Check formatting

Common Commands

The Makefile provides helpful shortcuts:

make help          # Show all available commands
make install       # Install dependencies
make up            # Start local services (PostgreSQL, ChromaDB, API)
make down          # Stop local services
make logs          # View logs
make test          # Run tests
make lint          # Lint code
make format        # Format code
make clean         # Clean temporary files
make build-all     # Build all Docker images

Best Practices

Code Organization

  • One service per directory in src/
  • Each service is independently containerized
  • Shared code goes in common packages

API Design

  • Use modular routers for different domains
  • Implement proper error handling
  • Document all endpoints with docstrings
  • Use Pydantic models for validation
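
A minimal sketch combining the last three points: a router endpoint with Pydantic request/response models, a docstring, and structured error handling (the endpoint and models are illustrative):

# Illustrative sketch: request/response models validated by Pydantic.
from fastapi import APIRouter, HTTPException
from pydantic import BaseModel, Field

router = APIRouter()

class PredictionRequest(BaseModel):
    """Request body for a model inference call."""
    features: list[float] = Field(..., min_length=1)
    model_version: str = "latest"

class PredictionResponse(BaseModel):
    prediction: float
    model_version: str

@router.post("/predict", response_model=PredictionResponse)
def predict(request: PredictionRequest) -> PredictionResponse:
    """Validate input, run inference, and return a structured error on failure."""
    try:
        score = sum(request.features) / len(request.features)  # placeholder "model"
        return PredictionResponse(prediction=score, model_version=request.model_version)
    except Exception as exc:
        raise HTTPException(status_code=500, detail=str(exc)) from exc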

Database

  • Use migrations for schema changes
  • Never commit database credentials
  • Use connection pooling for performance

Security

  • Use environment variables for secrets
  • Never commit .env or credentials
  • Use non-root users in Docker containers
  • Enable HTTPS in production
  • Implement authentication/authorization

Docker

  • Multi-stage builds for smaller images
  • Use .dockerignore to exclude files
  • Non-root user for security
  • Health checks for monitoring

Testing

  • Write tests for all critical functionality
  • Use fixtures for common setup
  • Mock external services
  • Aim for high coverage
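
A minimal sketch of a fixture plus a mocked external service; the search helper is hypothetical and shown only to illustrate the pattern:

# Illustrative sketch: a pytest fixture standing in for an external service.
from unittest.mock import MagicMock

import pytest

@pytest.fixture
def fake_vector_db():
    """Stand-in for the ChromaDB client so tests never hit the real service."""
    client = MagicMock()
    client.query.return_value = {"documents": [["mocked result"]]}
    return client

def search(client, question: str) -> str:
    # Hypothetical helper under test: returns the top document for a question.
    return client.query(query_texts=[question], n_results=1)["documents"][0][0]

def test_search_uses_mocked_client(fake_vector_db):
    assert search(fake_vector_db, "anything") == "mocked result"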

Troubleshooting

Database Connection Issues

# Check if databases are running
make ps

# View logs
make logs-postgres
make logs-chroma

# Restart services
make restart

Docker Build Failures

# Clean Docker cache
docker system prune -a

# Rebuild without cache
docker build --no-cache -t service-name .

Import Errors

# Reinstall dependencies
make clean
make install

Contributing

Adding a New Service

  1. Create service directory in src/
  2. Add Dockerfile and pyproject.toml
  3. Update docker-compose.yml if needed
  4. Add tests in service tests/ directory
  5. Document in service README
  6. Update CI/CD workflow

Making Changes

  1. Create a feature branch
  2. Make your changes
  3. Run tests: make test
  4. Format code: make format
  5. Lint code: make lint
  6. Commit with descriptive message
  7. Push and create pull request

Resources

Documentation

  • FastAPI: https://fastapi.tiangolo.com
  • ChromaDB: https://docs.trychroma.com
  • Pulumi: https://www.pulumi.com/docs/
  • Vertex AI: https://cloud.google.com/vertex-ai/docs

Tools

  • uv: https://docs.astral.sh/uv/
  • Docker: https://docs.docker.com
  • Google Cloud SDK: https://cloud.google.com/sdk/docs

License

[Add your license here]

Support

For questions or issues:

  1. Check the service-specific READMEs
  2. Review the troubleshooting section
  3. Open an issue on GitHub
  4. Contact the maintainers

Happy Building! 🚀
