Distributed Rate Limiter

A distributed rate limiter built with Go and Redis, implementing the token bucket algorithm with multiple strategies for coordinating rate limits across service instances.

Quick Start

git clone https://github.com/brutally-Honest/distributed-rate-limiter.git
cd distributed-rate-limiter
docker-compose up --build --scale 'go=3' # Tune based on requirements 
curl -v http://localhost/api

Overview

When running multiple instances of a service, each instance needs to share rate limit state to prevent a single user from bypassing limits by hitting different servers. This project solves that by using Redis as shared storage, with three token bucket implementations that trade off atomicity and performance.

Features

Distributed State Management: Redis-backed coordination across multiple service instances
Multiple Token Bucket Implementations: Hash-based, transaction-based, and Lua script-based strategies
Atomic Operations: Lua and Redis Transaction strategies eliminate race conditions
Extensible Architecture: Factory pattern enables pluggable rate limiting strategies
Clean Separation of Concerns: Interface-based design with dependency injection
Structured Logging: Instance-aware logging for distributed debugging
Environment-Driven Configuration: Validation and type-safe configuration management

Architecture

Distributed Coordination

flowchart TB
subgraph clients["Multiple Clients"]
C1["Client A - 192.168.1.100"]
C2["Client B - 192.168.1.101"]
C3["Client A - 192.168.1.100"]
end
subgraph lb["Load Balancer"]
LB["Distribution Layer"]
end
subgraph instances["Service Instances"]
I1["Instance 1 - Port 8080"]
I2["Instance 2 - Port 8081"]
I3["Instance 3 - Port 8082"]
end
subgraph redis["Redis - Shared State"]
RD["Token Buckets"]
K1["ratelimit:192.168.1.100 | tokens: 95 | last_refill: 1699123456"]
K2["ratelimit:192.168.1.101 | tokens: 100 | last_refill: 1699123450"]
end
RD --- K1 & K2
C1 --> LB
C2 --> LB
C3 --> LB
LB --> I1 & I2 & I3
I1 <-- Read/Write --> RD
I2 <-- Read/Write --> RD
I3 <-- Read/Write --> RD
style C1 fill:#3b3b3b,stroke:#555,color:#f2f2f2
style C2 fill:#3b3b3b,stroke:#555,color:#f2f2f2
style C3 fill:#3b3b3b,stroke:#555,color:#f2f2f2
style LB fill:#4b4b4b,stroke:#666,color:#f2f2f2
style I1 fill:#4b4b4b,stroke:#666,color:#f2f2f2
style I2 fill:#4b4b4b,stroke:#666,color:#f2f2f2
style I3 fill:#4b4b4b,stroke:#666,color:#f2f2f2
style RD fill:#5b5b5b,stroke:#777,color:#f2f2f2
style K1 fill:#5b5b5b,stroke:#777,color:#f2f2f2
style K2 fill:#5b5b5b,stroke:#777,color:#f2f2f2
style clients fill:#2c2c2c,stroke:#444,stroke-width:2px,color:#f2f2f2
style lb fill:#2c2c2c,stroke:#444,stroke-width:2px,color:#f2f2f2
style instances fill:#2c2c2c,stroke:#444,stroke-width:2px,color:#f2f2f2
style redis fill:#2c2c2c,stroke:#444,stroke-width:2px,color:#f2f2f2
linkStyle 0 stroke:#aaa,stroke-width:2px,color:#aaa,fill:none
linkStyle 1 stroke:#aaa,stroke-width:2px,color:#aaa,fill:none
linkStyle 2 stroke:#aaa,stroke-width:2px,color:#aaa,fill:none
linkStyle 3 stroke:#aaa,stroke-width:2px,color:#aaa,fill:none
linkStyle 4 stroke:#aaa,stroke-width:2px,color:#aaa,fill:none
linkStyle 5 stroke:#aaa,stroke-width:2px,color:#aaa,fill:none

How It Works

Request arrives → Middleware extracts client identifier (IP address)
Strategy executes → Selected implementation (Hash/Transaction/Lua) checks Redis
Token bucket logic → Read current tokens, calculate refill, check availability, update state
Decision → Allow request (200 + headers) or reject (429 rate limited)

Project Structure

cmd/server/           # Application entry point
├── main.go          # Bootstrap and dependency injection

internal/
├── config/          # Configuration management
├── server/          # HTTP server setup and routing
├── middlewares/     # HTTP middleware chain
├── ratelimiter/     # Rate limiting abstractions
│   ├── limiter.go   # RateLimiter interface
│   └── redis/       # Redis-based implementations
│       ├── factory.go           # Rate limiter factory
│       └── tokenbucket/         # Token bucket implementations
│           ├── config.go        # Token bucket configuration
│           ├── hash.go          # Hash-based (has race conditions)
│           ├── transaction.go   # Transaction-based (atomic)
│           ├── lua.go           # Lua script-based (recommended)
│           └── README.md        # Implementation comparison
├── redis/           # Redis client wrapper
└── http/            # HTTP handlers

Implementation Highlights

Design Patterns

Factory Pattern: Strategy-based rate limiter instantiation with configuration-driven selection
Dependency Injection: Constructor injection throughout, enabling testability and loose coupling
Middleware Chain: Composable HTTP middleware for cross-cutting concerns
Adapter Pattern: Redis client abstraction isolating external dependencies

Idiomatic Go:

internal/ package for encapsulation
Error wrapping with context preservation
Context-aware operations throughout
Proper resource lifecycle management

Redis Optimization:

Connection pooling with configurable parameters
Single atomic operation per rate limit check (Lua strategy)
Hash-based storage minimizing network round-trips

Minimal Dependencies:

Single external dependency: github.com/redis/go-redis/v9
Standard library for core functionality

API Endpoints

1. Rate-Limit Endpoint

GET /api

Success Response (200):

{
  "msg": "Successfully Hit",
  "time": "2025-11-09T11:37:54+05:30",
  "instanceId": "46950-059eff"
}

Rate Limited Response (429):

{
  "error": "Rate limit exceeded"
}

Response Headers:

X-RateLimit-Remaining: Tokens remaining in bucket

2. Health Check Endpoint

GET /health

Success Response (200):

{
  "status": "healthy",
  "timestamp": "2025-11-09T11:37:54+05:30",
  "uptime": "44.398377667s",
  "instanceId": "46950-059eff",
  "services": {
    "redis": {
      "status": "connected",
      "latency": "1.2ms"
    }
  }
}

Failed Response (503):

{
  "status": "unhealthy",
  "timestamp": "2025-11-09T11:37:54+05:30", 
  "uptime": "44.398377667s",
  "instanceId": "46950-059eff",
  "services": {
    "redis": {
      "status": "disconnected",
      "latency": ""
    }
  }
}

Rate Limiting Strategies

Token Bucket Implementations

The system provides three Redis-based token bucket implementations with different trade-offs:

Strategy	Atomicity	Performance
Lua Script	Atomic	Highest
Transaction	Atomic	Medium
Hash-based	Non-atomic	High

Lua Script Strategy (Recommended):

Single atomic Redis operation
Zero race conditions
Precise refill calculations
Minimal network overhead

See Token Bucket Implementation Details for comprehensive comparison.

Configuration

Environment-based configuration with validation:

# Server
PORT=8080

# Redis
REDIS_ADDR=localhost:6379
REDIS_PASSWORD=
REDIS_DB=0
REDIS_POOL_SIZE=10

# Rate Limiting
LIMITER_STRATEGY=tokenbucket-lua  # tokenbucket-lua | tokenbucket-transaction | tokenbucket-hash
LIMITER_CAPACITY=100              # Max tokens in bucket
LIMITER_REFILL_RATE=10            # Tokens added per second

Strategy Selection:

tokenbucket-lua: Atomic, highest performance (recommended)
tokenbucket-transaction: Atomic with WATCH/MULTI/EXEC
tokenbucket-hash: Non-atomic (development only)

The system is designed for easy extension with new rate limiting strategies

Running

Local Development

# Start Redis
docker run -d -p 6379:6379 redis:alpine

# Run single instance
export PORT=1783

# Desired Redis configuration
export REDIS_ADDR=localhost:6379
export REDIS_PASSWORD=" "

# Desired strategy configuration 
export LIMITER_STRATEGY=tokenbucket-lua  
export LIMITER_CAPACITY=50               
export LIMITER_REFILL_RATE=10

go run cmd/server/main.go

Multiple Instances with Docker Compose

# Start 3 instances + Redis
docker-compose up --build --scale go=3

See Load Test Observation Details for comprehensive testing results.

Future Enhancements

Observability: Prometheus metrics, OpenTelemetry tracing, structured logging with trace IDs
Resilience: Circuit breaker for Redis failures, retry logic with exponential backoff
Testing: Unit, integration tests with race condition validation
Enhanced Features: Additional rate limit headers (X-RateLimit-Reset), hot config reload, multi-tier limits (user/IP/endpoint-based)

Name		Name	Last commit message	Last commit date
Latest commit History 40 Commits
cmd/server		cmd/server
internal		internal
tests/load		tests/load
.gitignore		.gitignore
README.md		README.md
docker-compose.yaml		docker-compose.yaml
dockerfile		dockerfile
go.mod		go.mod
go.sum		go.sum
nginx.conf		nginx.conf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Distributed Rate Limiter

Quick Start

Overview

Features

Architecture

Distributed Coordination

How It Works

Project Structure

Implementation Highlights

API Endpoints

1. Rate-Limit Endpoint

2. Health Check Endpoint

Rate Limiting Strategies

Token Bucket Implementations

Configuration

Running

Local Development

Multiple Instances with Docker Compose

Future Enhancements

About

Uh oh!

Releases

Packages

Languages

brutally-Honest/Distributed-Rate-Limiter

Folders and files

Latest commit

History

Repository files navigation

Distributed Rate Limiter

Quick Start

Overview

Features

Architecture

Distributed Coordination

How It Works

Project Structure

Implementation Highlights

API Endpoints

1. Rate-Limit Endpoint

2. Health Check Endpoint

Rate Limiting Strategies

Token Bucket Implementations

Configuration

Running

Local Development

Multiple Instances with Docker Compose

Future Enhancements

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages