Repositories
Showing 10 of 34 repositories
- llm-compressor Public
Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
- semantic-router Public
System Level Intelligent Router for Mixture-of-Models at Cloud, Data Center and Edge