vLLM

vllm Public
A high-throughput and memory-efficient inference and serving engine for LLMs

vllm-project/vllm’s past year of commit activity

Python 52,128 Apache-2.0 8,670 1,808 (10 issues need help) 772 Updated Jul 13, 2025
vllm-ascend Public
Community maintained hardware plugin for vLLM on Ascend

vllm-project/vllm-ascend’s past year of commit activity

Python 868 Apache-2.0 251 206 (6 issues need help) 112 Updated Jul 13, 2025
aibrix Public
Cost-efficient and pluggable Infrastructure components for GenAI inference

vllm-project/aibrix’s past year of commit activity

Go 3,917 Apache-2.0 394 195 (20 issues need help) 16 Updated Jul 13, 2025
llm-compressor Public
Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

vllm-project/llm-compressor’s past year of commit activity

Python 1,627 Apache-2.0 174 27 (5 issues need help) 31 Updated Jul 13, 2025
vllm-gaudi Public

vllm-project/vllm-gaudi’s past year of commit activity

Python 4 1 0 0 Updated Jul 12, 2025
ci-infra Public
This repo hosts code for vLLM CI & Performance Benchmark infrastructure.

vllm-project/ci-infra’s past year of commit activity

HCL 14 29 0 8 Updated Jul 12, 2025
vllm-spyre Public
Community maintained hardware plugin for vLLM on Spyre

vllm-project/vllm-spyre’s past year of commit activity

Python 30 Apache-2.0 18 11 (1 issue needs help) 17 Updated Jul 11, 2025
guidellm Public
Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs

vllm-project/guidellm’s past year of commit activity

Python 398 Apache-2.0 53 46 (4 issues need help) 10 Updated Jul 10, 2025
production-stack Public
vLLM’s reference system for K8S-native cluster-wide deployment with community-driven performance optimization

vllm-project/production-stack’s past year of commit activity

Python 1,487 Apache-2.0 225 54 (3 issues need help) 41 Updated Jul 10, 2025
flash-attention Public Forked from Dao-AILab/flash-attention
Fast and memory-efficient exact attention

vllm-project/flash-attention’s past year of commit activity

Python 80 BSD-3-Clause 1,817 0 12 Updated Jul 10, 2025

View all repositories

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Pinned Loading

Repositories

Uh oh!

People

Sponsors

Top languages

Most used topics