Stars
vLLM’s reference system for K8S-native cluster-wide deployment with community-driven performance optimization
FastAPI, SQLAlchemy, aio_pika, RabbitMQ and Docker boilerplate
CloudNativePG is a comprehensive platform designed to seamlessly manage PostgreSQL databases within Kubernetes environments, covering the entire operational lifecycle from initial deployment to ong…
Asynchronous MinIO Client SDK for Python
Docker image based on alpine with minio client and crond
A high-throughput and memory-efficient inference and serving engine for LLMs
Large World Model -- Modeling Text and Video with Millions Context
🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with OpenTelemetry, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23
Large Language Model Text Generation Inference
a small scripting language for the web
A real world full-stack application using LlamaIndex
This is a fork/refactoring of the ajmyyra/ambassador-auth-oidc project
Provider agnostic OAuth2 Authorization Code flow with PKCE for React
Pytorch implementations of various Deep Reinforcement Learning algorithms on pybullet environments.
Observe FastAPI app with three pillars of observability: Traces (Tempo), Metrics (Prometheus), Logs (Loki) on Grafana through OpenTelemetry and OpenMetrics.
Python program that creates mosaic images from a main photo comprised of many small images
Instrument your FastAPI with Prometheus metrics.
Modularized Implementation of Deep RL Algorithms in PyTorch