yyihuang

Follow

eigen yyihuang

Follow

GPU architect.

10 followers · 21 following

CMU, Pittsburgh
22:56 (UTC -04:00)

Pinned Loading

sglang sglang Public

Forked from sgl-project/sglang

SGLang is a fast serving framework for large language models and vision language models.

Python 1
flashinfer flashinfer Public

Forked from flashinfer-ai/flashinfer

FlashInfer: Kernel Library for LLM Serving

Cuda 1
flexflow/flexflow-serve flexflow/flexflow-serve Public

FlexFlow Serve: Low-Latency, High-Performance LLM Serving

C++ 34 4