Skip to content
@neuralmagic

Neural Magic

Neural Magic (Acquired by Red Hat) empowers developers to optimize & deploy LLMs at scale. Our model compression & acceleration enable top performance with vLLM

Pinned Loading

  1. nm-vllm-certs nm-vllm-certs Public

    General Information, model certifications, and benchmarks for nm-vllm enterprise distributions

    12 2

  2. deepsparse deepsparse Public archive

    Sparsity-aware deep learning inference runtime for CPUs

    Python 3.2k 188

  3. sparseml sparseml Public archive

    Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models

    Python 2.1k 158

  4. docs docs Public archive

    Top-level directory for documentation and general content

    MDX 122 7

  5. sparsezoo sparsezoo Public archive

    Neural network model repository for highly sparse and sparse-quantized models with matching sparsification recipes

    Python 392 29

  6. guidellm guidellm Public

    Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs

    Python 380 50

Repositories

Showing 10 of 74 repositories
  • research Public

    Repository to enable research flows

    neuralmagic/research’s past year of commit activity
    Python 1 0 0 1 Updated Jul 1, 2025
  • vllm Public Forked from vllm-project/vllm

    A high-throughput and memory-efficient inference and serving engine for LLMs

    neuralmagic/vllm’s past year of commit activity
    Python 13 Apache-2.0 8,534 0 10 Updated Jul 1, 2025
  • guidellm Public

    Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs

    neuralmagic/guidellm’s past year of commit activity
    Python 380 Apache-2.0 50 42 (4 issues need help) 15 Updated Jul 1, 2025
  • neuralmagic/model-validation-configs’s past year of commit activity
    0 0 0 9 Updated Jul 1, 2025
  • compressed-tensors Public

    A safetensors extension to efficiently store sparse quantized tensors on disk

    neuralmagic/compressed-tensors’s past year of commit activity
    Python 131 Apache-2.0 15 2 25 Updated Jul 1, 2025
  • speculators Public
    neuralmagic/speculators’s past year of commit activity
    Python 5 Apache-2.0 0 19 5 Updated Jul 1, 2025
  • axolotl Public Forked from axolotl-ai-cloud/axolotl

    Go ahead and axolotl questions

    neuralmagic/axolotl’s past year of commit activity
    Python 0 Apache-2.0 1,066 0 4 Updated Jun 22, 2025
  • nm-actions Public

    Neural Magic GHA

    neuralmagic/nm-actions’s past year of commit activity
    Python 0 Apache-2.0 0 0 3 Updated Jun 20, 2025
  • LMCache Public Forked from LMCache/LMCache

    Redis for LLMs

    neuralmagic/LMCache’s past year of commit activity
    Python 1 Apache-2.0 279 0 1 Updated Jun 18, 2025
  • lmms-eval Public Forked from EvolvingLMMs-Lab/lmms-eval

    Accelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.

    neuralmagic/lmms-eval’s past year of commit activity
    Python 0 321 0 9 Updated Jun 17, 2025