@ModelCloud

ModelCloud.ai

Our mission is to give everyone, including bots, unlimited and free access to LLM/AI models.

Pinned

  1. GPTQModel Public

    Production-ready LLM model compression/quantization toolkit with hardware-accelerated inference support for both CPU and GPU via HF, vLLM, and SGLang.

    Python 396 56

  2. Device-SMI Public

    Self-contained Python lib with zero dependencies that gives you unified device properties for GPU, CPU, and NPU. No more calling separate tools such as nvidia-smi or /proc/cpuinfo and parsing it y…

    Python 10 1

Repositories

Showing 10 of 12 repositories
  • GPTQModel Public

    Production-ready LLM model compression/quantization toolkit with hardware-accelerated inference support for both CPU and GPU via HF, vLLM, and SGLang.

    Python 396 Apache-2.0 56 26 8 Updated Mar 28, 2025
  • vllm Public Forked from vllm-project/vllm

    A high-throughput and memory-efficient inference and serving engine for LLMs

    Python 0 Apache-2.0 6,618 0 0 Updated Mar 27, 2025
  • LogBar Public

    A unified Logger and ProgressBar util with zero dependencies.

    Python 4 Apache-2.0 0 1 0 Updated Mar 26, 2025
  • lm-evaluation-harness Public Forked from EleutherAI/lm-evaluation-harness

    A framework for few-shot evaluation of language models.

    Python 0 MIT 2,272 0 0 Updated Mar 20, 2025
  • rockthem Public
    Cuda 0 Apache-2.0 0 0 0 Updated Mar 13, 2025
  • Tokenicer Public

    A (nicer) tokenizer you want to use for model inference and training, with all known preventable gotchas normalized or auto-fixed.

    Python 7 Apache-2.0 2 0 1 Updated Mar 12, 2025
  • platinum-benchmarks
    Python 0 CC-BY-4.0 1 0 0 Updated Mar 6, 2025
  • peft Public Forked from huggingface/peft

    🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

    Python 0 Apache-2.0 1,824 0 0 Updated Mar 4, 2025
  • sglang Public Forked from sgl-project/sglang

    SGLang is a fast serving framework for large language models and vision language models.

    Python 0 Apache-2.0 1,393 0 0 Updated Mar 4, 2025
  • Device-SMI Public

    Self-contained Python lib with zero dependencies that gives you unified device properties for GPU, CPU, and NPU. No more calling separate tools such as nvidia-smi or /proc/cpuinfo and parsing it yourself.

    Python 10 Apache-2.0 1 2 1 Updated Mar 1, 2025
