Skip to content

Pull requests: ggml-org/llama.cpp

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[ROCm] Fix HIP version check for HIPBLAS V2 API compatibility ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#14744 opened Jul 17, 2025 by danielholanda Loading…
metail: SSM_SCAN performance Apple Metal https://en.wikipedia.org/wiki/Metal_(API) ggml changes relating to the ggml tensor library for machine learning
#14743 opened Jul 17, 2025 by gabe-l-hart Loading…
kv-cache : fix k-shift for multiple streams
#14742 opened Jul 17, 2025 by ggerganov Loading…
Fix Gemma3n not executed as CUDA_GRAPH on NVGPUs ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#14741 opened Jul 17, 2025 by ORippler Loading…
Improve Mistral models integration with llama.cpp python python script changes
#14737 opened Jul 17, 2025 by juliendenize Draft
Documentation: Update build.md's Vulkan section documentation Improvements or additions to documentation
#14736 opened Jul 17, 2025 by rspOverflow Loading…
CUDA: skip masked out KQ slices in mma FA kernel ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs python python script changes
#14735 opened Jul 17, 2025 by JohannesGaessler Loading…
use max work group size for device to replace the magic number ggml changes relating to the ggml tensor library for machine learning SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language
#14732 opened Jul 17, 2025 by NeoZhangJianyu Loading…
nix: use optionalAttrs for env mkDerivation attrset argument devops improvements to build systems and github actions nix Issues specific to consuming flake.nix, or generally concerned with ❄ Nix-based llama.cpp deployment
#14726 opened Jul 16, 2025 by amozeo Loading…
CUDA: set_rows + cpy.cu refactor ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#14712 opened Jul 16, 2025 by am17an Loading…
vulkan: Add logging for bf16 features to ggml_vk_print_gpu_info (#13274) ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#14707 opened Jul 16, 2025 by Peter0x44 Loading…
Fix KleidiAI compilation errors with -DGGML_NATIVE=OFF (issue #14464) ggml changes relating to the ggml tensor library for machine learning
#14700 opened Jul 15, 2025 by baonudesifeizhai Loading…
kleidiai: add support for get_rows ggml changes relating to the ggml tensor library for machine learning
#14676 opened Jul 14, 2025 by chaxu01 Loading…
Add Pad Reflect 1D CUDA support ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#14659 opened Jul 13, 2025 by YavorGIvanov Loading…
Model: Add support for Ernie 4.5 MoE python python script changes
#14658 opened Jul 13, 2025 by pwilkin Loading…
Add CUDA non-contiguous Unary Ops support build Compilation issues documentation Improvements or additions to documentation ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs testing Everything test related
#14639 opened Jul 11, 2025 by YavorGIvanov Loading…
common: add config presets for falcon
#14638 opened Jul 11, 2025 by 0xs1d Loading…
OpenCL: add mul_mat_f16_f32_image kernel ggml changes relating to the ggml tensor library for machine learning OpenCL Issues specific to the OpenCL backend
#14635 opened Jul 11, 2025 by rmatif Loading…
ProTip! Exclude everything labeled bug with -label:bug.