Skip to content

Pull requests: ggml-org/llama.cpp

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Vulkan: Support fp32 accumulator in quantized matmul to fix GLM4-32B incoherence ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#13607 opened May 17, 2025 by 0cc4m Draft
ggml: aarch64: Implement SVE F32 kernels for Mamba Model ggml changes relating to the ggml tensor library for machine learning
#13602 opened May 17, 2025 by vineelabhinav Loading…
ggml : add memset_tensor for rpc ggml changes relating to the ggml tensor library for machine learning
#13601 opened May 17, 2025 by gkpln3 Loading…
ggml : fix race-condition in ggml-rpc ggml changes relating to the ggml tensor library for machine learning
#13600 opened May 17, 2025 by gkpln3 Loading…
SYCL: Avoid using SYCL-Graph for unsupported nodes ggml changes relating to the ggml tensor library for machine learning SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language
#13587 opened May 16, 2025 by EwanC Loading…
CUDA: skip fully masked-out KV in FA vec kernel ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#13584 opened May 16, 2025 by JohannesGaessler Loading…
server : separate the notion of position and KV tokens, remove prompt truncation breaking change Changes that break ABIs, APIs, file formats, or other forms of backwards compatibility. examples python python script changes server
#13576 opened May 15, 2025 by ngxson Loading…
Update python verions examples python python script changes server
#13574 opened May 15, 2025 by robbiemu Loading…
Granite Four Apple Metal https://en.wikipedia.org/wiki/Metal_(API) ggml changes relating to the ggml tensor library for machine learning python python script changes testing Everything test related
#13550 opened May 14, 2025 by gabe-l-hart Draft
2 tasks
sycl : reviewing the backend documentation documentation Improvements or additions to documentation examples SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language
#13544 opened May 14, 2025 by Alcpz Loading…
Fix build on OpenBSD examples
#13541 opened May 14, 2025 by percypiper Loading…
sycl: disable reorder for sycl mulmat ggml changes relating to the ggml tensor library for machine learning SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language
#13536 opened May 14, 2025 by sgeor255 Loading…
ci : upgraded oneAPI version in SYCL workflows and dockerfile devops improvements to build systems and github actions
#13532 opened May 14, 2025 by Alcpz Loading…
cuda: set cuda compiler path (#13527) ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#13528 opened May 14, 2025 by lizhenneng Loading…
convert: Swap GLM4 EOS / EOT token python python script changes
#13505 opened May 13, 2025 by henk717 Loading…
[SYCL] Overcoming workaround for mmap() allocation on Windows and remove useless wait examples ggml changes relating to the ggml tensor library for machine learning SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language
#13482 opened May 12, 2025 by s-Nick Loading…
docker : enable RPC for docker images devops improvements to build systems and github actions
#13474 opened May 12, 2025 by rgerganov Draft
Support Seed-Coder chat template
#13472 opened May 12, 2025 by yeahdongcn Loading…
2 tasks done
Webui dynamic config examples server
#13429 opened May 10, 2025 by ServeurpersoCom Loading…
ProTip! Type g i on any issue or pull request to go back to the issue listing page.