-
Notifications
You must be signed in to change notification settings - Fork 11.8k
Pull requests: ggml-org/llama.cpp
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
ggml: aarch64: Implement SVE F32 kernels for Mamba Model
ggml
changes relating to the ggml tensor library for machine learning
#13602
opened May 17, 2025 by
vineelabhinav
Loading…
ggml : add memset_tensor for rpc
ggml
changes relating to the ggml tensor library for machine learning
#13601
opened May 17, 2025 by
gkpln3
Loading…
ggml : fix race-condition in ggml-rpc
ggml
changes relating to the ggml tensor library for machine learning
#13600
opened May 17, 2025 by
gkpln3
Loading…
SYCL: Avoid using SYCL-Graph for unsupported nodes
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#13587
opened May 16, 2025 by
EwanC
Loading…
CUDA: skip fully masked-out KV in FA vec kernel
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#13584
opened May 16, 2025 by
JohannesGaessler
Loading…
server : separate the notion of position and KV tokens, remove prompt truncation
breaking change
Changes that break ABIs, APIs, file formats, or other forms of backwards compatibility.
examples
python
python script changes
server
#13576
opened May 15, 2025 by
ngxson
Loading…
gguf-py : add support for sub_type (in arrays) in GGUFWriter add_key_value method
python
python script changes
#13561
opened May 15, 2025 by
CISC
Loading…
Granite Four
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
python
python script changes
testing
Everything test related
#13550
opened May 14, 2025 by
gabe-l-hart
•
Draft
2 tasks
sycl : reviewing the backend documentation
documentation
Improvements or additions to documentation
examples
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#13544
opened May 14, 2025 by
Alcpz
Loading…
sycl: disable reorder for sycl mulmat
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#13536
opened May 14, 2025 by
sgeor255
Loading…
ci : upgraded oneAPI version in SYCL workflows and dockerfile
devops
improvements to build systems and github actions
#13532
opened May 14, 2025 by
Alcpz
Loading…
cuda: set cuda compiler path (#13527)
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#13528
opened May 14, 2025 by
lizhenneng
Loading…
webui: Add editing assistant messages (#11849)
examples
server
#13522
opened May 14, 2025 by
lr1729
Loading…
convert: Swap GLM4 EOS / EOT token
python
python script changes
#13505
opened May 13, 2025 by
henk717
Loading…
llama: Add configuration presets for chat and reranking servers
#13462
opened May 12, 2025 by
heyyymonth
Loading…
Break down main function in llama-server
examples
server
#13425
opened May 10, 2025 by
ericcurtin
Loading…
Previous Next
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.