Actions: ggml-org/llama.cpp

All workflows

Showing runs from all workflows
95,466 workflow run results

cuda: Add Q5_1, Q5_0, Q4_1 and Q4_0 to F32 conversion support. (#10976)
EditorConfig Checker #22060: Pull request #12000 synchronize by gcp
February 21, 2025 23:52 19s gcp:cpy_cuda_quants
cuda: Add Q5_1, Q5_0, Q4_1 and Q4_0 to F32 conversion support. (#10976)
CI #19658: Pull request #12000 synchronize by gcp
February 21, 2025 23:52 34m 23s gcp:cpy_cuda_quants
cuda: Add Q5_1, Q5_0, Q4_1 and Q4_0 to F32 conversion support. (#10976)
Server #11061: Pull request #12000 synchronize by gcp
February 21, 2025 23:52 4m 38s gcp:cpy_cuda_quants
cuda: Add Q5_1, Q5_0, Q4_1 and Q4_0 to F32 conversion support. (#10976)
Pull Request Labeler #8400: Pull request #12000 synchronize by gcp
February 21, 2025 23:52 13s
cuda: Add Q5_1, Q5_0, Q4_1 and Q4_0 to F32 conversion support. (#10976)
EditorConfig Checker #22059: Pull request #12000 synchronize by gcp
February 21, 2025 23:14 Action required gcp:cpy_cuda_quants
cuda: Add Q5_1, Q5_0, Q4_1 and Q4_0 to F32 conversion support. (#10976)
CI #19657: Pull request #12000 synchronize by gcp
February 21, 2025 23:14 Action required gcp:cpy_cuda_quants
cuda: Add Q5_1, Q5_0, Q4_1 and Q4_0 to F32 conversion support. (#10976)
Server #11060: Pull request #12000 synchronize by gcp
February 21, 2025 23:14 Action required gcp:cpy_cuda_quants
cuda: Add Q5_1, Q5_0, Q4_1 and Q4_0 to F32 conversion support. (#10976)
Pull Request Labeler #8399: Pull request #12000 synchronize by gcp
February 21, 2025 23:14 11s
vulkan: matmul dequantization improvements
CI #19656: Pull request #12015 opened by netrunnereve
February 21, 2025 22:35 39m 50s netrunnereve:vulkan_mm
vulkan: matmul dequantization improvements
EditorConfig Checker #22058: Pull request #12015 opened by netrunnereve
February 21, 2025 22:35 17s netrunnereve:vulkan_mm
vulkan: matmul dequantization improvements
Server #11059: Pull request #12015 opened by netrunnereve
February 21, 2025 22:35 7m 50s netrunnereve:vulkan_mm
vulkan: matmul dequantization improvements
Pull Request Labeler #8398: Pull request #12015 opened by netrunnereve
February 21, 2025 22:35 13s
CUDA: optimize FA for GQA + large batches
EditorConfig Checker #22057: Pull request #12014 opened by JohannesGaessler
February 21, 2025 22:19 19s JohannesGaessler:cuda-fa-mma-23
CUDA: optimize FA for GQA + large batches
Python Type-Check #1882: Pull request #12014 opened by JohannesGaessler
February 21, 2025 22:19 1m 9s JohannesGaessler:cuda-fa-mma-23
CUDA: optimize FA for GQA + large batches
Pull Request Labeler #8397: Pull request #12014 opened by JohannesGaessler
February 21, 2025 22:19 17s
cuda: Add Q5_1, Q5_0, Q4_1 and Q4_0 to F32 conversion support. (#10976)
EditorConfig Checker #22056: Pull request #12000 synchronize by gcp
February 21, 2025 21:51 Action required gcp:cpy_cuda_quants
cuda: Add Q5_1, Q5_0, Q4_1 and Q4_0 to F32 conversion support. (#10976)
CI #19654: Pull request #12000 synchronize by gcp
February 21, 2025 21:51 Action required gcp:cpy_cuda_quants
cuda: Add Q5_1, Q5_0, Q4_1 and Q4_0 to F32 conversion support. (#10976)
Server #11057: Pull request #12000 synchronize by gcp
February 21, 2025 21:51 Action required gcp:cpy_cuda_quants
cuda: Add Q5_1, Q5_0, Q4_1 and Q4_0 to F32 conversion support. (#10976)
Pull Request Labeler #8396: Pull request #12000 synchronize by gcp
February 21, 2025 21:51 2m 9s
cuda: Add Q5_1, Q5_0, Q4_1 and Q4_0 to F32 conversion support. (#10976)
EditorConfig Checker #22055: Pull request #12000 synchronize by gcp
February 21, 2025 21:51 Action required gcp:cpy_cuda_quants
cuda: Add Q5_1, Q5_0, Q4_1 and Q4_0 to F32 conversion support. (#10976)
CI #19653: Pull request #12000 synchronize by gcp
February 21, 2025 21:51 Action required gcp:cpy_cuda_quants
cuda: Add Q5_1, Q5_0, Q4_1 and Q4_0 to F32 conversion support. (#10976)
Server #11056: Pull request #12000 synchronize by gcp
February 21, 2025 21:51 Action required gcp:cpy_cuda_quants