Workflow runs · ggml-org/llama.cpp

Actions

All workflows

Actions

Loading...
Loading

Showing runs from all workflows

95,466 workflow run results

cuda: Add Q5_1, Q5_0, Q4_1 and Q4_0 to F32 conversion support. (#10976) EditorConfig Checker #22060: Pull request #12000 synchronize by gcp

February 21, 2025 23:52

19s gcp:cpy_cuda_quants

gcp:cpy_cuda_quants

February 21, 2025 23:52

19s

cuda: Add Q5_1, Q5_0, Q4_1 and Q4_0 to F32 conversion support. (#10976) CI #19658: Pull request #12000 synchronize by gcp

February 21, 2025 23:52

34m 23s gcp:cpy_cuda_quants

gcp:cpy_cuda_quants

February 21, 2025 23:52

34m 23s

cuda: Add Q5_1, Q5_0, Q4_1 and Q4_0 to F32 conversion support. (#10976) Server #11061: Pull request #12000 synchronize by gcp

February 21, 2025 23:52

4m 38s gcp:cpy_cuda_quants

gcp:cpy_cuda_quants

February 21, 2025 23:52

4m 38s

cuda: Add Q5_1, Q5_0, Q4_1 and Q4_0 to F32 conversion support. (#10976) Pull Request Labeler #8400: Pull request #12000 synchronize by gcp

February 21, 2025 23:52

13s

February 21, 2025 23:52

13s

cuda: Add Q5_1, Q5_0, Q4_1 and Q4_0 to F32 conversion support. (#10976) EditorConfig Checker #22059: Pull request #12000 synchronize by gcp

February 21, 2025 23:14

Action required gcp:cpy_cuda_quants

gcp:cpy_cuda_quants

February 21, 2025 23:14

Action required

cuda: Add Q5_1, Q5_0, Q4_1 and Q4_0 to F32 conversion support. (#10976) CI #19657: Pull request #12000 synchronize by gcp

February 21, 2025 23:14

Action required gcp:cpy_cuda_quants

gcp:cpy_cuda_quants

February 21, 2025 23:14

Action required

cuda: Add Q5_1, Q5_0, Q4_1 and Q4_0 to F32 conversion support. (#10976) Server #11060: Pull request #12000 synchronize by gcp

February 21, 2025 23:14

Action required gcp:cpy_cuda_quants

gcp:cpy_cuda_quants

February 21, 2025 23:14

Action required

cuda: Add Q5_1, Q5_0, Q4_1 and Q4_0 to F32 conversion support. (#10976) Pull Request Labeler #8399: Pull request #12000 synchronize by gcp

February 21, 2025 23:14

11s

February 21, 2025 23:14

11s

vulkan: matmul dequantization improvements CI #19656: Pull request #12015 opened by netrunnereve

February 21, 2025 22:35

39m 50s netrunnereve:vulkan_mm

netrunnereve:vulkan_mm

February 21, 2025 22:35

39m 50s

vulkan: matmul dequantization improvements EditorConfig Checker #22058: Pull request #12015 opened by netrunnereve

February 21, 2025 22:35

17s netrunnereve:vulkan_mm

netrunnereve:vulkan_mm

February 21, 2025 22:35

17s

vulkan: matmul dequantization improvements Server #11059: Pull request #12015 opened by netrunnereve

February 21, 2025 22:35

7m 50s netrunnereve:vulkan_mm

netrunnereve:vulkan_mm

February 21, 2025 22:35

7m 50s

vulkan: matmul dequantization improvements Pull Request Labeler #8398: Pull request #12015 opened by netrunnereve

February 21, 2025 22:35

13s

February 21, 2025 22:35

13s

CUDA: optimize FA for GQA + large batches CI #19655: Pull request #12014 opened by JohannesGaessler

February 21, 2025 22:19

46m 7s JohannesGaessler:cuda-fa-mma-23

JohannesGaessler:cuda-fa-mma-23

February 21, 2025 22:19

46m 7s

CUDA: optimize FA for GQA + large batches Server #11058: Pull request #12014 opened by JohannesGaessler

February 21, 2025 22:19

8m 58s JohannesGaessler:cuda-fa-mma-23

JohannesGaessler:cuda-fa-mma-23

February 21, 2025 22:19

8m 58s

CUDA: optimize FA for GQA + large batches EditorConfig Checker #22057: Pull request #12014 opened by JohannesGaessler

February 21, 2025 22:19

19s JohannesGaessler:cuda-fa-mma-23

JohannesGaessler:cuda-fa-mma-23

February 21, 2025 22:19

19s

CUDA: optimize FA for GQA + large batches flake8 Lint #17499: Pull request #12014 opened by JohannesGaessler

February 21, 2025 22:19

21s JohannesGaessler:cuda-fa-mma-23

JohannesGaessler:cuda-fa-mma-23

February 21, 2025 22:19

21s

CUDA: optimize FA for GQA + large batches Python Type-Check #1882: Pull request #12014 opened by JohannesGaessler

February 21, 2025 22:19

1m 9s JohannesGaessler:cuda-fa-mma-23

JohannesGaessler:cuda-fa-mma-23

February 21, 2025 22:19

1m 9s

CUDA: optimize FA for GQA + large batches Pull Request Labeler #8397: Pull request #12014 opened by JohannesGaessler

February 21, 2025 22:19

17s

February 21, 2025 22:19

17s

cuda: Add Q5_1, Q5_0, Q4_1 and Q4_0 to F32 conversion support. (#10976) EditorConfig Checker #22056: Pull request #12000 synchronize by gcp

February 21, 2025 21:51

Action required gcp:cpy_cuda_quants

gcp:cpy_cuda_quants

February 21, 2025 21:51

Action required

cuda: Add Q5_1, Q5_0, Q4_1 and Q4_0 to F32 conversion support. (#10976) CI #19654: Pull request #12000 synchronize by gcp

February 21, 2025 21:51

Action required gcp:cpy_cuda_quants

gcp:cpy_cuda_quants

February 21, 2025 21:51

Action required

cuda: Add Q5_1, Q5_0, Q4_1 and Q4_0 to F32 conversion support. (#10976) Server #11057: Pull request #12000 synchronize by gcp

February 21, 2025 21:51

Action required gcp:cpy_cuda_quants

gcp:cpy_cuda_quants

February 21, 2025 21:51

Action required

cuda: Add Q5_1, Q5_0, Q4_1 and Q4_0 to F32 conversion support. (#10976) Pull Request Labeler #8396: Pull request #12000 synchronize by gcp

February 21, 2025 21:51

2m 9s

February 21, 2025 21:51

2m 9s

cuda: Add Q5_1, Q5_0, Q4_1 and Q4_0 to F32 conversion support. (#10976) EditorConfig Checker #22055: Pull request #12000 synchronize by gcp

February 21, 2025 21:51

Action required gcp:cpy_cuda_quants

gcp:cpy_cuda_quants

February 21, 2025 21:51

Action required

cuda: Add Q5_1, Q5_0, Q4_1 and Q4_0 to F32 conversion support. (#10976) CI #19653: Pull request #12000 synchronize by gcp

February 21, 2025 21:51

Action required gcp:cpy_cuda_quants

gcp:cpy_cuda_quants

February 21, 2025 21:51

Action required

cuda: Add Q5_1, Q5_0, Q4_1 and Q4_0 to F32 conversion support. (#10976) Server #11056: Pull request #12000 synchronize by gcp

February 21, 2025 21:51

Action required gcp:cpy_cuda_quants

gcp:cpy_cuda_quants

February 21, 2025 21:51

Action required

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Actions

Workflows

Management

All workflows

Actions

Loading...
Loading

All workflows

Filter by Event

Sorry, something went wrong.

Sorry, something went wrong.

No matching events.

Filter by Status

Sorry, something went wrong.

Sorry, something went wrong.

No matching statuses.

Filter by Branch

Sorry, something went wrong.

Sorry, something went wrong.

No matching branches.

Filter by Actor

Sorry, something went wrong.

Sorry, something went wrong.

No matching users.

Actions: ggml-org/llama.cpp

Actions

All workflows All workflows Actions Loading... Loading Sorry, something went wrong.

All workflows

All workflows

Actions

Loading...
Loading