deps: bump llama.cpp correctness backports by davide221 · Pull Request #453 · Luce-Org/lucebox-hub

davide221 · 2026-06-25T19:15:37Z

Follow-up hub bump for Luce-Org/llama.cpp-dflash-ggml#21.

This updates server/deps/llama.cpp from the RDNA3.5 MMQ override merge to the
current luce-dflash head:

old: 6fd3d84e2168476d5e199a2fa1221d82ba883c21
new: 30c9d7dac8fee3c6c4cb1dddf7b97c0355cab00f

Included llama.cpp changes:

- CUDA/HIP flash-attention MMA mask offset overflow fix.
- CUDA GGML_OP_REPEAT support check restricted to implemented F32/F16 paths, avoiding runtime asserts for unsupported types.
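
The upstream patches are not reproduced in this PR, so the following is only a minimal C++ sketch of the two classes of bug described above, not the actual llama.cpp code. It assumes the mask overflow comes from 32-bit index arithmetic and that the support check is a simple type filter; `ElemType`, `mask_offset_32`, `mask_offset_64`, `repeat_supported`, and `repeat_kernel_name` are hypothetical names introduced for illustration.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical element types, standing in for ggml's tensor types.
enum class ElemType { F32, F16, BF16, Q4_0 };

// Assumed buggy pattern: the product is computed in 32 bits, so it wraps
// modulo 2^32 once row * row_stride no longer fits.
inline uint32_t mask_offset_32(uint32_t row, uint32_t row_stride) {
    return row * row_stride;
}

// Assumed fix pattern: widen both operands to 64 bits before multiplying.
inline int64_t mask_offset_64(uint32_t row, uint32_t row_stride) {
    return static_cast<int64_t>(row) * static_cast<int64_t>(row_stride);
}

// Support check that reports only the types with an implemented path.
// A scheduler can then route other types to a different backend.
inline bool repeat_supported(ElemType type) {
    return type == ElemType::F32 || type == ElemType::F16;
}

// Stand-in for kernel dispatch: it asserts on types without a kernel,
// which is the runtime failure a stricter support check avoids.
inline const char * repeat_kernel_name(ElemType type) {
    assert(repeat_supported(type) && "caller must check repeat_supported first");
    return type == ElemType::F32 ? "repeat_f32" : "repeat_f16";
}
```

With `row = 70000` and `row_stride = 70000`, the true offset is 4,900,000,000, which exceeds 2^32; `mask_offset_32` wraps to 605,032,704 while `mask_offset_64` returns the full value. Likewise, a caller that consults `repeat_supported` first never reaches the assert in `repeat_kernel_name` for a type such as `Q4_0`.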

This PR intentionally only changes the submodule pointer.

cubic-dev-ai

No issues found across 1 file

deps: bump llama.cpp correctness backports

065231b

cubic-dev-ai Bot reviewed Jun 25, 2026

davide221 merged commit 9610083 into main Jun 25, 2026
5 checks passed

davide221 deleted the deps/llama-cuda-correctness-backports branch June 26, 2026 11:02

davide221 merged 1 commit into main from deps/llama-cuda-correctness-backports
