llama.cpp releases · July 1, 2026 · 1 min read

b9859


Mirrored from llama.cpp releases for archival readability. Support the source by reading on the original site.


opencl: allow loading precompiled binary kernels from library (#23042)

opencl: allow loading binary kernel
opencl: add libdl.h
ggml-backend-dl is in ggml, which depends on the backend libs, so
ggml-opencl cannot depend on ggml-backend-dl;
add libdl.h to break the cyclic dep
opencl: allow loading bin kernel lib
opencl: load gemm_moe_mxfp4_f32_ns from kernel lib if available
opencl: load q8_0 gemm from kernel lib
opencl: load q4_0 moe gemm from kernel lib
opencl: load q4_1 moe gemm from kernel lib
opencl: load q4_k moe gemm from kernel lib
opencl: always declare get_adreno_bin_kernel_func_t
opencl: rephrase message
opencl: fix for rebase
opencl: update doc
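
The "load ... from kernel lib if available" pattern in the commits above can be illustrated with a minimal POSIX sketch: try to open an optional shared library at run time, look up one entry point, and fall back when either step fails. Everything named here is an assumption for illustration only: the `kernel_lib` struct, the `get_bin_kernel_fn` signature, and the `get_bin_kernel` symbol name are hypothetical and are not the actual `get_adreno_bin_kernel_func_t` declaration or the contents of `libdl.h`. In a real OpenCL backend the returned bytes would presumably be passed to `clCreateProgramWithBinary`; that step is left out so the sketch has no OpenCL dependency.

```c
#include <dlfcn.h>
#include <stddef.h>
#include <stdio.h>

/* Hypothetical lookup signature: given a kernel name, return a pointer to its
 * precompiled binary and store the size in bytes. Not the real llama.cpp typedef. */
typedef const unsigned char *(*get_bin_kernel_fn)(const char *name, size_t *size);

typedef struct {
    void             *handle;      /* dlopen handle, NULL if the library is absent */
    get_bin_kernel_fn get_kernel;  /* resolved entry point, NULL if unavailable    */
} kernel_lib;

/* Try to open the optional kernel library.
 * Returns 1 on success, 0 if the library or the symbol is missing. */
static int kernel_lib_open(kernel_lib *lib, const char *path) {
    lib->handle     = NULL;
    lib->get_kernel = NULL;

    void *h = dlopen(path, RTLD_NOW | RTLD_LOCAL);
    if (!h) {
        fprintf(stderr, "kernel lib not loaded (%s), using source kernels\n", dlerror());
        return 0;
    }

    /* "get_bin_kernel" is a made-up symbol name for this sketch. */
    void *sym = dlsym(h, "get_bin_kernel");
    if (!sym) {
        dlclose(h);
        return 0;
    }

    lib->handle = h;
    /* POSIX guarantees this object-to-function pointer conversion works for dlsym. */
    *(void **)(&lib->get_kernel) = sym;
    return 1;
}

/* Look up one kernel binary; returns NULL (and size 0) when the library is
 * unavailable, so the caller can compile the kernel from source instead. */
static const unsigned char *kernel_lib_find(const kernel_lib *lib, const char *name, size_t *size) {
    *size = 0;
    if (!lib->get_kernel) {
        return NULL;
    }
    return lib->get_kernel(name, size);
}

/* Release the library handle if one was opened. */
static void kernel_lib_close(kernel_lib *lib) {
    if (lib->handle) {
        dlclose(lib->handle);
    }
    lib->handle     = NULL;
    lib->get_kernel = NULL;
}
```

A caller would run `kernel_lib_open` once at backend initialization, then call `kernel_lib_find` per kernel (for example with the name `gemm_moe_mxfp4_f32_ns`) and build from source whenever it returns NULL. Keeping the loader in a small standalone header, rather than reusing a shared loading helper, is one way to avoid the cyclic dependency the commit message describes.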

macOS/iOS:

macOS Apple Silicon (arm64)
macOS Apple Silicon (arm64, KleidiAI enabled) DISABLED
macOS Intel (x64)
iOS XCFramework

Linux:

Android:

Android arm64 (CPU)

Windows:

openEuler:

DISABLED
openEuler x86 (310p)
openEuler x86 (910b, ACL Graph)
openEuler aarch64 (310p)
openEuler aarch64 (910b, ACL Graph)

UI:

UI
