llama.cpp releases · July 2, 2026 · 1 min read

b9860

Mirrored from llama.cpp releases for archival readability. Support the source by reading on the original site.

llama : add llama_model_ftype_name() (#25134)

llama : add llama_model_ftype_name()

Expose the model file type (quantization) name, e.g. "Q8_0" or
"Q4_K - Medium", through a new public C API. The returned pointer is
valid for the lifetime of the model and nullptr when the model is
invalid or the file type is unknown.

Signed-off-by: Adrien Gallouët <angt@huggingface.co>

Export enum

Signed-off-by: Adrien Gallouët <angt@huggingface.co>

s/llama_model_ftype_name/llama_ftype_name/

Signed-off-by: Adrien Gallouët <angt@huggingface.co>

Move "(guessed)" to the front in llama_ftype_name

Prepend the "(guessed)" label instead of appending it. This allows removing
the non-thread-safe static std::string, making the function allocation-free.
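One way to see why prepending helps is the sketch below. It is an illustration only, not the llama.cpp implementation: the names DEMO_PREFIX, demo_ftype and demo_ftype_name and the enum values are hypothetical. It assumes each name is stored as a single string literal that already starts with the label, so the unlabeled name is the same literal with the prefix skipped, and no buffer or static std::string is needed.

```c
#include <stddef.h>

/* Hypothetical stand-ins for illustration; not the real llama.cpp symbols. */
#define DEMO_PREFIX "(guessed) "

enum demo_ftype {
    DEMO_FTYPE_Q8_0    = 7,
    DEMO_FTYPE_Q4_K_M  = 15,
    DEMO_FTYPE_GUESSED = 1024, /* flag bit OR-ed into the file type */
};

/* Returns a pointer into a string literal (static storage), or NULL when
 * the file type is unknown. Nothing is allocated or written, so concurrent
 * calls are safe. */
static const char * demo_ftype_name(int ftype) {
    const int guessed = (ftype & DEMO_FTYPE_GUESSED) != 0;
    const char * name;

    switch (ftype & ~DEMO_FTYPE_GUESSED) {
        case DEMO_FTYPE_Q8_0:   name = DEMO_PREFIX "Q8_0";          break;
        case DEMO_FTYPE_Q4_K_M: name = DEMO_PREFIX "Q4_K - Medium"; break;
        default:                return NULL;
    }

    /* With the label at the front, the plain name is a suffix of the
     * labeled one: skip the prefix unless the type was guessed. */
    return guessed ? name : name + (sizeof(DEMO_PREFIX) - 1);
}
```

With the label appended instead, the plain name would be a prefix of the labeled one and could not be returned as a NUL-terminated string without copying it. Callers of such a function should still check for NULL before printing the result.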

Signed-off-by: Adrien Gallouët <angt@huggingface.co>

Add LLAMA_FTYPE_PREFIX

Signed-off-by: Adrien Gallouët <angt@huggingface.co>

Don't check for model

Signed-off-by: Adrien Gallouët <angt@huggingface.co>

macOS/iOS:

macOS Apple Silicon (arm64)
macOS Apple Silicon (arm64, KleidiAI enabled) DISABLED
macOS Intel (x64)
iOS XCFramework

Linux:

Android:

Android arm64 (CPU)

Windows:

openEuler:

DISABLED
openEuler x86 (310p)
openEuler x86 (910b, ACL Graph)
openEuler aarch64 (310p)
openEuler aarch64 (910b, ACL Graph)

UI:
