Releases · foldl/chatllm.cpp

Release list

v2026.05 Latest

Latest

foldl released this 27 May 06:55

v2026.05

1dd24bc

New models

Gemma-4 (known issue #128)
InternVL3.5

Other updates

Video input implemented for all models that support it.
q4_k quantization works now.
A modified version of WebUI from llama.cpp is used.

Assets 3

v0.22

foldl released this 27 Mar 10:34

v0.22

6f1db57

New Models

Qianfan-OCR
Penguin-VL

Bug fixing

A nearly one-year old bug (Issue #65)

Assets 3

v0.21

foldl released this 10 Mar 10:45

v0.21

bf2ddae

New models:

Qwen3.5
GLM-OCR
Youtu-VL
Youtu-LLM
Qwen3-TTS (voice clone only support xvec)

Assets 3

v0.20

foldl released this 10 Feb 02:03

v0.20

70045db

Support new models:

QWen3-ASR
QWen3-ForcedAligner

Assets 3

v0.19

foldl released this 22 Jan 09:45

v0.19

f85eb5b

As always, more models are supported. Note that most of the new models are special in some way.
- Step3-VL: strong vision capability
- GLM-4.7-Flash: strong coding capability
- TranslateGemma: translation
- WeDLM: diffusion with AR
- QWen3-VL-Embedding/Reranker: multimodal embedding
- HY-MT: translation
- GLM-ASR-Nano: ASR
- Qwen3-VL: strong vision capability

Assets 3

v0.18

foldl released this 27 Dec 02:02

v0.18

58782ae

As always, more models are supported.
Windows: prebuilt binary with Vulkan (1.4.335.0). Use -ngl all to run whole model on default GPU.
New server.exe with built-in llama.cpp WebUI

Assets 3

v0.17

foldl released this 27 Oct 02:04

v0.17

cf94e0b

As always, more models are supported, notably LLaDA2.0.
Windows: prebuilt binary with Vulkan (1.4.321.1). Use -ngl all to run whole model on default GPU.

Assets 3

v0.16

foldl released this 13 Oct 11:15

v0.16

84f755f

As always, more models are supported, notably Janus-Pro.
Windows: prebuilt binary with Vulkan (1.4.321.1). Use -ngl all to run whole model on default GPU.

Assets 3

v0.15

foldl released this 07 Sep 10:47

v0.15

febc457

As always, more models are supported.
Windows: prebuilt binary with Vulkan (1.4.321.1). Use -ngl all to run whole model on default GPU.

Assets 3

v0.14

foldl released this 18 Aug 08:32

v0.14

d2464d6

Fix main_nim.exe: could not download models that are > 2GB due to this.

Assets 3

Releases: foldl/chatllm.cpp

Release list

v2026.05

New models

Other updates

Uh oh!

v0.22

New Models

Bug fixing

Uh oh!

v0.21

Uh oh!

v0.20

Uh oh!

v0.19

Uh oh!

v0.18

Uh oh!

v0.17

Uh oh!

v0.16

Uh oh!

v0.15

Uh oh!

v0.14

Uh oh!