Commit a070f61
Ralf Waldukat
Update to llama.cpp 2026-01-01
- Update llama.cpp submodule (2025-08-14 → 2026-01-01)
- Remove deprecated KV cache functions (use llama_memory_* instead)
- Remove llama_sampler_init_softmax (deprecated)
- Add LLAMA_ROPE_TYPE_IMROPE constant
- Add llama_flash_attn_type enum (AUTO/DISABLED/ENABLED)
- Add llama_params_fit_status enum
- Add llama_model_meta_key enum for sampling metadata
- Add llama_model_params fields: no_host, no_alloc
- Replace llama_context_params.flash_attn bool with flash_attn_type enum
- Add 15 new API functions:
- llama_max_tensor_buft_overrides
- llama_n_ctx_seq
- llama_model_n_embd_inp
- llama_model_is_hybrid
- llama_flash_attn_type_name
- llama_model_meta_key_str
- llama_adapter_meta_* functions (5)
- llama_log_get/set
- llama_memory_breakdown_print
- Add ggml_log_callback typedef
- Disable LLAVA build (CMake incompatibility in upstream mtmd)
- Bump version 0.3.16 → 0.4.0
Breaking changes:
- flash_attn bool removed, use flash_attn_type enum
- KV cache functions removed, use llama_memory_* API
Tested with Nemotron-3-Nano-30B hybrid model.1 parent c37132b commit a070f61
File tree
6 files changed
+248
-315
lines changed- llama_cpp
- vendor
6 files changed
+248
-315
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
153 | 153 | | |
154 | 154 | | |
155 | 155 | | |
| 156 | + | |
| 157 | + | |
| 158 | + | |
| 159 | + | |
| 160 | + | |
| 161 | + | |
156 | 162 | | |
157 | 163 | | |
158 | 164 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
1 | 1 | | |
2 | 2 | | |
3 | 3 | | |
4 | | - | |
| 4 | + | |
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
341 | 341 | | |
342 | 342 | | |
343 | 343 | | |
344 | | - | |
| 344 | + | |
| 345 | + | |
| 346 | + | |
| 347 | + | |
| 348 | + | |
345 | 349 | | |
346 | 350 | | |
347 | 351 | | |
| |||
934 | 938 | | |
935 | 939 | | |
936 | 940 | | |
937 | | - | |
| 941 | + | |
| 942 | + | |
938 | 943 | | |
939 | 944 | | |
940 | 945 | | |
| |||
1041 | 1046 | | |
1042 | 1047 | | |
1043 | 1048 | | |
1044 | | - | |
| 1049 | + | |
| 1050 | + | |
| 1051 | + | |
1045 | 1052 | | |
1046 | 1053 | | |
1047 | 1054 | | |
| |||
1112 | 1119 | | |
1113 | 1120 | | |
1114 | 1121 | | |
1115 | | - | |
| 1122 | + | |
| 1123 | + | |
| 1124 | + | |
1116 | 1125 | | |
1117 | 1126 | | |
1118 | 1127 | | |
| |||
1157 | 1166 | | |
1158 | 1167 | | |
1159 | 1168 | | |
1160 | | - | |
1161 | | - | |
1162 | | - | |
| 1169 | + | |
| 1170 | + | |
| 1171 | + | |
1163 | 1172 | | |
1164 | 1173 | | |
1165 | 1174 | | |
| |||
1315 | 1324 | | |
1316 | 1325 | | |
1317 | 1326 | | |
1318 | | - | |
| 1327 | + | |
1319 | 1328 | | |
1320 | 1329 | | |
1321 | 1330 | | |
| |||
2056 | 2065 | | |
2057 | 2066 | | |
2058 | 2067 | | |
2059 | | - | |
| 2068 | + | |
| 2069 | + | |
| 2070 | + | |
| 2071 | + | |
2060 | 2072 | | |
2061 | 2073 | | |
2062 | 2074 | | |
| |||
2096 | 2108 | | |
2097 | 2109 | | |
2098 | 2110 | | |
2099 | | - | |
| 2111 | + | |
| 2112 | + | |
| 2113 | + | |
| 2114 | + | |
2100 | 2115 | | |
2101 | 2116 | | |
2102 | 2117 | | |
| |||
2318 | 2333 | | |
2319 | 2334 | | |
2320 | 2335 | | |
2321 | | - | |
| 2336 | + | |
| 2337 | + | |
| 2338 | + | |
| 2339 | + | |
| 2340 | + | |
2322 | 2341 | | |
2323 | 2342 | | |
2324 | 2343 | | |
| |||
0 commit comments