- Notifications
You must be signed in to change notification settings - Fork14.2k
Pull requests: ggml-org/llama.cpp
Author
Uh oh!
There was an error while loading.Please reload this page.
Label
Uh oh!
There was an error while loading.Please reload this page.
Projects
Uh oh!
There was an error while loading.Please reload this page.
Milestones
Uh oh!
There was an error while loading.Please reload this page.
Reviews
Assignee
Assigned to nobodyLoading
Uh oh!
There was an error while loading.Please reload this page.
Sort
Pull requests list
server: add auto-sleep after N seconds of idle examples pythonpython script changes server
#18228 openedDec 20, 2025 byngxsonLoading…
server: /v1/responses (text generation only) examples server
#18227 openedDec 20, 2025 byopeningnowLoading…
webui: use server presets as parameter placeholders examples server
#18226 openedDec 20, 2025 byServeurpersoComLoading…
ggml-metal: guard buffer map slicing Apple Metalhttps://en.wikipedia.org/wiki/Metal_(API) ggmlchanges relating to the ggml tensor library for machine learning
#18225 openedDec 20, 2025 bySzymonPrajsLoading…
webui: apply webui_settings on first load examples server
#18223 openedDec 20, 2025 byServeurpersoComLoading…
common : reorganize includes to prioritize vendored deps
#18222 openedDec 20, 2025 byaldehirLoading…
ggml-metal: fix memset range and temp buffer leaks Apple Metalhttps://en.wikipedia.org/wiki/Metal_(API) ggmlchanges relating to the ggml tensor library for machine learning
#18221 openedDec 20, 2025 bySzymonPrajsLoading…
Make sure that CMAKE will always use JSON headers under vendor directory examples server testingEverything test related
#18218 openedDec 20, 2025 byThanatosShinjiLoading…
convert: rework ftype heuristics pythonpython script changes
#18214 openedDec 20, 2025 bytaronaeoLoading…
ggml-metal: fix bf16/f16 matmul kernels Apple Metalhttps://en.wikipedia.org/wiki/Metal_(API) ggmlchanges relating to the ggml tensor library for machine learning testingEverything test related
#18210 openedDec 20, 2025 bySzymonPrajsLoading…
test-backend-ops: improve msvc build time testingEverything test related
#18209 openedDec 19, 2025 byjeffbolznvLoading…
Fix BLAS Compile Definitions ggmlchanges relating to the ggml tensor library for machine learning
#18205 openedDec 19, 2025 byDaAwesomePLoading…
HIP: Use mmq on MFMA devices for MUL_MAT_ID in cases where a lot of splits would be generated ggmlchanges relating to the ggml tensor library for machine learning Nvidia GPUIssues specific to Nvidia GPUs
#18202 openedDec 19, 2025 byIMbackKLoading…
llamafile: add rvv support for sgemm kernels ggmlchanges relating to the ggml tensor library for machine learning
#18199 openedDec 19, 2025 bytaimur-10xLoading…
cmake: Added more x86_64 CPU backends when building withchanges relating to the ggml tensor library for machine learning
GGML_CPU_ALL_VARIANTS=On ggmlvulkan: fix im2col overflowing maxworkgroupcount ggmlchanges relating to the ggml tensor library for machine learning testingEverything test related VulkanIssues specific to the Vulkan backend
#18180 openedDec 18, 2025 byjeffbolznvLoading…
vulkan: Warptile tuning for Intel Xe2/Xe3 ggmlchanges relating to the ggml tensor library for machine learning VulkanIssues specific to the Vulkan backend
#18178 openedDec 18, 2025 byvirajwadLoading…
tool/ex/tests: consistently free ctx, then model examples testingEverything test related
#18168 openedDec 18, 2025 byJohannesGaesslerLoading…
spm: make llama a dynamic library; leave placeholder for ggml/gguf na…
#18165 openedDec 18, 2025 bysteven-moonLoading…
ggml-hexagon: gelu optimization ggmlchanges relating to the ggml tensor library for machine learning
#18151 openedDec 17, 2025 byjoeldushouyu • Draft
ggml-cpu: fix todo comment #15953 and SIMD-like calculate 4 elems ggmlchanges relating to the ggml tensor library for machine learning
#18150 openedDec 17, 2025 byGermanAizekLoading…
ProTip! Typegi on any issue or pull request to go back to the issue listing page.