Pull requests: ggml-org/llama.cpp (Public)

614 Open 8,193 Closed

ci : only save ccache on master
Labels: devops
#18207 opened Dec 19, 2025 by CISC

server: support autoload model, support preset-only options
Labels: examples, server, testing
#18206 opened Dec 19, 2025 by ngxson

Fix BLAS Compile Definitions
Labels: ggml
#18205 opened Dec 19, 2025 by DaAwesomeP

HIP: Use mmq on MFMA devices for MUL_MAT_ID in cases where a lot of splits would be generated
Labels: ggml, Nvidia GPU
#18202 opened Dec 19, 2025 by IMbackK

ci : remove non-windows zip artifacts
Labels: devops
#18201 opened Dec 19, 2025 by CISC

CLI: implemented non interactive mode
Labels: examples
#18200 opened Dec 19, 2025 by andrew-aladev • Draft

llamafile: add rvv support for sgemm kernels
Labels: ggml
#18199 opened Dec 19, 2025 by taimur-10x

cmake: Added more x86_64 CPU backends when building with GGML_CPU_ALL_VARIANTS=On
Labels: ggml
#18186 opened Dec 18, 2025 by bberberov • Draft

vulkan: fix im2col overflowing maxworkgroupcount
Labels: ggml, testing, Vulkan
#18180 opened Dec 18, 2025 by jeffbolznv

vulkan: Warptile tuning for Intel Xe2/Xe3
Labels: ggml, Vulkan
#18178 opened Dec 18, 2025 by virajwad

tool/ex/tests: consistently free ctx, then model
Labels: examples, testing
#18168 opened Dec 18, 2025 by JohannesGaessler

Adding --direct-io flag for model loading
Labels: examples
#18166 opened Dec 18, 2025 by JTischbein

spm: make llama a dynamic library; leave placeholder for ggml/gguf na…
#18165 opened Dec 18, 2025 by steven-moon

ggml-hexagon: gelu optimization
Labels: ggml
#18151 opened Dec 17, 2025 by joeldushouyu • Draft

ggml-cpu: fix todo comment #15953 and SIMD-like calculate 4 elems
Labels: ggml
#18150 opened Dec 17, 2025 by GermanAizek

[WIP] Reduce the number of fa rows for Intel
Labels: ggml, Vulkan
#18138 opened Dec 17, 2025 by mmerecki • Draft

[WIP] Enable cooperative matrix support for Intel Arrow Lake H GPUs
Labels: ggml, Vulkan
#18126 opened Dec 17, 2025 by mmerecki • Draft

server: validate n_batch == n_ubatch for embeddings (#6263)
Labels: examples, server
#18123 opened Dec 17, 2025 by yifant-code • Draft

[WIP] Cross Entropy Loss on Metal
Labels: Apple Metal, ggml
#18121 opened Dec 17, 2025 by iliailmer • Draft

ggml-hexagon: Add lightweight atomic synchronization support to htp_ops_context for inter-task coordination
Labels: ggml
#18113 opened Dec 16, 2025 by ngdxzy

ggml-cuda: Delta-Net linear attention for Qwen3-Next
Labels: ggml, model, Nvidia GPU, testing
#18102 opened Dec 16, 2025 by hauhaut

vulkan/cuda: fix topk_moe with exp_probs_b
Labels: ggml, Nvidia GPU, testing, Vulkan
#18071 opened Dec 15, 2025 by jeffbolznv

webui: add responsive chat width option to webui (#18067)
Labels: examples, server
#18068 opened Dec 15, 2025 by ImadSaddik

vulkan: support GGML_UNARY_OP_XIELU
Labels: ggml, Vulkan
#18062 opened Dec 15, 2025 by jeffbolznv

vulkan: in graph_optimize, try to group ADD operations
Labels: ggml, Vulkan
#18060 opened Dec 15, 2025 by jeffbolznv
