
ggml-org/llama.cpp (Public)

Fork 14.2k
Star 91.6k


619 Open 8,202 Closed

server: add auto-sleep after N seconds of idle
Labels: examples, python (python script changes), server
#18228 opened Dec 20, 2025 by ngxson

server: /v1/responses (text generation only)
Labels: examples, server
#18227 opened Dec 20, 2025 by openingnow

webui: use server presets as parameter placeholders
Labels: examples, server
#18226 opened Dec 20, 2025 by ServeurpersoCom

ggml-metal: guard buffer map slicing
Labels: Apple Metal (https://en.wikipedia.org/wiki/Metal_(API)), ggml (changes relating to the ggml tensor library for machine learning)
#18225 opened Dec 20, 2025 by SzymonPrajs

webui: apply webui_settings on first load
Labels: examples, server
#18223 opened Dec 20, 2025 by ServeurpersoCom

common : reorganize includes to prioritize vendored deps
#18222 opened Dec 20, 2025 by aldehir

ggml-metal: fix memset range and temp buffer leaks
Labels: Apple Metal, ggml
#18221 opened Dec 20, 2025 by SzymonPrajs

model: support nvidia/llama-embed-nemotron model
Labels: Model specific, python
#18220 opened Dec 20, 2025 by sfallah • Draft

Make sure that CMAKE will always use JSON headers under vendor directory
Labels: examples, server, testing (Everything test related)
#18218 opened Dec 20, 2025 by ThanatosShinji

convert: rework ftype heuristics
Labels: python
#18214 opened Dec 20, 2025 by taronaeo

ggml-metal: fix bf16/f16 matmul kernels
Labels: Apple Metal, ggml, testing
#18210 opened Dec 20, 2025 by SzymonPrajs

test-backend-ops: improve msvc build time
Labels: testing
#18209 opened Dec 19, 2025 by jeffbolznv

Fix BLAS Compile Definitions
Labels: ggml
#18205 opened Dec 19, 2025 by DaAwesomeP

HIP: Use mmq on MFMA devices for MUL_MAT_ID in cases where a lot of splits would be generated
Labels: ggml, Nvidia GPU (Issues specific to Nvidia GPUs)
#18202 opened Dec 19, 2025 by IMbackK

CLI: implemented non interactive mode
Labels: examples
#18200 opened Dec 19, 2025 by andrew-aladev • Draft

llamafile: add rvv support for sgemm kernels
Labels: ggml
#18199 opened Dec 19, 2025 by taimur-10x

cmake: Added more x86_64 CPU backends when building with GGML_CPU_ALL_VARIANTS=On
Labels: ggml
#18186 opened Dec 18, 2025 by bberberov • Draft

vulkan: fix im2col overflowing maxworkgroupcount
Labels: ggml, testing, Vulkan (Issues specific to the Vulkan backend)
#18180 opened Dec 18, 2025 by jeffbolznv

vulkan: Warptile tuning for Intel Xe2/Xe3
Labels: ggml, Vulkan
#18178 opened Dec 18, 2025 by virajwad

tool/ex/tests: consistently free ctx, then model
Labels: examples, testing
#18168 opened Dec 18, 2025 by JohannesGaessler

Adding --direct-io flag for model loading
Labels: examples
#18166 opened Dec 18, 2025 by JTischbein

spm: make llama a dynamic library; leave placeholder for ggml/gguf na…
#18165 opened Dec 18, 2025 by steven-moon

ggml-hexagon: gelu optimization
Labels: ggml
#18151 opened Dec 17, 2025 by joeldushouyu • Draft

ggml-cpu: fix todo comment #15953 and SIMD-like calculate 4 elems
Labels: ggml
#18150 opened Dec 17, 2025 by GermanAizek

[WIP] Reduce the number of fa rows for Intel
Labels: ggml, Vulkan
#18138 opened Dec 17, 2025 by mmerecki • Draft


Pull requests: ggml-org/llama.cpp
