ggml-org/llama.cpp (Public)

Fork 13.9k
Star 90.6k

Pull requests: 606 Open, 7,843 Closed

clip: fix nb calculation for qwen3-vl
Labels: examples
#17594 opened Nov 29, 2025 by ngxson

Feature/kimi linear support
Labels: ggml (changes relating to the ggml tensor library for machine learning), model (Model specific), Nvidia GPU (Issues specific to Nvidia GPUs), python (python script changes)
#17592 opened Nov 29, 2025 by cacaview

update LLAMA_ARG_KV_SPLIT --> LLAMA_ARG_KV_UNIFIED to match CLI argument
#17588 opened Nov 29, 2025 by ddh0

Override SSM_A op for Qwen3 Next to reduce splits
Labels: model
#17587 opened Nov 29, 2025 by pwilkin

Improve Qwen3-Next Speed
Labels: model
#17585 opened Nov 29, 2025 by lovedheart • Draft

Add support for CUMSUM and TRI for CUDA.
Labels: ggml, Nvidia GPU, testing (Everything test related)
#17584 opened Nov 28, 2025 by pwilkin

cmake: fix macOS build with -DGGML_BACKEND_DL=ON
Labels: ggml
#17581 opened Nov 28, 2025 by giladgd

Add safetensors support
#17580 opened Nov 28, 2025 by ericcurtin • Draft

Add PagedAttention support (experimental, CUDA only)
Labels: ggml, Nvidia GPU
#17579 opened Nov 28, 2025 by ericcurtin

model: LFM2-VL fixes
Labels: examples, ggml, Nvidia GPU, testing
#17577 opened Nov 28, 2025 by tdakhran

HIP: enable WMMA-MMQ INT kernels for RDNA 3
Labels: ggml, Nvidia GPU
#17576 opened Nov 28, 2025 by jiachengjason • Draft

mtmd: support dots.ocr
Labels: examples, python
#17575 opened Nov 28, 2025 by ngxson • Draft

[SYCL] enhance argsort for UT
Labels: ggml, SYCL (https://en.wikipedia.org/wiki/SYCL - GPU programming language)
#17573 opened Nov 28, 2025 by NeoZhangJianyu

Server: Change Invalid Schema from Server Error (500) to User Error (400)
Labels: examples, python, server, testing
#17572 opened Nov 28, 2025 by chadvoegele

ggml-hexagon: fix rope failure at test-backend-ops
Labels: ggml
#17565 opened Nov 28, 2025 by chraac

CANN: The Ger operator of OUT_PROD is not supported on the 310p device
Labels: Ascend NPU (issues specific to Ascend NPUs), ggml
#17563 opened Nov 28, 2025 by TianHao324

Fix unreadable user markdown colors and truncate long texts in deletion dialogs
Labels: examples, server
#17555 opened Nov 27, 2025 by ServeurpersoCom

New llama-run
Labels: examples, server
#17554 opened Nov 27, 2025 by ericcurtin

cmake : add option to build and link LibreSSL
#17552 opened Nov 27, 2025 by angt

ggml-cpu: Add operator-level execution time profiling
Labels: ggml
#17544 opened Nov 27, 2025 by kimminsu38oo

CANN: add support for partial RoPE and Vision mode
Labels: Ascend NPU, ggml
#17543 opened Nov 27, 2025 by noemotiovon

vulkan: Fix mismatch in TOPK_MOE unit test
Labels: ggml, Vulkan (Issues specific to the Vulkan backend)
#17541 opened Nov 27, 2025 by rillomas • Draft

server: explicitly set the function name in lambda
Labels: examples, server
#17538 opened Nov 27, 2025 by haiyuewa

llama.cpp with sentencepiece
Labels: testing
#17529 opened Nov 26, 2025 by awenzel67

ggml-cpu: BMI2 is only available on amd64
Labels: ggml
#17528 opened Nov 26, 2025 by candrews
