Pull requests: ggml-org/llama.cpp
Labels: ggml (changes relating to the ggml tensor library for machine learning), Vulkan (issues specific to the Vulkan backend), Nvidia GPU (issues specific to Nvidia GPUs), testing (everything test related), vibe-coded (created with heavy use of LLM assistants, requires human verification), need feedback (testing and feedback with results are needed), examples, server.

- vulkan: perf_logger improvements [ggml, Vulkan] (#17672, opened Dec 2, 2025 by jeffbolznv)
- Add a couple of file types to the text section [examples, server] (#17670, opened Dec 1, 2025 by pwilkin)
- server: explicitly set exec path when create new instance [examples, server] (#17669, opened Dec 1, 2025 by ngxson)
- vulkan: fix top_k bug when there are ties in the input [ggml, testing, Vulkan] (#17659, opened Dec 1, 2025 by jeffbolznv)
- ggml-cpu: Add operator-level execution time profiling [ggml] (#17657, opened Dec 1, 2025 by kimminsu38oo)
- ggml: use 'exists(const std::filesystem::path&, std::error_code&)' instead of 'exists(const std::filesystem::path&)' to enhance robustness [ggml] (#17653, opened Dec 1, 2025 by flyinskyin2013)
- ggml: added missing cast sections in memcpy [ggml, vibe-coded] (#17651, opened Dec 1, 2025 by GermanAizek)
- ggml-cpu: remove duplicate conditional check 'iid' [ggml] (#17650, opened Dec 1, 2025 by GermanAizek)
- gguf: llama: use = default for trivial constructors and destructors [ggml, vibe-coded] (#17649, opened Dec 1, 2025 by GermanAizek)
- sgemm: reuse loaded vector in AVX dot product calculation [ggml, vibe-coded] (#17648, opened Dec 1, 2025 by GermanAizek)
- llama-vocab: replace postfix with prefix increment for iterators [vibe-coded] (#17646, opened Dec 1, 2025 by GermanAizek)
- vec: optimize AVX2/FMA sum-of-squares with loop unrolling and FMA [ggml, vibe-coded] (#17642, opened Dec 1, 2025 by GermanAizek)
- ggml-quants: use _mm256_testz_si256 for mask checks in AVX2 [ggml, vibe-coded] (#17641, opened Dec 1, 2025 by GermanAizek)
- ggml-alloc: optimize free block shifting with memmove [ggml, vibe-coded] (#17640, opened Dec 1, 2025 by GermanAizek)
- ggml-cuda: reorder only relevant nodes [ggml, Nvidia GPU] (#17639, opened Dec 1, 2025 by am17an)
- vulkan: Replace deprecated VK_EXT_validation_features [ggml, Vulkan] (#17637, opened Dec 1, 2025 by rillomas)
- common : compute average token length from vocabulary (#17632, opened Dec 1, 2025 by yifant-code, draft)
- llama-router, the C++ "llama-swap" for llama.cpp [examples, need feedback, server, testing] (#17629, opened Nov 30, 2025 by ServeurpersoCom, draft)
- vulkan: set all memory allocations to high priority [ggml, Vulkan] (#17624, opened Nov 30, 2025 by jeffbolznv, draft)
- vulkan: Reduce temporary memory usage for TOP_K [ggml, Vulkan] (#17623, opened Nov 30, 2025 by jeffbolznv)
- model : Fix marker placement for LFM2-VL in single turn llama-mtmd-cli [examples] (#17616, opened Nov 30, 2025 by tdakhran)
- ggml : remove redundant n_copies check when setting input/output [ggml] (#17612, opened Nov 30, 2025 by danbev)