Movatterモバイル変換

ggml-org/llama.cppPublic

NotificationsYou must be signed in to change notification settings
Fork12.4k
Star83.2k

New pull requestNew

479 Open 6,307 Closed

Author

Label

Projects

Milestones

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Sort

feat: Add extended sampling API with candidate token lists #14612

#14765 openedJul 19, 2025 bybaonudesifeizhai

Loading…

webui: add missing messages in export (#13552) examples server

#14764 openedJul 18, 2025 bysrogmann

Loading…

cuda : implement bf16 cpy ops and enable bf16 cont ggml

changes relating to the ggml tensor library for machine learning

Nvidia GPU

Issues specific to Nvidia GPUs

#14763 openedJul 18, 2025 byCISC

Loading…

tests : add non-cont K,V FA tests testing

Everything test related

#14756 openedJul 18, 2025 byggerganov

Loading…

Fix MinicpmV model converter and clip to avoid using hardcode. examples python

python script changes

#14750 openedJul 18, 2025 bygryffindor-rr

Loading…

[ROCm] Fix HIP version check for HIPBLAS V2 API compatibility ggml

changes relating to the ggml tensor library for machine learning

Nvidia GPU

Issues specific to Nvidia GPUs

#14744 openedJul 17, 2025 bydanielholanda

Loading…

metal: SSM_SCAN performance Apple Metal

https://en.wikipedia.org/wiki/Metal_(API)

ggml

changes relating to the ggml tensor library for machine learning

#14743 openedJul 17, 2025 bygabe-l-hart

Loading…

examples : predicted output for text generation examples

#14739 openedJul 17, 2025 byiamlemec

Loading…

Improve Mistral models integration with llama.cpp python

python script changes

#14737 openedJul 17, 2025 byjuliendenize • Draft

Documentation: Update build.md's Vulkan section documentation

Improvements or additions to documentation

#14736 openedJul 17, 2025 byrspOverflow

Loading…

CUDA: skip masked out KQ slices in mma FA kernel ggml

changes relating to the ggml tensor library for machine learning

Nvidia GPU

Issues specific to Nvidia GPUs

python

python script changes

#14735 openedJul 17, 2025 byJohannesGaessler

Loading…

feat: Add optional prompt processing progress streaming examples server

#14731 openedJul 17, 2025 bybaonudesifeizhai

Loading…

mtmd : Support jinja in libmtmd (Only for QwenVL and Qwen Omni) examples

#14730 openedJul 17, 2025 byalielmorsy

Loading…

server: add prompt processing progress streaming for /completion endpoint #14685 examples server

#14728 openedJul 16, 2025 bybaonudesifeizhai

Loading…

vulkan: Add logging for bf16 features to ggml_vk_print_gpu_info (#13274) ggml

changes relating to the ggml tensor library for machine learning

Vulkan

Issues specific to the Vulkan backend

#14707 openedJul 16, 2025 byPeter0x44

Loading…

Fix KleidiAI compilation errors with -DGGML_NATIVE=OFF (issue #14464) ggml

changes relating to the ggml tensor library for machine learning

#14700 openedJul 15, 2025 bybaonudesifeizhai

Loading…

Adding a simple-function-call example - hopefully not doing anything wrong examples

#14682 openedJul 14, 2025 byklogdotwebsitenotdotcom

Loading…

kleidiai: add support for get_rows ggml

changes relating to the ggml tensor library for machine learning

#14676 openedJul 14, 2025 bychaxu01

Loading…

bug fix: handle saving/loading null layers in recurrent memory

#14675 openedJul 14, 2025 byl3utterfly

Loading…

Add Pad Reflect 1D CUDA support ggml

changes relating to the ggml tensor library for machine learning

Nvidia GPU

Issues specific to Nvidia GPUs

#14659 openedJul 13, 2025 byYavorGIvanov

Loading…

webui : add a preset feature to the settings examples server

#14649 openedJul 12, 2025 bygabriellarson

Loading…

Add CUDA non-contiguous Unary Ops support build

Compilation issues

documentation

Improvements or additions to documentation

ggml

changes relating to the ggml tensor library for machine learning

Nvidia GPU

Issues specific to Nvidia GPUs

testing

Everything test related

#14639 openedJul 11, 2025 byYavorGIvanov

Loading…

common: add config presets for falcon

#14638 openedJul 11, 2025 by0xs1d

Loading…

OpenCL: addmul_mat_f16_f32_image kernel ggml

changes relating to the ggml tensor library for machine learning

OpenCL

Issues specific to the OpenCL backend

#14635 openedJul 11, 2025 byrmatif

Loading…

HIP: Enable Matrix cores for MMQ Kernels, Enable stream-K for CDNA 3 devops

improvements to build systems and github actions

ggml

changes relating to the ggml tensor library for machine learning

Nvidia GPU

Issues specific to Nvidia GPUs

#14624 openedJul 10, 2025 bydeepsek

Loading…

ProTip! What’s not been updated in a month:updated:<2025-06-18.

Movatterモバイル変換

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Movatterモバイル変換

Pull requests: ggml-org/llama.cpp

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Pull requests list