Pull requests · ggml-org/llama.cpp

1,054 Open 11,144 Closed

examples python

#24163 opened Jun 5, 2026 by tc-mb Contributor

ggml model python

#24162 opened Jun 5, 2026 by am17an Contributor • Draft

ggml OpenCL

#24160 opened Jun 5, 2026 by lhez Contributor • Draft

examples python server

#24154 opened Jun 5, 2026 by Anuj-Attri

ggml SYCL

#24152 opened Jun 5, 2026 by Spruill-1

examples server

#24150 opened Jun 4, 2026 by ngxson Contributor

examples server/ui

#24149 opened Jun 4, 2026 by ngxson Contributor

examples python server

#24143 opened Jun 4, 2026 by alainnothere

examples

#24133 opened Jun 4, 2026 by pcuenca Contributor

ggml Nvidia GPU

#24129 opened Jun 4, 2026 by harkgill-amd

ggml Nvidia GPU

#24127 opened Jun 4, 2026 by JohannesGaessler Contributor

vulkan: add v_dot2_f32_f16 support in matrix-matrix multiplication and Flash Attention ggml Vulkan

#24123 opened Jun 4, 2026 by 0cc4m Contributor

examples ggml python

#24122 opened Jun 4, 2026 by Donovoi • Draft

devops examples python server

#24114 opened Jun 4, 2026 by dacorvo

examples python server

#24113 opened Jun 4, 2026 by dacorvo

ggml

#24094 opened Jun 3, 2026 by banksio
