Pull requests: abetlen/llama-cpp-python


security: fix SSRF in multimodal image URL loading (_load_image)
#2220 opened May 16, 2026 by hoangperry
5 tasks done
fix: improve error message when LlamaModel fails to load
#2187 opened Apr 21, 2026 by Anai-Guo (Contributor)
Add chat template for gemma models
#2183 opened Apr 13, 2026 by C00kieFact0ry
fix: prevent KV cache corruption on SWA/ISWA models + hot-path perf
#2180 opened Apr 12, 2026 by avion23 (Contributor)
perf: vectorize KV cache prefix matching with numpy
#2179 opened Apr 11, 2026 by nausicaalii
4 tasks done
build: disable soname to reduce binary size
#2177 opened Apr 9, 2026 by Bing-su
feat: add reasoning_effort to chat completions API
#2167 opened Mar 30, 2026 by abetlen (Owner)
ci: refactor cpu wheel build workflow
#2164 opened Mar 26, 2026 by Bing-su
feat: support Granite-Docling model
#2109 opened Jan 4, 2026 by dhdaines
Include x64 directory for CUDA DLLs on Windows
#2083 opened Oct 24, 2025 by ajparsons
Better Qwen2.5-VL chat template.
#2066 opened Sep 7, 2025 by alcoftTAO (Contributor)
Improve error message when model file is missing
#2041 opened Jul 9, 2025 by NITHIN0710
ARM Runners support CUDA SBSA
#2039 opened Jul 7, 2025 by johnnynunez