| og:image:alt | Bindings were 5 months outdated, preventing newer model architectures from loading.
Updates bindings to llama.cpp commit be47fb92 (2026-01-01).
Removed 14 llama_kv_self_* functions (use llama_memo... |
| avion23 | https://patch-diff.githubusercontent.com/avion23 |
| abetlen:main | https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/tree/main |
| avion23:update-llama-cpp-2026-01 | https://patch-diff.githubusercontent.com/avion23/llama-cpp-python/tree/update-llama-cpp-2026-01 |
|
Update to llama.cpp 2026-01-01
| https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/pull/2108#top |
|
Conversation
26
| https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/pull/2108 |
|
Commits
2
| https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/pull/2108/commits |
|
Checks
0
| https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/pull/2108/checks |
|
Files changed
9
| https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/pull/2108/files |
|
| https://patch-diff.githubusercontent.com/avion23 |
| avion23 | https://patch-diff.githubusercontent.com/avion23 |
| Jan 1, 2026 | https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/pull/2108#issue-3774978292 |
| https://patch-diff.githubusercontent.com/avion23 |
| avion23 | https://patch-diff.githubusercontent.com/avion23 |
| January 1, 2026 19:40 | https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/pull/2108#event-21820692484 |
| https://patch-diff.githubusercontent.com/avion23 |
| avion23 | https://patch-diff.githubusercontent.com/avion23 |
| force-pushed | https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/compare/502532a510c6a039a88f1d9514f73ecfe6d0b497..23c10e80b657f2395e510674b5564308032089d3 |
| 502532a | https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/commit/502532a510c6a039a88f1d9514f73ecfe6d0b497 |
| 23c10e8 | https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/commit/23c10e80b657f2395e510674b5564308032089d3 |
|
Compare
| https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/compare/502532a510c6a039a88f1d9514f73ecfe6d0b497..23c10e80b657f2395e510674b5564308032089d3 |
| January 1, 2026 19:50 | https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/pull/2108#event-21820745228 |
| https://patch-diff.githubusercontent.com/avion23 |
| avion23 | https://patch-diff.githubusercontent.com/avion23 |
| January 1, 2026 19:52 | https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/pull/2108#event-21820754374 |
| https://patch-diff.githubusercontent.com/avion23 |
| avion23 | https://patch-diff.githubusercontent.com/avion23 |
| Jan 1, 2026 | https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/pull/2108#issuecomment-3704063057 |
| https://github.com/avion23/llama-cpp-python.git@update-llama-cpp-2026-01 | https://github.com/avion23/llama-cpp-python.git@update-llama-cpp-2026-01 |
| https://patch-diff.githubusercontent.com/dhdaines |
| dhdaines | https://patch-diff.githubusercontent.com/dhdaines |
| Jan 4, 2026 | https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/pull/2108#issuecomment-3707592463 |
| https://patch-diff.githubusercontent.com/dhdaines |
| dhdaines | https://patch-diff.githubusercontent.com/dhdaines |
| Jan 4, 2026 | https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/pull/2108#issuecomment-3707602712 |
| https://patch-diff.githubusercontent.com/dhdaines |
| dhdaines | https://patch-diff.githubusercontent.com/dhdaines |
| Jan 4, 2026 | https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/pull/2108#issuecomment-3707688779 |
| https://patch-diff.githubusercontent.com/dhdaines |
| dhdaines | https://patch-diff.githubusercontent.com/dhdaines |
|
Jan 4, 2026
| https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/pull/2108#ref-pullrequest-3778855848 |
|
feat: support Granite-Docling model
#2109
| https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/pull/2109 |
| https://patch-diff.githubusercontent.com/avion23 |
| avion23 | https://patch-diff.githubusercontent.com/avion23 |
| force-pushed | https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/compare/23c10e80b657f2395e510674b5564308032089d3..a070f613e1622c1ca2e1ee3af97267e700b46a4e |
| 23c10e8 | https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/commit/23c10e80b657f2395e510674b5564308032089d3 |
| a070f61 | https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/commit/a070f613e1622c1ca2e1ee3af97267e700b46a4e |
|
Compare
| https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/compare/23c10e80b657f2395e510674b5564308032089d3..a070f613e1622c1ca2e1ee3af97267e700b46a4e |
| January 4, 2026 12:05 | https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/pull/2108#event-21841063731 |
| https://patch-diff.githubusercontent.com/dhdaines |
| dhdaines | https://patch-diff.githubusercontent.com/dhdaines |
|
Jan 4, 2026
| https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/pull/2108#pullrequestreview-3624994808 |
|
View reviewed changes
| https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/pull/2108/files |
| llama_cpp/llama_cpp.py | https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/pull/2108/files#diff-9184e090a770a03ec97535fbef520d03252b635dafbed7fa99e59a5cca569fbc |
| dhdaines | https://patch-diff.githubusercontent.com/dhdaines |
| Jan 4, 2026 | https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/pull/2108#discussion_r2659648819 |
| dhdaines | https://patch-diff.githubusercontent.com/dhdaines |
| Jan 4, 2026 | https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/pull/2108#discussion_r2660020036 |
| avion23 | https://patch-diff.githubusercontent.com/avion23 |
| Jan 5, 2026 | https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/pull/2108#discussion_r2660995558 |
| https://patch-diff.githubusercontent.com/avion23 |
| avion23 | https://patch-diff.githubusercontent.com/avion23 |
| January 4, 2026 13:41 | https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/pull/2108#event-21841452107 |
| https://patch-diff.githubusercontent.com/avion23 |
| avion23 | https://patch-diff.githubusercontent.com/avion23 |
| force-pushed | https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/compare/504229647d78f05991be990a15ef50bc156b05cb..d14a24f7ec5edbc2a24ab65ca89583b1b3763ef2 |
| 5042296 | https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/commit/504229647d78f05991be990a15ef50bc156b05cb |
| d14a24f | https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/commit/d14a24f7ec5edbc2a24ab65ca89583b1b3763ef2 |
|
Compare
| https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/compare/504229647d78f05991be990a15ef50bc156b05cb..d14a24f7ec5edbc2a24ab65ca89583b1b3763ef2 |
| January 4, 2026 13:42 | https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/pull/2108#event-21841455871 |
| https://patch-diff.githubusercontent.com/avion23 |
| avion23 | https://patch-diff.githubusercontent.com/avion23 |
| Jan 4, 2026 | https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/pull/2108#issuecomment-3708098993 |
| @dhdaines | https://github.com/dhdaines |
| https://patch-diff.githubusercontent.com/avion23 |
| avion23 | https://patch-diff.githubusercontent.com/avion23 |
| force-pushed | https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/compare/64b087c265002f75920f24eb11487a0932bf28e1..3ffec023c59661bcd62c53334a42dc8f165c9d97 |
| 64b087c | https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/commit/64b087c265002f75920f24eb11487a0932bf28e1 |
| 3ffec02 | https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/commit/3ffec023c59661bcd62c53334a42dc8f165c9d97 |
|
Compare
| https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/compare/64b087c265002f75920f24eb11487a0932bf28e1..3ffec023c59661bcd62c53334a42dc8f165c9d97 |
| January 5, 2026 10:18 | https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/pull/2108#event-21850878200 |
| https://patch-diff.githubusercontent.com/avion23 |
| avion23 | https://patch-diff.githubusercontent.com/avion23 |
| Jan 5, 2026 | https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/pull/2108#issuecomment-3709815850 |
| https://patch-diff.githubusercontent.com/avion23 |
| avion23 | https://patch-diff.githubusercontent.com/avion23 |
| force-pushed | https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/compare/6dbddacdc5ca540814988bdf6fe10699bf38b0dd..39a2ee8413d5ce41d25d9ef95246b13bd8370ffa |
| 6dbddac | https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/commit/6dbddacdc5ca540814988bdf6fe10699bf38b0dd |
| 39a2ee8 | https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/commit/39a2ee8413d5ce41d25d9ef95246b13bd8370ffa |
|
Compare
| https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/compare/6dbddacdc5ca540814988bdf6fe10699bf38b0dd..39a2ee8413d5ce41d25d9ef95246b13bd8370ffa |
| January 5, 2026 14:35 | https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/pull/2108#event-21855472431 |
| https://patch-diff.githubusercontent.com/dhdaines |
| dhdaines | https://patch-diff.githubusercontent.com/dhdaines |
| Jan 5, 2026 | https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/pull/2108#issuecomment-3711670884 |
| @abetlen | https://github.com/abetlen |
| https://patch-diff.githubusercontent.com/avion23 |
| avion23 | https://patch-diff.githubusercontent.com/avion23 |
| force-pushed | https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/compare/39a2ee8413d5ce41d25d9ef95246b13bd8370ffa..103f671b53c3b44cfd4c940c305492bb114fabec |
| 39a2ee8 | https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/commit/39a2ee8413d5ce41d25d9ef95246b13bd8370ffa |
| 103f671 | https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/commit/103f671b53c3b44cfd4c940c305492bb114fabec |
|
Compare
| https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/compare/39a2ee8413d5ce41d25d9ef95246b13bd8370ffa..103f671b53c3b44cfd4c940c305492bb114fabec |
| January 6, 2026 19:17 | https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/pull/2108#event-21883271058 |
| https://patch-diff.githubusercontent.com/avion23 |
| avion23 | https://patch-diff.githubusercontent.com/avion23 |
| January 6, 2026 19:22 | https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/pull/2108#event-21883357095 |
| https://patch-diff.githubusercontent.com/avion23 |
| avion23 | https://patch-diff.githubusercontent.com/avion23 |
| Jan 6, 2026 | https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/pull/2108#issuecomment-3715995395 |
| https://patch-diff.githubusercontent.com/oss-roettger |
| oss-roettger | https://patch-diff.githubusercontent.com/oss-roettger |
| Jan 8, 2026 | https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/pull/2108#issuecomment-3724301390 |
| llama-cli.txt | https://github.com/user-attachments/files/24498100/llama-cli.txt |
| build.txt | https://github.com/user-attachments/files/24498101/build.txt |
| https://github.com/avion23/llama-cpp-python@update-llama-cpp-2026-01 | https://github.com/avion23/llama-cpp-python@update-llama-cpp-2026-01 |
| https://patch-diff.githubusercontent.com/oss-roettger |
| oss-roettger | https://patch-diff.githubusercontent.com/oss-roettger |
| Jan 9, 2026 | https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/pull/2108#issuecomment-3728560879 |
| @abetlen | https://github.com/abetlen |
| https://patch-diff.githubusercontent.com/avion23 |
| avion23 | https://patch-diff.githubusercontent.com/avion23 |
| force-pushed | https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/compare/e3516426647449879a1a8f2ccba026fe86eb9a28..235a3d42785ee9597610b50244822f73ee27a174 |
| e351642 | https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/commit/e3516426647449879a1a8f2ccba026fe86eb9a28 |
| 235a3d4 | https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/commit/235a3d42785ee9597610b50244822f73ee27a174 |
|
Compare
| https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/compare/e3516426647449879a1a8f2ccba026fe86eb9a28..235a3d42785ee9597610b50244822f73ee27a174 |
| January 12, 2026 06:03 | https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/pull/2108#event-21981154364 |
| https://patch-diff.githubusercontent.com/avion23 |
| avion23 | https://patch-diff.githubusercontent.com/avion23 |
| Jan 12, 2026 | https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/pull/2108#issuecomment-3737010178 |
| @oss-roettger | https://github.com/oss-roettger |
| https://patch-diff.githubusercontent.com/avion23 |
| avion23 | https://patch-diff.githubusercontent.com/avion23 |
| force-pushed | https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/compare/235a3d42785ee9597610b50244822f73ee27a174..17aae47ec6468e977ee8819e833da9f9fe393e54 |
| 235a3d4 | https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/commit/235a3d42785ee9597610b50244822f73ee27a174 |
| 17aae47 | https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/commit/17aae47ec6468e977ee8819e833da9f9fe393e54 |
|
Compare
| https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/compare/235a3d42785ee9597610b50244822f73ee27a174..17aae47ec6468e977ee8819e833da9f9fe393e54 |
| January 12, 2026 06:35 | https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/pull/2108#event-21981526201 |
| https://patch-diff.githubusercontent.com/oss-roettger |
| oss-roettger | https://patch-diff.githubusercontent.com/oss-roettger |
| Jan 12, 2026 | https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/pull/2108#issuecomment-3738101612 |
| @avion23 | https://github.com/avion23 |
| https://github.com/avion23/llama-cpp-python@update-llama-cpp-2026-01 | https://github.com/avion23/llama-cpp-python@update-llama-cpp-2026-01 |
| https://huggingface.co/ggml-org/Nemotron-Nano-3-30B-A3B-GGUF | https://huggingface.co/ggml-org/Nemotron-Nano-3-30B-A3B-GGUF |
| Llama_test.txt | https://github.com/user-attachments/files/24562603/Llama_test.txt |
| log.txt | https://github.com/user-attachments/files/24565445/log.txt |
| https://patch-diff.githubusercontent.com/avion23 |
| avion23 | https://patch-diff.githubusercontent.com/avion23 |
| Jan 12, 2026 | https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/pull/2108#issuecomment-3738502128 |
| abetlen/llama-cpp-python#2108 | https://github.com/abetlen/llama-cpp-python/pull/2108 |
| #2108 (comment) | https://github.com/abetlen/llama-cpp-python/pull/2108#issuecomment-3738101612 |
| @avion23 | https://github.com/avion23 |
| https://github.com/avion23 | https://github.com/avion23 |
| https://huggingface.co/ggml-org/Nemotron-Nano-3-30B-A3B-GGUF | https://huggingface.co/ggml-org/Nemotron-Nano-3-30B-A3B-GGUF |
| https://github.com/user-attachments/files/24562603/Llama_test.txt | https://github.com/user-attachments/files/24562603/Llama_test.txt |
| https://patch-diff.githubusercontent.com/oss-roettger |
| oss-roettger | https://patch-diff.githubusercontent.com/oss-roettger |
| Jan 12, 2026 | https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/pull/2108#issuecomment-3738688920 |
| log.txt | https://github.com/user-attachments/files/24565445/log.txt |
| Llama_test.txt | https://github.com/user-attachments/files/24562603/Llama_test.txt |
| Update llama.cpp to 2026-01-01 | https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/pull/2108/commits/831dbe5f7c43e8508b9f53fc13d4a38edb94d2db |
| 831dbe5 | https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/pull/2108/commits/831dbe5f7c43e8508b9f53fc13d4a38edb94d2db |
| https://patch-diff.githubusercontent.com/avion23 |
| avion23 | https://patch-diff.githubusercontent.com/avion23 |
| force-pushed | https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/compare/17aae47ec6468e977ee8819e833da9f9fe393e54..831dbe5f7c43e8508b9f53fc13d4a38edb94d2db |
| 17aae47 | https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/commit/17aae47ec6468e977ee8819e833da9f9fe393e54 |
| 831dbe5 | https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/commit/831dbe5f7c43e8508b9f53fc13d4a38edb94d2db |
|
Compare
| https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/compare/17aae47ec6468e977ee8819e833da9f9fe393e54..831dbe5f7c43e8508b9f53fc13d4a38edb94d2db |
| January 13, 2026 03:19 | https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/pull/2108#event-22005015137 |
| https://patch-diff.githubusercontent.com/avion23 |
| avion23 | https://patch-diff.githubusercontent.com/avion23 |
| Jan 13, 2026 | https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/pull/2108#issuecomment-3741652207 |
| https://patch-diff.githubusercontent.com/avion23 |
| avion23 | https://patch-diff.githubusercontent.com/avion23 |
| Jan 13, 2026 | https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/pull/2108#issuecomment-3742364530 |
| @oss-roettger | https://github.com/oss-roettger |
| 831dbe5 | https://github.com/abetlen/llama-cpp-python/commit/831dbe5f7c43e8508b9f53fc13d4a38edb94d2db |
| https://patch-diff.githubusercontent.com/oss-roettger |
| oss-roettger | https://patch-diff.githubusercontent.com/oss-roettger |
| Jan 13, 2026 | https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/pull/2108#issuecomment-3743966596 |
| @avion23 | https://github.com/avion23 |
| test2026-01-13.txt | https://github.com/user-attachments/files/24588751/test2026-01-13.txt |
| https://huggingface.co/unsloth/Nemotron-3-Nano-30B-A3B-GGUF/blob/main/Nemotron-3-Nano-30B-A3B-Q4_K_M.gguf | https://huggingface.co/unsloth/Nemotron-3-Nano-30B-A3B-GGUF/blob/main/Nemotron-3-Nano-30B-A3B-Q4_K_M.gguf |
| https://huggingface.co/ggml-org/Nemotron-Nano-3-30B-A3B-GGUF/blob/main/Nemotron-Nano-3-30B-A3B-Q4_K_M.gguf | https://huggingface.co/ggml-org/Nemotron-Nano-3-30B-A3B-GGUF/blob/main/Nemotron-Nano-3-30B-A3B-Q4_K_M.gguf |
| test2026-01-13.txt | https://github.com/user-attachments/files/24588751/test2026-01-13.txt |
| https://huggingface.co/bartowski/openai_gpt-oss-20b-GGUF?show_file_info=openai_gpt-oss-20b-Q4_K_M.gguf | https://huggingface.co/bartowski/openai_gpt-oss-20b-GGUF?show_file_info=openai_gpt-oss-20b-Q4_K_M.gguf |
| https://huggingface.co/Face314/Qwen3-30B-A3B-Instruct-2507-MXFP4_MOE/blob/main/Qwen3-30B-A3B-Instruct-2507-MXFP4_MOE.gguf | https://huggingface.co/Face314/Qwen3-30B-A3B-Instruct-2507-MXFP4_MOE/blob/main/Qwen3-30B-A3B-Instruct-2507-MXFP4_MOE.gguf |
| https://huggingface.co/bartowski/google_gemma-3-27b-it-GGUF?show_file_info=google_gemma-3-27b-it-Q4_K_M.gguf | https://huggingface.co/bartowski/google_gemma-3-27b-it-GGUF?show_file_info=google_gemma-3-27b-it-Q4_K_M.gguf |
| dhdaines | https://patch-diff.githubusercontent.com/dhdaines |
| Jan 13, 2026 | https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/pull/2108#issuecomment-3744350759 |
| #2109 | https://github.com/abetlen/llama-cpp-python/pull/2109 |
| https://huggingface.co/ggml-org/granite-docling-258M-GGUF | https://huggingface.co/ggml-org/granite-docling-258M-GGUF |
| https://huggingface.co/ggml-org/SmolVLM-256M-Instruct-GGUF | https://huggingface.co/ggml-org/SmolVLM-256M-Instruct-GGUF |
| avion23 | https://patch-diff.githubusercontent.com/avion23 |
| Jan 13, 2026 | https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/pull/2108#issuecomment-3744376999 |
| avion23 | https://patch-diff.githubusercontent.com/avion23 |
| January 13, 2026 13:34 | https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/pull/2108#event-22016918449 |
| fix: critical fixes for recurrent/hybrid model support | https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/pull/2108/commits/f42739945a70b9b592fc0e81a4b2ea4beebd4c50 |
| f427399 | https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/pull/2108/commits/f42739945a70b9b592fc0e81a4b2ea4beebd4c50 |
| abetlen#2108 | https://github.com/abetlen/llama-cpp-python/pull/2108 |
| abetlen#2109 | https://github.com/abetlen/llama-cpp-python/pull/2109 |
| avion23 | https://patch-diff.githubusercontent.com/avion23 |
| Jan 14, 2026 | https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/pull/2108#issuecomment-3747390686 |
| #2109 | https://github.com/abetlen/llama-cpp-python/pull/2109 |
| avion23 | https://patch-diff.githubusercontent.com/avion23 |
| January 14, 2026 02:21 | https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/pull/2108#event-22031800881 |
| oss-roettger | https://patch-diff.githubusercontent.com/oss-roettger |
| Jan 14, 2026 | https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/pull/2108#issuecomment-3749642451 |
| @avion23 | https://github.com/avion23 |
| https://huggingface.co/unsloth/Nemotron-3-Nano-30B-A3B-GGUF/blob/main/Nemotron-3-Nano-30B-A3B-Q4_K_M.gguf | https://huggingface.co/unsloth/Nemotron-3-Nano-30B-A3B-GGUF/blob/main/Nemotron-3-Nano-30B-A3B-Q4_K_M.gguf |
| https://huggingface.co/ggml-org/Nemotron-Nano-3-30B-A3B-GGUF/blob/main/Nemotron-Nano-3-30B-A3B-Q4_K_M.gguf | https://huggingface.co/ggml-org/Nemotron-Nano-3-30B-A3B-GGUF/blob/main/Nemotron-Nano-3-30B-A3B-Q4_K_M.gguf |
| https://huggingface.co/bartowski/openai_gpt-oss-20b-GGUF?show_file_info=openai_gpt-oss-20b-Q4_K_M.gguf | https://huggingface.co/bartowski/openai_gpt-oss-20b-GGUF?show_file_info=openai_gpt-oss-20b-Q4_K_M.gguf |
| https://huggingface.co/Face314/Qwen3-30B-A3B-Instruct-2507-MXFP4_MOE/blob/main/Qwen3-30B-A3B-Instruct-2507-MXFP4_MOE.gguf | https://huggingface.co/Face314/Qwen3-30B-A3B-Instruct-2507-MXFP4_MOE/blob/main/Qwen3-30B-A3B-Instruct-2507-MXFP4_MOE.gguf |
| https://huggingface.co/bartowski/google_gemma-3-27b-it-GGUF?show_file_info=google_gemma-3-27b-it-Q4_K_M.gguf | https://huggingface.co/bartowski/google_gemma-3-27b-it-GGUF?show_file_info=google_gemma-3-27b-it-Q4_K_M.gguf |
| https://huggingface.co/mradermacher/Ling-mini-2.0-GGUF?show_file_info=Ling-mini-2.0.Q4_K_M.gguf | https://huggingface.co/mradermacher/Ling-mini-2.0-GGUF?show_file_info=Ling-mini-2.0.Q4_K_M.gguf |
| @abetlen | https://github.com/abetlen |
| @avion23 | https://github.com/avion23 |
| avion23 | https://patch-diff.githubusercontent.com/avion23 |
| Jan 19, 2026 | https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/pull/2108#issuecomment-3767121051 |
| @abetlen | https://github.com/abetlen |
| antheas | https://patch-diff.githubusercontent.com/antheas |
| Jan 19, 2026 | https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/pull/2108#ref-issue-3830069643 |
| On GB10 Spark, llama cpp generation fails with "No next state found" (dottxt-ai/outlines#1812) | https://patch-diff.githubusercontent.com/dottxt-ai/outlines/issues/1812 |
| antheas | https://patch-diff.githubusercontent.com/antheas |
| Jan 19, 2026 | https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/pull/2108#issuecomment-3768910448 |
| @avion23 | https://github.com/avion23 |
| dottxt-ai/outlines#1812 | https://github.com/dottxt-ai/outlines/issues/1812 |
| dhdaines | https://patch-diff.githubusercontent.com/dhdaines |
| https://patch-diff.githubusercontent.com/abetlen/llama-cpp-python/pull/2108/files/1f0241ea9e17d98660064c7879273010f68e24f2 |
| avion23 | https://patch-diff.githubusercontent.com/avion23 |
| dhdaines | https://patch-diff.githubusercontent.com/dhdaines |
| oss-roettger | https://patch-diff.githubusercontent.com/oss-roettger |
| antheas | https://patch-diff.githubusercontent.com/antheas |