René's URL Explorer Experiment


Title: GitHub - imsizon/llama-cpp-python: Python bindings for llama.cpp

Open Graph Title: GitHub - imsizon/llama-cpp-python: Python bindings for llama.cpp

X Title: GitHub - imsizon/llama-cpp-python: Python bindings for llama.cpp

Description: Python bindings for llama.cpp. Contribute to imsizon/llama-cpp-python development by creating an account on GitHub.

Open Graph Description: Python bindings for llama.cpp. Contribute to imsizon/llama-cpp-python development by creating an account on GitHub.

X Description: Python bindings for llama.cpp. Contribute to imsizon/llama-cpp-python development by creating an account on GitHub.

Opengraph URL: https://github.com/imsizon/llama-cpp-python

X: @github

direct link

Domain: patch-diff.githubusercontent.com

route-pattern/:user_id/:repository
route-controllerfiles
route-actiondisambiguate
fetch-noncev2:1b99e3b4-c672-4fb2-ec5c-8ad6152ac8ea
current-catalog-service-hashf3abb0cc802f3d7b95fc8762b94bdcb13bf39634c40c357301c4aa1d67a256fb
request-id8F90:5E3C:2876658:33E55DD:6975FA66
html-safe-nonce237cd4e9f481bb62aa6873a9e5f36bd1e49e55e9b29b13b4e1b86b4e7bddd03b
visitor-payloadeyJyZWZlcnJlciI6IiIsInJlcXVlc3RfaWQiOiI4RjkwOjVFM0M6Mjg3NjY1ODozM0U1NUREOjY5NzVGQTY2IiwidmlzaXRvcl9pZCI6Ijc4NDMxNDExMjUwNDMyNTU5MTAiLCJyZWdpb25fZWRnZSI6ImlhZCIsInJlZ2lvbl9yZW5kZXIiOiJpYWQifQ==
visitor-hmac04370489e70937715fb03d1eee71fedff7683398bfd13c623c216d8a0c53f9c7
hovercard-subject-tagrepository:1140367271
github-keyboard-shortcutsrepository,copilot
google-site-verificationApib7-x98H0j5cPqHWwSMm6dNU4GmODRoqxLiDzdx9I
octolytics-urlhttps://collector.github.com/github/collect
analytics-location//
fb:app_id1401488693436528
apple-itunes-appapp-id=1477376905, app-argument=https://github.com/imsizon/llama-cpp-python
twitter:imagehttps://opengraph.githubassets.com/dc5e80fb92d99d697c04b239914925aec67bdf1ffefbc82641d5a71fa6902e12/imsizon/llama-cpp-python
twitter:cardsummary_large_image
og:imagehttps://opengraph.githubassets.com/dc5e80fb92d99d697c04b239914925aec67bdf1ffefbc82641d5a71fa6902e12/imsizon/llama-cpp-python
og:image:altPython bindings for llama.cpp. Contribute to imsizon/llama-cpp-python development by creating an account on GitHub.
og:image:width1200
og:image:height600
og:site_nameGitHub
og:typeobject
hostnamegithub.com
expected-hostnamegithub.com
Nonec6814b4cc7afd45cd6e64525d0cff0e76dd802f315a5b0e55a7abda1d1d070d0
turbo-cache-controlno-preview
go-importgithub.com/imsizon/llama-cpp-python git https://github.com/imsizon/llama-cpp-python.git
octolytics-dimension-user_id872286
octolytics-dimension-user_loginimsizon
octolytics-dimension-repository_id1140367271
octolytics-dimension-repository_nwoimsizon/llama-cpp-python
octolytics-dimension-repository_publictrue
octolytics-dimension-repository_is_forktrue
octolytics-dimension-repository_parent_id920296412
octolytics-dimension-repository_parent_nwoJamePeng/llama-cpp-python
octolytics-dimension-repository_network_root_id617868717
octolytics-dimension-repository_network_root_nwoabetlen/llama-cpp-python
turbo-body-classeslogged-out env-production page-responsive
disable-turbofalse
browser-stats-urlhttps://api.github.com/_private/browser/stats
browser-errors-urlhttps://api.github.com/_private/browser/errors
release4ea235bfed58ef16c8a5642b3ac64b74f10c9f52
ui-targetcanary-1
theme-color#1e2327
color-schemelight dark

Links:

Skip to contenthttps://patch-diff.githubusercontent.com/imsizon/llama-cpp-python#start-of-content
https://patch-diff.githubusercontent.com/
Sign in https://patch-diff.githubusercontent.com/login?return_to=https%3A%2F%2Fgithub.com%2Fimsizon%2Fllama-cpp-python
GitHub CopilotWrite better code with AIhttps://github.com/features/copilot
GitHub SparkBuild and deploy intelligent appshttps://github.com/features/spark
GitHub ModelsManage and compare promptshttps://github.com/features/models
MCP RegistryNewIntegrate external toolshttps://github.com/mcp
ActionsAutomate any workflowhttps://github.com/features/actions
CodespacesInstant dev environmentshttps://github.com/features/codespaces
IssuesPlan and track workhttps://github.com/features/issues
Code ReviewManage code changeshttps://github.com/features/code-review
GitHub Advanced SecurityFind and fix vulnerabilitieshttps://github.com/security/advanced-security
Code securitySecure your code as you buildhttps://github.com/security/advanced-security/code-security
Secret protectionStop leaks before they starthttps://github.com/security/advanced-security/secret-protection
Why GitHubhttps://github.com/why-github
Documentationhttps://docs.github.com
Bloghttps://github.blog
Changeloghttps://github.blog/changelog
Marketplacehttps://github.com/marketplace
View all featureshttps://github.com/features
Enterpriseshttps://github.com/enterprise
Small and medium teamshttps://github.com/team
Startupshttps://github.com/enterprise/startups
Nonprofitshttps://github.com/solutions/industry/nonprofits
App Modernizationhttps://github.com/solutions/use-case/app-modernization
DevSecOpshttps://github.com/solutions/use-case/devsecops
DevOpshttps://github.com/solutions/use-case/devops
CI/CDhttps://github.com/solutions/use-case/ci-cd
View all use caseshttps://github.com/solutions/use-case
Healthcarehttps://github.com/solutions/industry/healthcare
Financial serviceshttps://github.com/solutions/industry/financial-services
Manufacturinghttps://github.com/solutions/industry/manufacturing
Governmenthttps://github.com/solutions/industry/government
View all industrieshttps://github.com/solutions/industry
View all solutionshttps://github.com/solutions
AIhttps://github.com/resources/articles?topic=ai
Software Developmenthttps://github.com/resources/articles?topic=software-development
DevOpshttps://github.com/resources/articles?topic=devops
Securityhttps://github.com/resources/articles?topic=security
View all topicshttps://github.com/resources/articles
Customer storieshttps://github.com/customer-stories
Events & webinarshttps://github.com/resources/events
Ebooks & reportshttps://github.com/resources/whitepapers
Business insightshttps://github.com/solutions/executive-insights
GitHub Skillshttps://skills.github.com
Documentationhttps://docs.github.com
Customer supporthttps://support.github.com
Community forumhttps://github.com/orgs/community/discussions
Trust centerhttps://github.com/trust-center
Partnershttps://github.com/partners
GitHub SponsorsFund open source developershttps://github.com/sponsors
Security Labhttps://securitylab.github.com
Maintainer Communityhttps://maintainers.github.com
Acceleratorhttps://github.com/accelerator
Archive Programhttps://archiveprogram.github.com
Topicshttps://github.com/topics
Trendinghttps://github.com/trending
Collectionshttps://github.com/collections
Enterprise platformAI-powered developer platformhttps://github.com/enterprise
GitHub Advanced SecurityEnterprise-grade security featureshttps://github.com/security/advanced-security
Copilot for BusinessEnterprise-grade AI featureshttps://github.com/features/copilot/copilot-business
Premium SupportEnterprise-grade 24/7 supporthttps://github.com/premium-support
Pricinghttps://github.com/pricing
Search syntax tipshttps://docs.github.com/search-github/github-code-search/understanding-github-code-search-syntax
documentationhttps://docs.github.com/search-github/github-code-search/understanding-github-code-search-syntax
Sign in https://patch-diff.githubusercontent.com/login?return_to=https%3A%2F%2Fgithub.com%2Fimsizon%2Fllama-cpp-python
Sign up https://patch-diff.githubusercontent.com/signup?ref_cta=Sign+up&ref_loc=header+logged+out&ref_page=%2F%3Cuser-name%3E%2F%3Crepo-name%3E&source=header-repo&source_repo=imsizon%2Fllama-cpp-python
Reloadhttps://patch-diff.githubusercontent.com/imsizon/llama-cpp-python
Reloadhttps://patch-diff.githubusercontent.com/imsizon/llama-cpp-python
Reloadhttps://patch-diff.githubusercontent.com/imsizon/llama-cpp-python
imsizon https://patch-diff.githubusercontent.com/imsizon
llama-cpp-pythonhttps://patch-diff.githubusercontent.com/imsizon/llama-cpp-python
JamePeng/llama-cpp-pythonhttps://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python
Notifications https://patch-diff.githubusercontent.com/login?return_to=%2Fimsizon%2Fllama-cpp-python
Fork 0 https://patch-diff.githubusercontent.com/login?return_to=%2Fimsizon%2Fllama-cpp-python
Star 0 https://patch-diff.githubusercontent.com/login?return_to=%2Fimsizon%2Fllama-cpp-python
llama-cpp-python.readthedocs.iohttps://llama-cpp-python.readthedocs.io
MIT license https://patch-diff.githubusercontent.com/imsizon/llama-cpp-python/blob/main/LICENSE.md
0 stars https://patch-diff.githubusercontent.com/imsizon/llama-cpp-python/stargazers
1.3k forks https://patch-diff.githubusercontent.com/imsizon/llama-cpp-python/forks
Branches https://patch-diff.githubusercontent.com/imsizon/llama-cpp-python/branches
Tags https://patch-diff.githubusercontent.com/imsizon/llama-cpp-python/tags
Activity https://patch-diff.githubusercontent.com/imsizon/llama-cpp-python/activity
Star https://patch-diff.githubusercontent.com/login?return_to=%2Fimsizon%2Fllama-cpp-python
Notifications https://patch-diff.githubusercontent.com/login?return_to=%2Fimsizon%2Fllama-cpp-python
Code https://patch-diff.githubusercontent.com/imsizon/llama-cpp-python
Pull requests 0 https://patch-diff.githubusercontent.com/imsizon/llama-cpp-python/pulls
Actions https://patch-diff.githubusercontent.com/imsizon/llama-cpp-python/actions
Projects 0 https://patch-diff.githubusercontent.com/imsizon/llama-cpp-python/projects
Security 0 https://patch-diff.githubusercontent.com/imsizon/llama-cpp-python/security
Insights https://patch-diff.githubusercontent.com/imsizon/llama-cpp-python/pulse
Code https://patch-diff.githubusercontent.com/imsizon/llama-cpp-python
Pull requests https://patch-diff.githubusercontent.com/imsizon/llama-cpp-python/pulls
Actions https://patch-diff.githubusercontent.com/imsizon/llama-cpp-python/actions
Projects https://patch-diff.githubusercontent.com/imsizon/llama-cpp-python/projects
Security https://patch-diff.githubusercontent.com/imsizon/llama-cpp-python/security
Insights https://patch-diff.githubusercontent.com/imsizon/llama-cpp-python/pulse
Brancheshttps://patch-diff.githubusercontent.com/imsizon/llama-cpp-python/branches
Tagshttps://patch-diff.githubusercontent.com/imsizon/llama-cpp-python/tags
https://patch-diff.githubusercontent.com/imsizon/llama-cpp-python/branches
https://patch-diff.githubusercontent.com/imsizon/llama-cpp-python/tags
2,311 Commitshttps://patch-diff.githubusercontent.com/imsizon/llama-cpp-python/commits/main/
https://patch-diff.githubusercontent.com/imsizon/llama-cpp-python/commits/main/
.githubhttps://patch-diff.githubusercontent.com/imsizon/llama-cpp-python/tree/main/.github
.githubhttps://patch-diff.githubusercontent.com/imsizon/llama-cpp-python/tree/main/.github
dockerhttps://patch-diff.githubusercontent.com/imsizon/llama-cpp-python/tree/main/docker
dockerhttps://patch-diff.githubusercontent.com/imsizon/llama-cpp-python/tree/main/docker
docshttps://patch-diff.githubusercontent.com/imsizon/llama-cpp-python/tree/main/docs
docshttps://patch-diff.githubusercontent.com/imsizon/llama-cpp-python/tree/main/docs
exampleshttps://patch-diff.githubusercontent.com/imsizon/llama-cpp-python/tree/main/examples
exampleshttps://patch-diff.githubusercontent.com/imsizon/llama-cpp-python/tree/main/examples
llama_cpphttps://patch-diff.githubusercontent.com/imsizon/llama-cpp-python/tree/main/llama_cpp
llama_cpphttps://patch-diff.githubusercontent.com/imsizon/llama-cpp-python/tree/main/llama_cpp
scriptshttps://patch-diff.githubusercontent.com/imsizon/llama-cpp-python/tree/main/scripts
scriptshttps://patch-diff.githubusercontent.com/imsizon/llama-cpp-python/tree/main/scripts
testshttps://patch-diff.githubusercontent.com/imsizon/llama-cpp-python/tree/main/tests
testshttps://patch-diff.githubusercontent.com/imsizon/llama-cpp-python/tree/main/tests
vendorhttps://patch-diff.githubusercontent.com/imsizon/llama-cpp-python/tree/main/vendor
vendorhttps://patch-diff.githubusercontent.com/imsizon/llama-cpp-python/tree/main/vendor
.dockerignorehttps://patch-diff.githubusercontent.com/imsizon/llama-cpp-python/blob/main/.dockerignore
.dockerignorehttps://patch-diff.githubusercontent.com/imsizon/llama-cpp-python/blob/main/.dockerignore
.gitignorehttps://patch-diff.githubusercontent.com/imsizon/llama-cpp-python/blob/main/.gitignore
.gitignorehttps://patch-diff.githubusercontent.com/imsizon/llama-cpp-python/blob/main/.gitignore
.gitmoduleshttps://patch-diff.githubusercontent.com/imsizon/llama-cpp-python/blob/main/.gitmodules
.gitmoduleshttps://patch-diff.githubusercontent.com/imsizon/llama-cpp-python/blob/main/.gitmodules
.readthedocs.yamlhttps://patch-diff.githubusercontent.com/imsizon/llama-cpp-python/blob/main/.readthedocs.yaml
.readthedocs.yamlhttps://patch-diff.githubusercontent.com/imsizon/llama-cpp-python/blob/main/.readthedocs.yaml
CHANGELOG.mdhttps://patch-diff.githubusercontent.com/imsizon/llama-cpp-python/blob/main/CHANGELOG.md
CHANGELOG.mdhttps://patch-diff.githubusercontent.com/imsizon/llama-cpp-python/blob/main/CHANGELOG.md
CMakeLists.txthttps://patch-diff.githubusercontent.com/imsizon/llama-cpp-python/blob/main/CMakeLists.txt
CMakeLists.txthttps://patch-diff.githubusercontent.com/imsizon/llama-cpp-python/blob/main/CMakeLists.txt
LICENSE.mdhttps://patch-diff.githubusercontent.com/imsizon/llama-cpp-python/blob/main/LICENSE.md
LICENSE.mdhttps://patch-diff.githubusercontent.com/imsizon/llama-cpp-python/blob/main/LICENSE.md
Makefilehttps://patch-diff.githubusercontent.com/imsizon/llama-cpp-python/blob/main/Makefile
Makefilehttps://patch-diff.githubusercontent.com/imsizon/llama-cpp-python/blob/main/Makefile
README.mdhttps://patch-diff.githubusercontent.com/imsizon/llama-cpp-python/blob/main/README.md
README.mdhttps://patch-diff.githubusercontent.com/imsizon/llama-cpp-python/blob/main/README.md
mkdocs.ymlhttps://patch-diff.githubusercontent.com/imsizon/llama-cpp-python/blob/main/mkdocs.yml
mkdocs.ymlhttps://patch-diff.githubusercontent.com/imsizon/llama-cpp-python/blob/main/mkdocs.yml
pyproject.tomlhttps://patch-diff.githubusercontent.com/imsizon/llama-cpp-python/blob/main/pyproject.toml
pyproject.tomlhttps://patch-diff.githubusercontent.com/imsizon/llama-cpp-python/blob/main/pyproject.toml
READMEhttps://patch-diff.githubusercontent.com/imsizon/llama-cpp-python
Licensehttps://patch-diff.githubusercontent.com/imsizon/llama-cpp-python
https://raw.githubusercontent.com/abetlen/llama-cpp-python/main/docs/icon.svg
llama.cpphttps://github.com/ggml-org/llama.cpp
https://patch-diff.githubusercontent.com/imsizon/llama-cpp-python#python-bindings-for-llamacpp
https://llama-cpp-python.readthedocs.io/en/latest/?badge=latest
https://github.com/abetlen/llama-cpp-python/actions/workflows/test.yaml
https://camo.githubusercontent.com/2a2ae22985121988e6ca3b4037a87d1d674b45b11823d00bd134fc2afcf391b9/68747470733a2f2f696d672e736869656c64732e696f2f6769746875622f762f7461672f4a616d6550656e672f6c6c616d612d6370702d707974686f6e
https://pypi.org/project/llama-cpp-python/
https://pypi.org/project/llama-cpp-python/
https://pepy.tech/projects/llama-cpp-python
https://patch-diff.githubusercontent.com/imsizon/llama-cpp-python/blob/main
llama.cpphttps://github.com/ggml-org/llama.cpp
LangChain compatibilityhttps://python.langchain.com/docs/integrations/llms/llamacpp
LlamaIndex compatibilityhttps://docs.llamaindex.ai/en/stable/examples/llm/llama_2_llama_cpp.html
Local Copilot replacementhttps://llama-cpp-python.readthedocs.io/en/latest/server/#code-completion
Function Calling supporthttps://llama-cpp-python.readthedocs.io/en/latest/server/#function-calling
Vision API supporthttps://llama-cpp-python.readthedocs.io/en/latest/server/#multimodal-models
Multiple Modelshttps://llama-cpp-python.readthedocs.io/en/latest/server/#configuration-and-multi-model-support
https://llama-cpp-python.readthedocs.io/en/latesthttps://llama-cpp-python.readthedocs.io/en/latest
https://patch-diff.githubusercontent.com/imsizon/llama-cpp-python#installation
Visual Studio 2022 Build Toolshttps://download.visualstudio.microsoft.com/download/pr/6efb3484-905b-485c-8b5f-9d3a5f39e731/07908cd6d91e75b8ea4339d8f2cfa6e8d8bb4fd706af7b918ae391cd6fc2a066/vs_BuildTools.exe
https://patch-diff.githubusercontent.com/imsizon/llama-cpp-python#installation-configuration
llama.cpp build docshttps://github.com/ggml-org/llama.cpp/blob/master/docs/build.md
https://patch-diff.githubusercontent.com/imsizon/llama-cpp-python#supported-backends
https://developer.nvidia.com/cuda-toolkit-archivehttps://developer.nvidia.com/cuda-toolkit-archive
https://github.com/JamePeng/llama-cpp-python/releaseshttps://github.com/JamePeng/llama-cpp-python/releases
Vulkan SDKhttps://vulkan.lunarg.com/sdk/home#windows
Getting Started with the Linux Tarball Vulkan SDKhttps://vulkan.lunarg.com/doc/sdk/latest/linux/getting_started.html
https://patch-diff.githubusercontent.com/imsizon/llama-cpp-python#windows-notes
mentioned in llama.cpp repohttps://github.com/ggerganov/llama.cpp#openblas
https://patch-diff.githubusercontent.com/imsizon/llama-cpp-python#macos-notes
docs/install/macos.mdhttps://llama-cpp-python.readthedocs.io/en/latest/install/macos/
https://patch-diff.githubusercontent.com/imsizon/llama-cpp-python#upgrading-and-reinstalling
https://patch-diff.githubusercontent.com/imsizon/llama-cpp-python#high-level-api
API Referencehttps://llama-cpp-python.readthedocs.io/en/latest/api-reference/#high-level-api
Llamahttps://llama-cpp-python.readthedocs.io/en/latest/api-reference/#llama_cpp.Llama
__call__https://llama-cpp-python.readthedocs.io/en/latest/api-reference/#llama_cpp.Llama.__call__
create_completionhttps://llama-cpp-python.readthedocs.io/en/latest/api-reference/#llama_cpp.Llama.create_completion
Llamahttps://llama-cpp-python.readthedocs.io/en/latest/api-reference/#llama_cpp.Llama
https://patch-diff.githubusercontent.com/imsizon/llama-cpp-python#pulling-models-from-hugging-face-hub
from_pretrainedhttps://llama-cpp-python.readthedocs.io/en/latest/api-reference/#llama_cpp.Llama.from_pretrained
from_pretrainedhttps://llama-cpp-python.readthedocs.io/en/latest/api-reference/#llama_cpp.Llama.from_pretrained
huggingface-clihttps://huggingface.co/docs/huggingface_hub/en/guides/cli
https://patch-diff.githubusercontent.com/imsizon/llama-cpp-python#chat-completion
create_chat_completionhttps://llama-cpp-python.readthedocs.io/en/latest/api-reference/#llama_cpp.Llama.create_chat_completion
Llamahttps://llama-cpp-python.readthedocs.io/en/latest/api-reference/#llama_cpp.Llama
create_chat_completion_openai_v1https://llama-cpp-python.readthedocs.io/en/latest/api-reference/#llama_cpp.Llama.create_chat_completion_openai_v1
https://patch-diff.githubusercontent.com/imsizon/llama-cpp-python#json-and-json-schema-mode
create_chat_completionhttps://llama-cpp-python.readthedocs.io/en/latest/api-reference/#llama_cpp.Llama.create_chat_completion
https://patch-diff.githubusercontent.com/imsizon/llama-cpp-python#json-mode
https://patch-diff.githubusercontent.com/imsizon/llama-cpp-python#json-schema-mode
https://patch-diff.githubusercontent.com/imsizon/llama-cpp-python#function-calling
herehttps://huggingface.co/meetkai
https://patch-diff.githubusercontent.com/imsizon/llama-cpp-python#multi-modal-models
llava-v1.5-7bhttps://huggingface.co/mys/ggml_llava-v1.5-7b
llava-v1.5-13bhttps://huggingface.co/mys/ggml_llava-v1.5-13b
llava-v1.6-34bhttps://huggingface.co/cjpais/llava-v1.6-34B-gguf
moondream2https://huggingface.co/vikhyatk/moondream2
nanollavahttps://huggingface.co/abetlen/nanollava-gguf
llama-3-vision-alphahttps://huggingface.co/abetlen/llama-3-vision-alpha-gguf
minicpm-v-2.6https://huggingface.co/openbmb/MiniCPM-V-2_6-gguf
gemma3https://huggingface.co/unsloth/gemma-3-27b-it-GGUF
glm4.1vhttps://huggingface.co/unsloth/GLM-4.1V-9B-Thinking-GGUF
glm4.6vhttps://huggingface.co/unsloth/GLM-4.6V-Flash-GGUF
granite-doclinghttps://huggingface.co/ibm-granite/granite-docling-258M-GGUF
lfm2-vlhttps://huggingface.co/LiquidAI/LFM2-VL-3B-GGUF
qwen2.5-vlhttps://huggingface.co/unsloth/Qwen2.5-VL-3B-Instruct-GGUF
qwen3-vlhttps://huggingface.co/unsloth/Qwen3-VL-8B-Thinking-GGUF
https://patch-diff.githubusercontent.com/imsizon/llama-cpp-python#loading-a-local-image-with-qwen3vlthinkinginstruct
https://patch-diff.githubusercontent.com/imsizon/llama-cpp-python#embeddings--reranking-gguf
https://patch-diff.githubusercontent.com/imsizon/llama-cpp-python#key-features
https://patch-diff.githubusercontent.com/imsizon/llama-cpp-python#support-embeddings--rerank-model
bge-m3-GGUFhttps://huggingface.co/gpustack/bge-m3-GGUF
bge-reranker-v2-m3-GGUFhttps://huggingface.co/gpustack/bge-reranker-v2-m3-GGUF
https://patch-diff.githubusercontent.com/imsizon/llama-cpp-python#todojamepeng-needs-more-extensive-testing-with-various-embedding-and-rerank-models-
https://patch-diff.githubusercontent.com/imsizon/llama-cpp-python#1-text-embeddings-vector-search
https://patch-diff.githubusercontent.com/imsizon/llama-cpp-python#2-reranking-cross-encoder-scoring
https://patch-diff.githubusercontent.com/imsizon/llama-cpp-python#3-normalization
https://patch-diff.githubusercontent.com/imsizon/llama-cpp-python#legacy-usage-deprecated
https://patch-diff.githubusercontent.com/imsizon/llama-cpp-python#speculative-decoding
https://patch-diff.githubusercontent.com/imsizon/llama-cpp-python#adjusting-the-context-window
https://patch-diff.githubusercontent.com/imsizon/llama-cpp-python#openai-compatible-web-server
http://localhost:8000/docshttp://localhost:8000/docs
llama_cpp/llama_chat_format.pyhttps://patch-diff.githubusercontent.com/imsizon/llama-cpp-python/blob/main/llama_cpp/llama_chat_format.py
https://patch-diff.githubusercontent.com/imsizon/llama-cpp-python#web-server-features
Local Copilot replacementhttps://llama-cpp-python.readthedocs.io/en/latest/server/#code-completion
Function Calling supporthttps://llama-cpp-python.readthedocs.io/en/latest/server/#function-calling
Vision API supporthttps://llama-cpp-python.readthedocs.io/en/latest/server/#multimodal-models
Multiple Modelshttps://llama-cpp-python.readthedocs.io/en/latest/server/#configuration-and-multi-model-support
https://patch-diff.githubusercontent.com/imsizon/llama-cpp-python#docker-image
GHCRhttps://ghcr.io/abetlen/llama-cpp-python
Docker on termux (requires root)https://gist.github.com/FreddieOliveira/efe850df7ff3951cb62d74bd770dce27
termux support issuehttps://github.com/abetlen/llama-cpp-python/issues/389
https://patch-diff.githubusercontent.com/imsizon/llama-cpp-python#low-level-api
API Referencehttps://llama-cpp-python.readthedocs.io/en/latest/api-reference/#low-level-api
ctypeshttps://docs.python.org/3/library/ctypes.html
llama_cpp/llama_cpp.pyhttps://github.com/abetlen/llama-cpp-python/blob/master/llama_cpp/llama_cpp.py
llama.hhttps://github.com/ggerganov/llama.cpp/blob/master/llama.h
examples folderhttps://patch-diff.githubusercontent.com/imsizon/llama-cpp-python/blob/main/examples/low_level_api
https://patch-diff.githubusercontent.com/imsizon/llama-cpp-python#documentation
https://llama-cpp-python.readthedocs.io/https://llama-cpp-python.readthedocs.io/
https://patch-diff.githubusercontent.com/imsizon/llama-cpp-python#development
https://patch-diff.githubusercontent.com/imsizon/llama-cpp-python#faq
https://patch-diff.githubusercontent.com/imsizon/llama-cpp-python#are-there-pre-built-binaries--binary-wheels-available
https://patch-diff.githubusercontent.com/imsizon/llama-cpp-python#how-does-this-compare-to-other-python-bindings-of-llamacpp
https://patch-diff.githubusercontent.com/imsizon/llama-cpp-python#oserror-libcudartsoxxcudart64_xxdll-cannot-open-shared-object-file-no-such-file-or-directory
https://patch-diff.githubusercontent.com/imsizon/llama-cpp-python#filenotfounderror-could-not-find-module-like-ggmldll-ggml-cpudll-ggml-cudadll
https://patch-diff.githubusercontent.com/imsizon/llama-cpp-python#why-are-libraries-compiled-by-other-authors-only-around-100mb-while-your-pre-compiled-versions-range-from-300mb-to-900mb
https://patch-diff.githubusercontent.com/imsizon/llama-cpp-python#license
llama-cpp-python.readthedocs.iohttps://llama-cpp-python.readthedocs.io
Readme https://patch-diff.githubusercontent.com/imsizon/llama-cpp-python#readme-ov-file
MIT license https://patch-diff.githubusercontent.com/imsizon/llama-cpp-python#MIT-1-ov-file
Please reload this pagehttps://patch-diff.githubusercontent.com/imsizon/llama-cpp-python
Activityhttps://patch-diff.githubusercontent.com/imsizon/llama-cpp-python/activity
0 starshttps://patch-diff.githubusercontent.com/imsizon/llama-cpp-python/stargazers
0 watchinghttps://patch-diff.githubusercontent.com/imsizon/llama-cpp-python/watchers
0 forkshttps://patch-diff.githubusercontent.com/imsizon/llama-cpp-python/forks
Report repository https://patch-diff.githubusercontent.com/contact/report-content?content_url=https%3A%2F%2Fgithub.com%2Fimsizon%2Fllama-cpp-python&report=imsizon+%28user%29
Releases 1https://patch-diff.githubusercontent.com/imsizon/llama-cpp-python/releases
main-metal Latest Jan 23, 2026 https://patch-diff.githubusercontent.com/imsizon/llama-cpp-python/releases/tag/main-metal
Packages 0https://patch-diff.githubusercontent.com/users/imsizon/packages?repo_name=llama-cpp-python
https://github.com
Termshttps://docs.github.com/site-policy/github-terms/github-terms-of-service
Privacyhttps://docs.github.com/site-policy/privacy-policies/github-privacy-statement
Securityhttps://github.com/security
Statushttps://www.githubstatus.com/
Communityhttps://github.community/
Docshttps://docs.github.com/
Contacthttps://support.github.com?tags=dotcom-footer

Viewport: width=device-width


URLs of crawlers that visited me.