René's URL Explorer Experiment


Title: GitHub - JamePeng/llama-cpp-python: Python bindings for llama.cpp

Open Graph Title: GitHub - JamePeng/llama-cpp-python: Python bindings for llama.cpp

X Title: GitHub - JamePeng/llama-cpp-python: Python bindings for llama.cpp

Description: Python bindings for llama.cpp. Contribute to JamePeng/llama-cpp-python development by creating an account on GitHub.

Open Graph Description: Python bindings for llama.cpp. Contribute to JamePeng/llama-cpp-python development by creating an account on GitHub.

X Description: Python bindings for llama.cpp. Contribute to JamePeng/llama-cpp-python development by creating an account on GitHub.

Opengraph URL: https://github.com/JamePeng/llama-cpp-python

X: @github

direct link

Domain: patch-diff.githubusercontent.com

route-pattern/:user_id/:repository
route-controllerfiles
route-actiondisambiguate
fetch-noncev2:df70b5b8-54da-a3c2-a595-de16001613bc
current-catalog-service-hashf3abb0cc802f3d7b95fc8762b94bdcb13bf39634c40c357301c4aa1d67a256fb
request-idCC08:14207:78BBAB0:9B58309:6975AC93
html-safe-nonced22b3ee2328dc181d89f7bb4378dbd70d7d4cbebc35e1ab7e8e4c6f7c84f4d3a
visitor-payloadeyJyZWZlcnJlciI6IiIsInJlcXVlc3RfaWQiOiJDQzA4OjE0MjA3Ojc4QkJBQjA6OUI1ODMwOTo2OTc1QUM5MyIsInZpc2l0b3JfaWQiOiIyNDg1MDM5OTA4NTQ0NDI1MTA3IiwicmVnaW9uX2VkZ2UiOiJpYWQiLCJyZWdpb25fcmVuZGVyIjoiaWFkIn0=
visitor-hmac612cb658f0a9fa21bad2dba4610ccd68d86712e79a9f48f9c41365de9865af2b
hovercard-subject-tagrepository:920296412
github-keyboard-shortcutsrepository,copilot
google-site-verificationApib7-x98H0j5cPqHWwSMm6dNU4GmODRoqxLiDzdx9I
octolytics-urlhttps://collector.github.com/github/collect
analytics-location//
fb:app_id1401488693436528
apple-itunes-appapp-id=1477376905, app-argument=https://github.com/JamePeng/llama-cpp-python
twitter:imagehttps://opengraph.githubassets.com/b99a14c102fd529251e83b09b46a4aaeadc8144de3aa42411fd71d3fbc5fd17a/JamePeng/llama-cpp-python
twitter:cardsummary_large_image
og:imagehttps://opengraph.githubassets.com/b99a14c102fd529251e83b09b46a4aaeadc8144de3aa42411fd71d3fbc5fd17a/JamePeng/llama-cpp-python
og:image:altPython bindings for llama.cpp. Contribute to JamePeng/llama-cpp-python development by creating an account on GitHub.
og:image:width1200
og:image:height600
og:site_nameGitHub
og:typeobject
hostnamegithub.com
expected-hostnamegithub.com
None4a4bf5f4e28041a9d2e5c107d7d20b78b4294ba261cab243b28167c16a623a1f
turbo-cache-controlno-preview
go-importgithub.com/JamePeng/llama-cpp-python git https://github.com/JamePeng/llama-cpp-python.git
octolytics-dimension-user_id17095606
octolytics-dimension-user_loginJamePeng
octolytics-dimension-repository_id920296412
octolytics-dimension-repository_nwoJamePeng/llama-cpp-python
octolytics-dimension-repository_publictrue
octolytics-dimension-repository_is_forktrue
octolytics-dimension-repository_parent_id617868717
octolytics-dimension-repository_parent_nwoabetlen/llama-cpp-python
octolytics-dimension-repository_network_root_id617868717
octolytics-dimension-repository_network_root_nwoabetlen/llama-cpp-python
turbo-body-classeslogged-out env-production page-responsive
disable-turbofalse
browser-stats-urlhttps://api.github.com/_private/browser/stats
browser-errors-urlhttps://api.github.com/_private/browser/errors
release488b30e96dfd057fbbe44c6665ccbc030b729dde
ui-targetfull
theme-color#1e2327
color-schemelight dark

Links:

Skip to contenthttps://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python#start-of-content
https://patch-diff.githubusercontent.com/
Sign in https://patch-diff.githubusercontent.com/login?return_to=https%3A%2F%2Fgithub.com%2FJamePeng%2Fllama-cpp-python
GitHub CopilotWrite better code with AIhttps://github.com/features/copilot
GitHub SparkBuild and deploy intelligent appshttps://github.com/features/spark
GitHub ModelsManage and compare promptshttps://github.com/features/models
MCP RegistryNewIntegrate external toolshttps://github.com/mcp
ActionsAutomate any workflowhttps://github.com/features/actions
CodespacesInstant dev environmentshttps://github.com/features/codespaces
IssuesPlan and track workhttps://github.com/features/issues
Code ReviewManage code changeshttps://github.com/features/code-review
GitHub Advanced SecurityFind and fix vulnerabilitieshttps://github.com/security/advanced-security
Code securitySecure your code as you buildhttps://github.com/security/advanced-security/code-security
Secret protectionStop leaks before they starthttps://github.com/security/advanced-security/secret-protection
Why GitHubhttps://github.com/why-github
Documentationhttps://docs.github.com
Bloghttps://github.blog
Changeloghttps://github.blog/changelog
Marketplacehttps://github.com/marketplace
View all featureshttps://github.com/features
Enterpriseshttps://github.com/enterprise
Small and medium teamshttps://github.com/team
Startupshttps://github.com/enterprise/startups
Nonprofitshttps://github.com/solutions/industry/nonprofits
App Modernizationhttps://github.com/solutions/use-case/app-modernization
DevSecOpshttps://github.com/solutions/use-case/devsecops
DevOpshttps://github.com/solutions/use-case/devops
CI/CDhttps://github.com/solutions/use-case/ci-cd
View all use caseshttps://github.com/solutions/use-case
Healthcarehttps://github.com/solutions/industry/healthcare
Financial serviceshttps://github.com/solutions/industry/financial-services
Manufacturinghttps://github.com/solutions/industry/manufacturing
Governmenthttps://github.com/solutions/industry/government
View all industrieshttps://github.com/solutions/industry
View all solutionshttps://github.com/solutions
AIhttps://github.com/resources/articles?topic=ai
Software Developmenthttps://github.com/resources/articles?topic=software-development
DevOpshttps://github.com/resources/articles?topic=devops
Securityhttps://github.com/resources/articles?topic=security
View all topicshttps://github.com/resources/articles
Customer storieshttps://github.com/customer-stories
Events & webinarshttps://github.com/resources/events
Ebooks & reportshttps://github.com/resources/whitepapers
Business insightshttps://github.com/solutions/executive-insights
GitHub Skillshttps://skills.github.com
Documentationhttps://docs.github.com
Customer supporthttps://support.github.com
Community forumhttps://github.com/orgs/community/discussions
Trust centerhttps://github.com/trust-center
Partnershttps://github.com/partners
GitHub SponsorsFund open source developershttps://github.com/sponsors
Security Labhttps://securitylab.github.com
Maintainer Communityhttps://maintainers.github.com
Acceleratorhttps://github.com/accelerator
Archive Programhttps://archiveprogram.github.com
Topicshttps://github.com/topics
Trendinghttps://github.com/trending
Collectionshttps://github.com/collections
Enterprise platformAI-powered developer platformhttps://github.com/enterprise
GitHub Advanced SecurityEnterprise-grade security featureshttps://github.com/security/advanced-security
Copilot for BusinessEnterprise-grade AI featureshttps://github.com/features/copilot/copilot-business
Premium SupportEnterprise-grade 24/7 supporthttps://github.com/premium-support
Pricinghttps://github.com/pricing
Search syntax tipshttps://docs.github.com/search-github/github-code-search/understanding-github-code-search-syntax
documentationhttps://docs.github.com/search-github/github-code-search/understanding-github-code-search-syntax
Sign in https://patch-diff.githubusercontent.com/login?return_to=https%3A%2F%2Fgithub.com%2FJamePeng%2Fllama-cpp-python
Sign up https://patch-diff.githubusercontent.com/signup?ref_cta=Sign+up&ref_loc=header+logged+out&ref_page=%2F%3Cuser-name%3E%2F%3Crepo-name%3E&source=header-repo&source_repo=JamePeng%2Fllama-cpp-python
Reloadhttps://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python
Reloadhttps://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python
Reloadhttps://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python
JamePeng https://patch-diff.githubusercontent.com/JamePeng
llama-cpp-pythonhttps://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python
abetlen/llama-cpp-pythonhttps://patch-diff.githubusercontent.com/abetlen/llama-cpp-python
Notifications https://patch-diff.githubusercontent.com/login?return_to=%2FJamePeng%2Fllama-cpp-python
Fork 21 https://patch-diff.githubusercontent.com/login?return_to=%2FJamePeng%2Fllama-cpp-python
Star 156 https://patch-diff.githubusercontent.com/login?return_to=%2FJamePeng%2Fllama-cpp-python
llama-cpp-python.readthedocs.iohttps://llama-cpp-python.readthedocs.io
MIT license https://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python/blob/main/LICENSE.md
156 stars https://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python/stargazers
1.3k forks https://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python/forks
Branches https://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python/branches
Tags https://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python/tags
Activity https://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python/activity
Star https://patch-diff.githubusercontent.com/login?return_to=%2FJamePeng%2Fllama-cpp-python
Notifications https://patch-diff.githubusercontent.com/login?return_to=%2FJamePeng%2Fllama-cpp-python
Code https://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python
Issues 3 https://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python/issues
Pull requests 0 https://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python/pulls
Actions https://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python/actions
Projects 0 https://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python/projects
Security 0 https://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python/security
Insights https://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python/pulse
Code https://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python
Issues https://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python/issues
Pull requests https://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python/pulls
Actions https://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python/actions
Projects https://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python/projects
Security https://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python/security
Insights https://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python/pulse
Brancheshttps://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python/branches
Tagshttps://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python/tags
https://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python/branches
https://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python/tags
2,315 Commitshttps://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python/commits/main/
https://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python/commits/main/
.githubhttps://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python/tree/main/.github
.githubhttps://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python/tree/main/.github
dockerhttps://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python/tree/main/docker
dockerhttps://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python/tree/main/docker
docshttps://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python/tree/main/docs
docshttps://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python/tree/main/docs
exampleshttps://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python/tree/main/examples
exampleshttps://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python/tree/main/examples
llama_cpphttps://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python/tree/main/llama_cpp
llama_cpphttps://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python/tree/main/llama_cpp
scriptshttps://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python/tree/main/scripts
scriptshttps://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python/tree/main/scripts
testshttps://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python/tree/main/tests
testshttps://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python/tree/main/tests
vendorhttps://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python/tree/main/vendor
vendorhttps://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python/tree/main/vendor
.dockerignorehttps://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python/blob/main/.dockerignore
.dockerignorehttps://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python/blob/main/.dockerignore
.gitignorehttps://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python/blob/main/.gitignore
.gitignorehttps://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python/blob/main/.gitignore
.gitmoduleshttps://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python/blob/main/.gitmodules
.gitmoduleshttps://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python/blob/main/.gitmodules
.readthedocs.yamlhttps://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python/blob/main/.readthedocs.yaml
.readthedocs.yamlhttps://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python/blob/main/.readthedocs.yaml
CHANGELOG.mdhttps://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python/blob/main/CHANGELOG.md
CHANGELOG.mdhttps://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python/blob/main/CHANGELOG.md
CMakeLists.txthttps://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python/blob/main/CMakeLists.txt
CMakeLists.txthttps://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python/blob/main/CMakeLists.txt
LICENSE.mdhttps://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python/blob/main/LICENSE.md
LICENSE.mdhttps://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python/blob/main/LICENSE.md
Makefilehttps://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python/blob/main/Makefile
Makefilehttps://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python/blob/main/Makefile
README.mdhttps://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python/blob/main/README.md
README.mdhttps://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python/blob/main/README.md
mkdocs.ymlhttps://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python/blob/main/mkdocs.yml
mkdocs.ymlhttps://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python/blob/main/mkdocs.yml
pyproject.tomlhttps://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python/blob/main/pyproject.toml
pyproject.tomlhttps://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python/blob/main/pyproject.toml
READMEhttps://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python
Licensehttps://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python
https://raw.githubusercontent.com/abetlen/llama-cpp-python/main/docs/icon.svg
llama.cpphttps://github.com/ggml-org/llama.cpp
https://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python#python-bindings-for-llamacpp
https://llama-cpp-python.readthedocs.io/en/latest/?badge=latest
https://github.com/abetlen/llama-cpp-python/actions/workflows/test.yaml
https://camo.githubusercontent.com/2a2ae22985121988e6ca3b4037a87d1d674b45b11823d00bd134fc2afcf391b9/68747470733a2f2f696d672e736869656c64732e696f2f6769746875622f762f7461672f4a616d6550656e672f6c6c616d612d6370702d707974686f6e
https://pypi.org/project/llama-cpp-python/
https://pypi.org/project/llama-cpp-python/
https://pepy.tech/projects/llama-cpp-python
https://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python/blob/main
llama.cpphttps://github.com/ggml-org/llama.cpp
LangChain compatibilityhttps://python.langchain.com/docs/integrations/llms/llamacpp
LlamaIndex compatibilityhttps://docs.llamaindex.ai/en/stable/examples/llm/llama_2_llama_cpp.html
Local Copilot replacementhttps://llama-cpp-python.readthedocs.io/en/latest/server/#code-completion
Function Calling supporthttps://llama-cpp-python.readthedocs.io/en/latest/server/#function-calling
Vision API supporthttps://llama-cpp-python.readthedocs.io/en/latest/server/#multimodal-models
Multiple Modelshttps://llama-cpp-python.readthedocs.io/en/latest/server/#configuration-and-multi-model-support
https://llama-cpp-python.readthedocs.io/en/latesthttps://llama-cpp-python.readthedocs.io/en/latest
https://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python#installation
Visual Studio 2022 Build Toolshttps://download.visualstudio.microsoft.com/download/pr/6efb3484-905b-485c-8b5f-9d3a5f39e731/07908cd6d91e75b8ea4339d8f2cfa6e8d8bb4fd706af7b918ae391cd6fc2a066/vs_BuildTools.exe
https://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python#installation-configuration
llama.cpp build docshttps://github.com/ggml-org/llama.cpp/blob/master/docs/build.md
https://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python#supported-backends
https://developer.nvidia.com/cuda-toolkit-archivehttps://developer.nvidia.com/cuda-toolkit-archive
https://github.com/JamePeng/llama-cpp-python/releaseshttps://github.com/JamePeng/llama-cpp-python/releases
Vulkan SDKhttps://vulkan.lunarg.com/sdk/home#windows
Getting Started with the Linux Tarball Vulkan SDKhttps://vulkan.lunarg.com/doc/sdk/latest/linux/getting_started.html
https://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python#windows-notes
mentioned in llama.cpp repohttps://github.com/ggerganov/llama.cpp#openblas
https://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python#macos-notes
docs/install/macos.mdhttps://llama-cpp-python.readthedocs.io/en/latest/install/macos/
https://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python#upgrading-and-reinstalling
https://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python#high-level-api
API Referencehttps://llama-cpp-python.readthedocs.io/en/latest/api-reference/#high-level-api
Llamahttps://llama-cpp-python.readthedocs.io/en/latest/api-reference/#llama_cpp.Llama
__call__https://llama-cpp-python.readthedocs.io/en/latest/api-reference/#llama_cpp.Llama.__call__
create_completionhttps://llama-cpp-python.readthedocs.io/en/latest/api-reference/#llama_cpp.Llama.create_completion
Llamahttps://llama-cpp-python.readthedocs.io/en/latest/api-reference/#llama_cpp.Llama
https://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python#pulling-models-from-hugging-face-hub
from_pretrainedhttps://llama-cpp-python.readthedocs.io/en/latest/api-reference/#llama_cpp.Llama.from_pretrained
from_pretrainedhttps://llama-cpp-python.readthedocs.io/en/latest/api-reference/#llama_cpp.Llama.from_pretrained
huggingface-clihttps://huggingface.co/docs/huggingface_hub/en/guides/cli
https://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python#chat-completion
create_chat_completionhttps://llama-cpp-python.readthedocs.io/en/latest/api-reference/#llama_cpp.Llama.create_chat_completion
Llamahttps://llama-cpp-python.readthedocs.io/en/latest/api-reference/#llama_cpp.Llama
create_chat_completion_openai_v1https://llama-cpp-python.readthedocs.io/en/latest/api-reference/#llama_cpp.Llama.create_chat_completion_openai_v1
https://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python#json-and-json-schema-mode
create_chat_completionhttps://llama-cpp-python.readthedocs.io/en/latest/api-reference/#llama_cpp.Llama.create_chat_completion
https://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python#json-mode
https://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python#json-schema-mode
https://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python#function-calling
herehttps://huggingface.co/meetkai
https://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python#multi-modal-models
llava-v1.5-7bhttps://huggingface.co/mys/ggml_llava-v1.5-7b
llava-v1.5-13bhttps://huggingface.co/mys/ggml_llava-v1.5-13b
llava-v1.6-34bhttps://huggingface.co/cjpais/llava-v1.6-34B-gguf
moondream2https://huggingface.co/vikhyatk/moondream2
nanollavahttps://huggingface.co/abetlen/nanollava-gguf
llama-3-vision-alphahttps://huggingface.co/abetlen/llama-3-vision-alpha-gguf
minicpm-v-2.6https://huggingface.co/openbmb/MiniCPM-V-2_6-gguf
gemma3https://huggingface.co/unsloth/gemma-3-27b-it-GGUF
glm4.1vhttps://huggingface.co/unsloth/GLM-4.1V-9B-Thinking-GGUF
glm4.6vhttps://huggingface.co/unsloth/GLM-4.6V-Flash-GGUF
granite-doclinghttps://huggingface.co/ibm-granite/granite-docling-258M-GGUF
lfm2-vlhttps://huggingface.co/LiquidAI/LFM2-VL-3B-GGUF
qwen2.5-vlhttps://huggingface.co/unsloth/Qwen2.5-VL-3B-Instruct-GGUF
qwen3-vlhttps://huggingface.co/unsloth/Qwen3-VL-8B-Thinking-GGUF
https://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python#loading-a-local-image-with-qwen3vlthinkinginstruct
https://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python#embeddings--reranking-gguf
https://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python#key-features
https://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python#support-embeddings--rerank-model
bge-m3-GGUFhttps://huggingface.co/gpustack/bge-m3-GGUF
bge-reranker-v2-m3-GGUFhttps://huggingface.co/gpustack/bge-reranker-v2-m3-GGUF
https://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python#todojamepeng-needs-more-extensive-testing-with-various-embedding-and-rerank-models-
https://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python#1-text-embeddings-vector-search
https://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python#2-reranking-cross-encoder-scoring
https://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python#3-normalization
https://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python#legacy-usage-deprecated
https://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python#speculative-decoding
https://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python#adjusting-the-context-window
https://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python#openai-compatible-web-server
http://localhost:8000/docshttp://localhost:8000/docs
llama_cpp/llama_chat_format.pyhttps://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python/blob/main/llama_cpp/llama_chat_format.py
https://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python#web-server-features
Local Copilot replacementhttps://llama-cpp-python.readthedocs.io/en/latest/server/#code-completion
Function Calling supporthttps://llama-cpp-python.readthedocs.io/en/latest/server/#function-calling
Vision API supporthttps://llama-cpp-python.readthedocs.io/en/latest/server/#multimodal-models
Multiple Modelshttps://llama-cpp-python.readthedocs.io/en/latest/server/#configuration-and-multi-model-support
https://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python#docker-image
GHCRhttps://ghcr.io/abetlen/llama-cpp-python
Docker on termux (requires root)https://gist.github.com/FreddieOliveira/efe850df7ff3951cb62d74bd770dce27
termux support issuehttps://github.com/abetlen/llama-cpp-python/issues/389
https://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python#low-level-api
API Referencehttps://llama-cpp-python.readthedocs.io/en/latest/api-reference/#low-level-api
ctypeshttps://docs.python.org/3/library/ctypes.html
llama_cpp/llama_cpp.pyhttps://github.com/abetlen/llama-cpp-python/blob/master/llama_cpp/llama_cpp.py
llama.hhttps://github.com/ggerganov/llama.cpp/blob/master/llama.h
examples folderhttps://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python/blob/main/examples/low_level_api
https://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python#documentation
https://llama-cpp-python.readthedocs.io/https://llama-cpp-python.readthedocs.io/
https://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python#development
https://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python#faq
https://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python#are-there-pre-built-binaries--binary-wheels-available
https://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python#how-does-this-compare-to-other-python-bindings-of-llamacpp
https://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python#oserror-libcudartsoxxcudart64_xxdll-cannot-open-shared-object-file-no-such-file-or-directory
https://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python#filenotfounderror-could-not-find-module-like-ggmldll-ggml-cpudll-ggml-cudadll
https://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python#why-are-libraries-compiled-by-other-authors-only-around-100mb-while-your-pre-compiled-versions-range-from-300mb-to-900mb
https://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python#license
llama-cpp-python.readthedocs.iohttps://llama-cpp-python.readthedocs.io
Readme https://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python#readme-ov-file
MIT license https://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python#MIT-1-ov-file
Please reload this pagehttps://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python
Activityhttps://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python/activity
156 starshttps://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python/stargazers
3 watchinghttps://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python/watchers
21 forkshttps://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python/forks
Report repository https://patch-diff.githubusercontent.com/contact/report-content?content_url=https%3A%2F%2Fgithub.com%2FJamePeng%2Fllama-cpp-python&report=JamePeng+%28user%29
Releases 154https://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python/releases
v0.3.22-cu130-Basic-win-20260118 Latest Jan 18, 2026 https://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python/releases/tag/v0.3.22-cu130-Basic-win-20260118
+ 153 releaseshttps://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python/releases
Packages 0https://patch-diff.githubusercontent.com/users/JamePeng/packages?repo_name=llama-cpp-python
Please reload this pagehttps://patch-diff.githubusercontent.com/JamePeng/llama-cpp-python
https://github.com
Termshttps://docs.github.com/site-policy/github-terms/github-terms-of-service
Privacyhttps://docs.github.com/site-policy/privacy-policies/github-privacy-statement
Securityhttps://github.com/security
Statushttps://www.githubstatus.com/
Communityhttps://github.community/
Docshttps://docs.github.com/
Contacthttps://support.github.com?tags=dotcom-footer

Viewport: width=device-width


URLs of crawlers that visited me.