René's URL Explorer Experiment


Title: GitHub - vonchenplus/llama.cpp: LLM inference in C/C++

Open Graph Title: GitHub - vonchenplus/llama.cpp: LLM inference in C/C++

X Title: GitHub - vonchenplus/llama.cpp: LLM inference in C/C++

Description: LLM inference in C/C++. Contribute to vonchenplus/llama.cpp development by creating an account on GitHub.

Open Graph Description: LLM inference in C/C++. Contribute to vonchenplus/llama.cpp development by creating an account on GitHub.

X Description: LLM inference in C/C++. Contribute to vonchenplus/llama.cpp development by creating an account on GitHub.

Opengraph URL: https://github.com/vonchenplus/llama.cpp

X: @github

Domain: patch-diff.githubusercontent.com

route-pattern: /:user_id/:repository
route-controller: files
route-action: disambiguate
hovercard-subject-tag: repository:874566161
github-keyboard-shortcuts: repository,copilot
google-site-verification: Apib7-x98H0j5cPqHWwSMm6dNU4GmODRoqxLiDzdx9I
octolytics-url: https://collector.github.com/github/collect
analytics-location: //
fb:app_id: 1401488693436528
apple-itunes-app: app-id=1477376905, app-argument=https://github.com/vonchenplus/llama.cpp
twitter:image: https://opengraph.githubassets.com/5a63d7964f27617d140d9a1f27520435e62aeec365aae235ba9c52f2d25c8272/vonchenplus/llama.cpp
twitter:card: summary_large_image
og:image: https://opengraph.githubassets.com/5a63d7964f27617d140d9a1f27520435e62aeec365aae235ba9c52f2d25c8272/vonchenplus/llama.cpp
og:image:alt: LLM inference in C/C++. Contribute to vonchenplus/llama.cpp development by creating an account on GitHub.
og:image:width: 1200
og:image:height: 600
og:site_name: GitHub
og:type: object
hostname: github.com
expected-hostname: github.com
None: 1c338feb2465fb28789c852d4d9bbb3a30c0620671d1df7914edfbde84531d5e
turbo-cache-control: no-preview
go-import: github.com/vonchenplus/llama.cpp git https://github.com/vonchenplus/llama.cpp.git
octolytics-dimension-user_id: 3349963
octolytics-dimension-user_login: vonchenplus
octolytics-dimension-repository_id: 874566161
octolytics-dimension-repository_nwo: vonchenplus/llama.cpp
octolytics-dimension-repository_public: true
octolytics-dimension-repository_is_fork: true
octolytics-dimension-repository_parent_id: 612354784
octolytics-dimension-repository_parent_nwo: ggml-org/llama.cpp
octolytics-dimension-repository_network_root_id: 612354784
octolytics-dimension-repository_network_root_nwo: ggml-org/llama.cpp
turbo-body-classes: logged-out env-production page-responsive
disable-turbo: false
browser-stats-url: https://api.github.com/_private/browser/stats
browser-errors-url: https://api.github.com/_private/browser/errors
release: ca28321bb5dd58db88720c48080666bfbe28520a
ui-target: full
theme-color: #1e2327
color-scheme: light dark

Links:

vonchenplus https://patch-diff.githubusercontent.com/vonchenplus
llama.cpp https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp
ggml-org/llama.cpp https://patch-diff.githubusercontent.com/ggml-org/llama.cpp
MIT license https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/blob/master/LICENSE
0 stars https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/stargazers
14.6k forks https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/forks
Branches https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/branches
Tags https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/tags
Activity https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/activity
Code https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp
Pull requests 0 https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/pulls
Actions https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/actions
Projects 0 https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/projects
Security https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/security
Insights https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/pulse
4,400 Commits https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/commits/master/
.devops https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/tree/master/.devops
.github https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/tree/master/.github
Sources/llama https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/tree/master/Sources/llama
ci https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/tree/master/ci
cmake https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/tree/master/cmake
common https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/tree/master/common
docs https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/tree/master/docs
examples https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/tree/master/examples
ggml https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/tree/master/ggml
gguf-py https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/tree/master/gguf-py
grammars https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/tree/master/grammars
include https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/tree/master/include
media https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/tree/master/media
models https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/tree/master/models
pocs https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/tree/master/pocs
prompts https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/tree/master/prompts
requirements https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/tree/master/requirements
scripts https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/tree/master/scripts
spm-headers https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/tree/master/spm-headers
src https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/tree/master/src
tests https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/tree/master/tests
.clang-format https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/blob/master/.clang-format
.clang-tidy https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/blob/master/.clang-tidy
.dockerignore https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/blob/master/.dockerignore
.ecrc https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/blob/master/.ecrc
.editorconfig https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/blob/master/.editorconfig
.flake8 https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/blob/master/.flake8
.gitignore https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/blob/master/.gitignore
.gitmodules https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/blob/master/.gitmodules
.pre-commit-config.yaml https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/blob/master/.pre-commit-config.yaml
AUTHORS https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/blob/master/AUTHORS
CMakeLists.txt https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/blob/master/CMakeLists.txt
CMakePresets.json https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/blob/master/CMakePresets.json
CODEOWNERS https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/blob/master/CODEOWNERS
CONTRIBUTING.md https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/blob/master/CONTRIBUTING.md
LICENSE https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/blob/master/LICENSE
Makefile https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/blob/master/Makefile
Package.swift https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/blob/master/Package.swift
README.md https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/blob/master/README.md
SECURITY.md https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/blob/master/SECURITY.md
convert_hf_to_gguf.py https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/blob/master/convert_hf_to_gguf.py
convert_hf_to_gguf_update.py https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/blob/master/convert_hf_to_gguf_update.py
convert_llama_ggml_to_gguf.py https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/blob/master/convert_llama_ggml_to_gguf.py
convert_lora_to_gguf.py https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/blob/master/convert_lora_to_gguf.py
flake.lock https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/blob/master/flake.lock
flake.nix https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/blob/master/flake.nix
mypy.ini https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/blob/master/mypy.ini
poetry.lock https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/blob/master/poetry.lock
pyproject.toml https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/blob/master/pyproject.toml
pyrightconfig.json https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/blob/master/pyrightconfig.json
requirements.txt https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/blob/master/requirements.txt
https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp#llamacpp
https://user-images.githubusercontent.com/1991296/230134379-7181e485-c521-4d23-a0d6-f7b3b61ba524.png
https://opensource.org/licenses/MIT
https://github.com/ggerganov/llama.cpp/actions/workflows/server.yml
Roadmap https://github.com/users/ggerganov/projects/7
Project status https://github.com/ggerganov/llama.cpp/discussions/3471
Manifesto https://github.com/ggerganov/llama.cpp/discussions/205
ggml https://github.com/ggerganov/ggml
LLaMA https://arxiv.org/abs/2302.13971
https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp#recent-api-changes
Changelog for libllama API https://github.com/ggerganov/llama.cpp/issues/9289
Changelog for llama-server REST API https://github.com/ggerganov/llama.cpp/issues/9291
https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp#hot-topics
ggml-org#10123 https://github.com/ggml-org/llama.cpp/discussions/10123
ggml-org#9669 https://github.com/ggml-org/llama.cpp/discussions/9669
discussion https://github.com/ggerganov/llama.cpp/discussions/9268
tool https://huggingface.co/spaces/CISCai/gguf-editor
https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp#description
ggml https://github.com/ggerganov/ggml
HOWTO-add-model.md https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/blob/master/docs/development/HOWTO-add-model.md
https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp#text-only
Mistral 7B https://huggingface.co/mistralai/Mistral-7B-v0.1
Mixtral MoE https://huggingface.co/models?search=mistral-ai/Mixtral
DBRX https://huggingface.co/databricks/dbrx-instruct
Falcon https://huggingface.co/models?search=tiiuae/falcon
Chinese LLaMA / Alpaca https://github.com/ymcui/Chinese-LLaMA-Alpaca
Chinese LLaMA-2 / Alpaca-2 https://github.com/ymcui/Chinese-LLaMA-Alpaca-2
Vigogne (French) https://github.com/bofenghuang/vigogne
BERT https://github.com/ggerganov/llama.cpp/pull/5423
Koala https://bair.berkeley.edu/blog/2023/04/03/koala/
Baichuan 1 & 2 https://huggingface.co/models?search=baichuan-inc/Baichuan
derivations https://huggingface.co/hiyouga/baichuan-7b-sft
Aquila 1 & 2 https://huggingface.co/models?search=BAAI/Aquila
Starcoder models https://github.com/ggerganov/llama.cpp/pull/3187
Refact https://huggingface.co/smallcloudai/Refact-1_6B-fim
MPT https://github.com/ggerganov/llama.cpp/pull/3417
Bloom https://github.com/ggerganov/llama.cpp/pull/3553
Yi models https://huggingface.co/models?search=01-ai/Yi
StableLM models https://huggingface.co/stabilityai
Deepseek models https://huggingface.co/models?search=deepseek-ai/deepseek
Qwen models https://huggingface.co/models?search=Qwen/Qwen
PLaMo-13B https://github.com/ggerganov/llama.cpp/pull/3557
Phi models https://huggingface.co/models?search=microsoft/phi
GPT-2 https://huggingface.co/gpt2
Orion 14B https://github.com/ggerganov/llama.cpp/pull/5118
InternLM2 https://huggingface.co/models?search=internlm2
CodeShell https://github.com/WisdomShell/codeshell
Gemma https://ai.google.dev/gemma
Mamba https://github.com/state-spaces/mamba
Grok-1 https://huggingface.co/keyfan/grok-1-hf
Xverse https://huggingface.co/models?search=xverse
Command-R models https://huggingface.co/models?search=CohereForAI/c4ai-command-r
SEA-LION https://huggingface.co/models?search=sea-lion
GritLM-7B https://huggingface.co/GritLM/GritLM-7B
GritLM-8x7B https://huggingface.co/GritLM/GritLM-8x7B
OLMo https://allenai.org/olmo
OLMo 2 https://allenai.org/olmo
OLMoE https://huggingface.co/allenai/OLMoE-1B-7B-0924
Granite models https://huggingface.co/collections/ibm-granite/granite-code-models-6624c5cec322e4c148c8b330
GPT-NeoX https://github.com/EleutherAI/gpt-neox
Pythia https://github.com/EleutherAI/pythia
Snowflake-Arctic MoE https://huggingface.co/collections/Snowflake/arctic-66290090abe542894a5ac520
Smaug https://huggingface.co/models?search=Smaug
Poro 34B https://huggingface.co/LumiOpen/Poro-34B
Bitnet b1.58 models https://huggingface.co/1bitLLM
Flan T5 https://huggingface.co/models?search=flan-t5
Open Elm models https://huggingface.co/collections/apple/openelm-instruct-models-6619ad295d7ae9f868b759ca
ChatGLM3-6b https://huggingface.co/THUDM/chatglm3-6b
ChatGLM4-9b https://huggingface.co/THUDM/glm-4-9b
SmolLM https://huggingface.co/collections/HuggingFaceTB/smollm-6695016cad7167254ce15966
EXAONE-3.0-7.8B-Instruct https://huggingface.co/LGAI-EXAONE/EXAONE-3.0-7.8B-Instruct
FalconMamba Models https://huggingface.co/collections/tiiuae/falconmamba-7b-66b9a580324dd1598b0f6d4a
Jais https://huggingface.co/inceptionai/jais-13b-chat
Bielik-11B-v2.3 https://huggingface.co/collections/speakleash/bielik-11b-v23-66ee813238d9b526a072408a
RWKV-6 https://github.com/BlinkDL/RWKV-LM
GigaChat-20B-A3B https://huggingface.co/ai-sage/GigaChat-20B-A3B-instruct
https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp#multimodal
LLaVA 1.5 models https://huggingface.co/collections/liuhaotian/llava-15-653aac15d994e992e2677a7e
LLaVA 1.6 models https://huggingface.co/collections/liuhaotian/llava-16-65b9e40155f60fd046a5ccf2
BakLLaVA https://huggingface.co/models?search=SkunkworksAI/Bakllava
Obsidian https://huggingface.co/NousResearch/Obsidian-3B-V0.5
ShareGPT4V https://huggingface.co/models?search=Lin-Chen/ShareGPT4V
MobileVLM 1.7B/3B models https://huggingface.co/models?search=mobileVLM
Yi-VL https://huggingface.co/models?search=Yi-VL
Mini CPM https://huggingface.co/models?search=MiniCPM
Moondream https://huggingface.co/vikhyatk/moondream2
Bunny https://github.com/BAAI-DCAI/Bunny
Qwen2-VL https://huggingface.co/collections/Qwen/qwen2-vl-66cee7455501d7126940800d
abetlen/llama-cpp-python https://github.com/abetlen/llama-cpp-python
go-skynet/go-llama.cpp https://github.com/go-skynet/go-llama.cpp
withcatai/node-llama-cpp https://github.com/withcatai/node-llama-cpp
lgrammel/modelfusion https://modelfusion.dev/integration/model-provider/llamacpp
offline-ai/cli https://github.com/offline-ai/cli
tangledgroup/llama-cpp-wasm https://github.com/tangledgroup/llama-cpp-wasm
ngxson/wllama https://github.com/ngxson/wllama
yoshoku/llama_cpp.rb https://github.com/yoshoku/llama_cpp.rb
edgenai/llama_cpp-rs https://github.com/edgenai/llama_cpp-rs
mdrokz/rust-llama.cpp https://github.com/mdrokz/rust-llama.cpp
utilityai/llama-cpp-rs https://github.com/utilityai/llama-cpp-rs
SciSharp/LLamaSharp https://github.com/SciSharp/LLamaSharp
LM-Kit.NET https://docs.lm-kit.com/lm-kit-net/index.html
donderom/llm4s https://github.com/donderom/llm4s
phronmophobic/llama.clj https://github.com/phronmophobic/llama.clj
mybigday/llama.rn https://github.com/mybigday/llama.rn
kherud/java-llama.cpp https://github.com/kherud/java-llama.cpp
deins/llama.cpp.zig https://github.com/Deins/llama.cpp.zig
netdur/llama_cpp_dart https://github.com/netdur/llama_cpp_dart
xuegao-tzx/Fllama https://github.com/xuegao-tzx/Fllama
distantmagic/resonance https://github.com/distantmagic/resonance
(more info) https://github.com/ggerganov/llama.cpp/pull/6326
guile_llama_cpp https://savannah.nongnu.org/projects/guile-llama-cpp
srgtuszy/llama-cpp-swift https://github.com/srgtuszy/llama-cpp-swift
ShenghaiWang/SwiftLlama https://github.com/ShenghaiWang/SwiftLlama
AI Sublime Text plugin https://github.com/yaroslavyaroslav/OpenAI-sublime-text
cztomsik/ava https://github.com/cztomsik/ava
Dot https://github.com/alexpinel/Dot
eva https://github.com/ylsdamxssjxxdd/eva
iohub/collama https://github.com/iohub/coLLaMA
janhq/jan https://github.com/janhq/jan
KanTV https://github.com/zhouwg/kantv?tab=readme-ov-file
KodiBot https://github.com/firatkiral/kodibot
llama.vim https://github.com/ggml-org/llama.vim
LARS https://github.com/abgulati/LARS
Llama Assistant https://github.com/vietanhdev/llama-assistant
LLMFarm https://github.com/guinmoon/LLMFarm?tab=readme-ov-file
LLMUnity https://github.com/undreamai/LLMUnity
LMStudio https://lmstudio.ai/
LocalAI https://github.com/mudler/LocalAI
LostRuins/koboldcpp https://github.com/LostRuins/koboldcpp
MindMac https://mindmac.app
MindWorkAI/AI-Studio https://github.com/MindWorkAI/AI-Studio
Mobile-Artificial-Intelligence/maid https://github.com/Mobile-Artificial-Intelligence/maid
Mozilla-Ocho/llamafile https://github.com/Mozilla-Ocho/llamafile
nat/openplayground https://github.com/nat/openplayground
nomic-ai/gpt4all https://github.com/nomic-ai/gpt4all
ollama/ollama https://github.com/ollama/ollama
oobabooga/text-generation-webui https://github.com/oobabooga/text-generation-webui
PocketPal AI https://github.com/a-ghorbani/pocketpal-ai
psugihara/FreeChat https://github.com/psugihara/FreeChat
ptsochantaris/emeltal https://github.com/ptsochantaris/emeltal
pythops/tenere https://github.com/pythops/tenere
ramalama https://github.com/containers/ramalama
semperai/amica https://github.com/semperai/amica
withcatai/catai https://github.com/withcatai/catai
akx/ggify https://github.com/akx/ggify
akx/ollama-dl https://github.com/akx/ollama-dl
crashr/gppm https://github.com/crashr/gppm
gpustack/gguf-parser https://github.com/gpustack/gguf-parser-go/tree/main/cmd/gguf-parser
Styled Lines https://marketplace.unity.com/packages/tools/generative-ai/styled-lines-llama-cpp-model-292902
Paddler https://github.com/distantmagic/paddler
GPUStack https://github.com/gpustack/gpustack
llama_cpp_canister https://github.com/onicai/llama_cpp_canister
Lucy's Labyrinth https://github.com/MorganRO8/Lucys_Labyrinth
https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp#supported-backends
Metal https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/blob/master/docs/build.md#metal-build
BLAS https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/blob/master/docs/build.md#blas-build
BLIS https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/blob/master/docs/backend/BLIS.md
SYCL https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/blob/master/docs/backend/SYCL.md
MUSA https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/blob/master/docs/build.md#musa
CUDA https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/blob/master/docs/build.md#cuda
HIP https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/blob/master/docs/build.md#hip
Vulkan https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/blob/master/docs/build.md#vulkan
CANN https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/blob/master/docs/build.md#cann
https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp#building-the-project
include/llama.h https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/blob/master/include/llama.h
how to build https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/blob/master/docs/build.md
brew, flox or nix https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/blob/master/docs/install.md
documentation for Docker https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/blob/master/docs/docker.md
releases https://github.com/ggerganov/llama.cpp/releases
https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp#obtaining-and-quantizing-models
Hugging Face https://huggingface.co
number of LLMs https://huggingface.co/models?library=gguf&sort=trending
Trending https://huggingface.co/models?library=gguf&sort=trending
LLaMA https://huggingface.co/models?sort=trending&search=llama+gguf
GGUF https://github.com/ggerganov/ggml/blob/master/docs/gguf.md
GGUF-my-repo space https://huggingface.co/spaces/ggml-org/gguf-my-repo
GGUF-my-LoRA space https://huggingface.co/spaces/ggml-org/gguf-my-lora
ggml-org#10123 https://github.com/ggml-org/llama.cpp/discussions/10123
GGUF-editor space https://huggingface.co/spaces/CISCai/gguf-editor
ggml-org#9268 https://github.com/ggml-org/llama.cpp/discussions/9268
Inference Endpoints https://ui.endpoints.huggingface.co/
ggml-org#9669 https://github.com/ggml-org/llama.cpp/discussions/9669
read this documentation https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/blob/master/examples/quantize/README.md
llama-cli https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/blob/master/examples/main
https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp#llama-cli
https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp#a-cli-tool-for-accessing-and-experimenting-with-most-of-llamacpps-functionality
Supported templateshttps://github.com/ggerganov/llama.cpp/wiki/Templates-supported-by-llama_chat_apply_template
grammars/https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/blob/master/grammars
GBNF Guidehttps://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/blob/master/grammars/README.md
https://grammar.intrinsiclabs.ai/https://grammar.intrinsiclabs.ai/
llama-serverhttps://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/blob/master/examples/server
https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp#llama-server
OpenAI APIhttps://github.com/openai/openai-openapi
https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp#a-lightweight-openai-api-compatible-http-server-for-serving-llms
llama-perplexityhttps://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/blob/master/examples/perplexity
https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp#llama-perplexity
1https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp#user-content-fn-1-31792b66e3e9aea4d3132c46bfc13a82
2https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp#user-content-fn-2-31792b66e3e9aea4d3132c46bfc13a82
https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp#a-tool-for-measuring-the-perplexity-12-and-other-quality-metrics-of-a-model-over-a-given-text
llama-benchhttps://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/blob/master/examples/llama-bench
https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp#llama-bench
https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp#benchmark-the-performance-of-the-inference-for-various-parameters
llama-runhttps://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/blob/master/examples/run
https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp#llama-run
3https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp#user-content-fn-3-31792b66e3e9aea4d3132c46bfc13a82
https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp#a-comprehensive-example-for-running-llamacpp-models-useful-for-inferencing-used-with-ramalama-3
llama-simplehttps://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/blob/master/examples/simple
https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp#llama-simple
https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp#a-minimal-example-for-implementing-apps-with-llamacpp-useful-for-developers
https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp#contributing
good first issueshttps://github.com/ggerganov/llama.cpp/issues?q=is%3Aissue+is%3Aopen+label%3A%22good+first+issue%22
CONTRIBUTING.mdhttps://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/blob/master/CONTRIBUTING.md
Inference at the edgehttps://github.com/ggerganov/llama.cpp/discussions/205
Changelog podcasthttps://changelog.com/podcast/532
https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp#other-documentation
main (cli)https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/blob/master/examples/main/README.md
serverhttps://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/blob/master/examples/server/README.md
GBNF grammarshttps://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/blob/master/grammars/README.md
https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp#development-documentation
How to buildhttps://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/blob/master/docs/build.md
Running on Dockerhttps://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/blob/master/docs/docker.md
Build on Androidhttps://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/blob/master/docs/android.md
Performance troubleshootinghttps://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/blob/master/docs/development/token_generation_performance_tips.md
GGML tips & trickshttps://github.com/ggerganov/llama.cpp/wiki/GGML-Tips-&-Tricks
https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp#seminal-papers-and-background-on-the-models
Introducing LLaMA: A foundational, 65-billion-parameter large language modelhttps://ai.facebook.com/blog/large-language-model-llama-meta-ai/
LLaMA: Open and Efficient Foundation Language Modelshttps://arxiv.org/abs/2302.13971
Language Models are Few-Shot Learnershttps://arxiv.org/abs/2005.14165
Aligning language models to follow instructionshttps://openai.com/research/instruction-following
Training language models to follow instructions with human feedbackhttps://arxiv.org/abs/2203.02155
https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp#references
examples/perplexity/README.mdhttps://patch-diff.githubusercontent.com/vonchenplus/examples/perplexity/README.md
https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp#user-content-fnref-1-31792b66e3e9aea4d3132c46bfc13a82
https://huggingface.co/docs/transformers/perplexityhttps://huggingface.co/docs/transformers/perplexity
https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp#user-content-fnref-2-31792b66e3e9aea4d3132c46bfc13a82
RamaLamahttps://github.com/containers/ramalama
https://patch-diff.githubusercontent.com/vonchenplus/llama.cpp#user-content-fnref-3-31792b66e3e9aea4d3132c46bfc13a82
Activityhttps://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/activity
0 starshttps://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/stargazers
0 watchinghttps://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/watchers
0 forkshttps://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/forks
Releaseshttps://patch-diff.githubusercontent.com/vonchenplus/llama.cpp/releases
Packages 0https://patch-diff.githubusercontent.com/users/vonchenplus/packages?repo_name=llama.cpp

Viewport: width=device-width

