René's URL Explorer Experiment

Title: GitHub - Gomez12/llama.cpp: LLM inference in C/C++

Open Graph Title: GitHub - Gomez12/llama.cpp: LLM inference in C/C++

X Title: GitHub - Gomez12/llama.cpp: LLM inference in C/C++

Description: LLM inference in C/C++. Contribute to Gomez12/llama.cpp development by creating an account on GitHub.

Open Graph Description: LLM inference in C/C++. Contribute to Gomez12/llama.cpp development by creating an account on GitHub.

X Description: LLM inference in C/C++. Contribute to Gomez12/llama.cpp development by creating an account on GitHub.

Opengraph URL: https://github.com/Gomez12/llama.cpp

X: @github

direct link

Domain: github.com

route-pattern	/:user_id/:repository
route-controller	files
route-action	disambiguate
fetch-nonce	v2:acf23ad5-bea9-554c-144d-f6101aa7186c
current-catalog-service-hash	f3abb0cc802f3d7b95fc8762b94bdcb13bf39634c40c357301c4aa1d67a256fb
request-id	A094:23CDC6:DF0B45:12A6412:69698AA8
html-safe-nonce	c38244b1ec0c4a2ce0695e614248aa2cedf1578ed6f292e0936ae3beaa60d0ea
visitor-payload	eyJyZWZlcnJlciI6IiIsInJlcXVlc3RfaWQiOiJBMDk0OjIzQ0RDNjpERjBCNDU6MTJBNjQxMjo2OTY5OEFBOCIsInZpc2l0b3JfaWQiOiIxNjk4MzI2MDk3NDg1NzI4NDI1IiwicmVnaW9uX2VkZ2UiOiJpYWQiLCJyZWdpb25fcmVuZGVyIjoiaWFkIn0=
visitor-hmac	a1ac91aa7592b519bc15643dc5a2a3e4637c76bc4758bb2074c0e758735f3bce
hovercard-subject-tag	repository:1111661254
github-keyboard-shortcuts	repository,copilot
google-site-verification	Apib7-x98H0j5cPqHWwSMm6dNU4GmODRoqxLiDzdx9I
octolytics-url	https://collector.github.com/github/collect
analytics-location	//
fb:app_id	1401488693436528
apple-itunes-app	app-id=1477376905, app-argument=https://github.com/Gomez12/llama.cpp
twitter:image	https://opengraph.githubassets.com/36a8963e07770e3e0c1cd6363c0f7e016cae98af51a797f6842cbee495ab82dd/Gomez12/llama.cpp
twitter:card	summary_large_image
og:image	https://opengraph.githubassets.com/36a8963e07770e3e0c1cd6363c0f7e016cae98af51a797f6842cbee495ab82dd/Gomez12/llama.cpp
og:image:alt	LLM inference in C/C++. Contribute to Gomez12/llama.cpp development by creating an account on GitHub.
og:image:width	1200
og:image:height	600
og:site_name	GitHub
og:type	object
hostname	github.com
expected-hostname	github.com
None	533e7cac596c452090972c1150d587fd0b36531b8dc4e8bbfe4ab694aca02408
turbo-cache-control	no-preview
go-import	github.com/Gomez12/llama.cpp git https://github.com/Gomez12/llama.cpp.git
octolytics-dimension-user_id	138985
octolytics-dimension-user_login	Gomez12
octolytics-dimension-repository_id	1111661254
octolytics-dimension-repository_nwo	Gomez12/llama.cpp
octolytics-dimension-repository_public	true
octolytics-dimension-repository_is_fork	true
octolytics-dimension-repository_parent_id	612354784
octolytics-dimension-repository_parent_nwo	ggml-org/llama.cpp
octolytics-dimension-repository_network_root_id	612354784
octolytics-dimension-repository_network_root_nwo	ggml-org/llama.cpp
turbo-body-classes	logged-out env-production page-responsive
disable-turbo	false
browser-stats-url	https://api.github.com/_private/browser/stats
browser-errors-url	https://api.github.com/_private/browser/errors
release	63d27af10eea2ccab520b162530cf6c7b739e767
ui-target	full
theme-color	#1e2327
color-scheme	light dark

Links:

Skip to content	https://github.com/Gomez12/llama.cpp#start-of-content
	https://github.com/
Sign in	https://github.com/login?return_to=https%3A%2F%2Fgithub.com%2FGomez12%2Fllama.cpp
GitHub CopilotWrite better code with AI	https://github.com/features/copilot
GitHub SparkBuild and deploy intelligent apps	https://github.com/features/spark
GitHub ModelsManage and compare prompts	https://github.com/features/models
MCP RegistryNewIntegrate external tools	https://github.com/mcp
ActionsAutomate any workflow	https://github.com/features/actions
CodespacesInstant dev environments	https://github.com/features/codespaces
IssuesPlan and track work	https://github.com/features/issues
Code ReviewManage code changes	https://github.com/features/code-review
GitHub Advanced SecurityFind and fix vulnerabilities	https://github.com/security/advanced-security
Code securitySecure your code as you build	https://github.com/security/advanced-security/code-security
Secret protectionStop leaks before they start	https://github.com/security/advanced-security/secret-protection
Why GitHub	https://github.com/why-github
Documentation	https://docs.github.com
Blog	https://github.blog
Changelog	https://github.blog/changelog
Marketplace	https://github.com/marketplace
View all features	https://github.com/features
Enterprises	https://github.com/enterprise
Small and medium teams	https://github.com/team
Startups	https://github.com/enterprise/startups
Nonprofits	https://github.com/solutions/industry/nonprofits
App Modernization	https://github.com/solutions/use-case/app-modernization
DevSecOps	https://github.com/solutions/use-case/devsecops
DevOps	https://github.com/solutions/use-case/devops
CI/CD	https://github.com/solutions/use-case/ci-cd
View all use cases	https://github.com/solutions/use-case
Healthcare	https://github.com/solutions/industry/healthcare
Financial services	https://github.com/solutions/industry/financial-services
Manufacturing	https://github.com/solutions/industry/manufacturing
Government	https://github.com/solutions/industry/government
View all industries	https://github.com/solutions/industry
View all solutions	https://github.com/solutions
AI	https://github.com/resources/articles?topic=ai
Software Development	https://github.com/resources/articles?topic=software-development
DevOps	https://github.com/resources/articles?topic=devops
Security	https://github.com/resources/articles?topic=security
View all topics	https://github.com/resources/articles
Customer stories	https://github.com/customer-stories
Events & webinars	https://github.com/resources/events
Ebooks & reports	https://github.com/resources/whitepapers
Business insights	https://github.com/solutions/executive-insights
GitHub Skills	https://skills.github.com
Documentation	https://docs.github.com
Customer support	https://support.github.com
Community forum	https://github.com/orgs/community/discussions
Trust center	https://github.com/trust-center
Partners	https://github.com/partners
GitHub SponsorsFund open source developers	https://github.com/sponsors
Security Lab	https://securitylab.github.com
Maintainer Community	https://maintainers.github.com
Accelerator	https://github.com/accelerator
Archive Program	https://archiveprogram.github.com
Topics	https://github.com/topics
Trending	https://github.com/trending
Collections	https://github.com/collections
Enterprise platformAI-powered developer platform	https://github.com/enterprise
GitHub Advanced SecurityEnterprise-grade security features	https://github.com/security/advanced-security
Copilot for BusinessEnterprise-grade AI features	https://github.com/features/copilot/copilot-business
Premium SupportEnterprise-grade 24/7 support	https://github.com/premium-support
Pricing	https://github.com/pricing
Search syntax tips	https://docs.github.com/search-github/github-code-search/understanding-github-code-search-syntax
documentation	https://docs.github.com/search-github/github-code-search/understanding-github-code-search-syntax
Sign in	https://github.com/login?return_to=https%3A%2F%2Fgithub.com%2FGomez12%2Fllama.cpp
Sign up	https://github.com/signup?ref_cta=Sign+up&ref_loc=header+logged+out&ref_page=%2F%3Cuser-name%3E%2F%3Crepo-name%3E&source=header-repo&source_repo=Gomez12%2Fllama.cpp
Reload	https://github.com/Gomez12/llama.cpp
Reload	https://github.com/Gomez12/llama.cpp
Reload	https://github.com/Gomez12/llama.cpp
Gomez12	https://github.com/Gomez12
llama.cpp	https://github.com/Gomez12/llama.cpp
ggml-org/llama.cpp	https://github.com/ggml-org/llama.cpp
Notifications	https://github.com/login?return_to=%2FGomez12%2Fllama.cpp
Fork 0	https://github.com/login?return_to=%2FGomez12%2Fllama.cpp
Star 0	https://github.com/login?return_to=%2FGomez12%2Fllama.cpp
MIT license	https://github.com/Gomez12/llama.cpp/blob/master/LICENSE
0 stars	https://github.com/Gomez12/llama.cpp/stargazers
14.5k forks	https://github.com/Gomez12/llama.cpp/forks
Branches	https://github.com/Gomez12/llama.cpp/branches
Tags	https://github.com/Gomez12/llama.cpp/tags
Activity	https://github.com/Gomez12/llama.cpp/activity
Star	https://github.com/login?return_to=%2FGomez12%2Fllama.cpp
Notifications	https://github.com/login?return_to=%2FGomez12%2Fllama.cpp
Code	https://github.com/Gomez12/llama.cpp
Pull requests 2	https://github.com/Gomez12/llama.cpp/pulls
Actions	https://github.com/Gomez12/llama.cpp/actions
Projects 0	https://github.com/Gomez12/llama.cpp/projects
Security Uh oh! There was an error while loading. Please reload this page.	https://github.com/Gomez12/llama.cpp/security
Please reload this page	https://github.com/Gomez12/llama.cpp
Insights	https://github.com/Gomez12/llama.cpp/pulse
Code	https://github.com/Gomez12/llama.cpp
Pull requests	https://github.com/Gomez12/llama.cpp/pulls
Actions	https://github.com/Gomez12/llama.cpp/actions
Projects	https://github.com/Gomez12/llama.cpp/projects
Security	https://github.com/Gomez12/llama.cpp/security
Insights	https://github.com/Gomez12/llama.cpp/pulse
Branches	https://github.com/Gomez12/llama.cpp/branches
Tags	https://github.com/Gomez12/llama.cpp/tags
	https://github.com/Gomez12/llama.cpp/branches
	https://github.com/Gomez12/llama.cpp/tags
7,717 Commits	https://github.com/Gomez12/llama.cpp/commits/master/
	https://github.com/Gomez12/llama.cpp/commits/master/
.devops	https://github.com/Gomez12/llama.cpp/tree/master/.devops
.devops	https://github.com/Gomez12/llama.cpp/tree/master/.devops
.gemini	https://github.com/Gomez12/llama.cpp/tree/master/.gemini
.gemini	https://github.com/Gomez12/llama.cpp/tree/master/.gemini
.github	https://github.com/Gomez12/llama.cpp/tree/master/.github
.github	https://github.com/Gomez12/llama.cpp/tree/master/.github
benches/dgx-spark	https://github.com/Gomez12/llama.cpp/tree/master/benches/dgx-spark
benches/dgx-spark	https://github.com/Gomez12/llama.cpp/tree/master/benches/dgx-spark
ci	https://github.com/Gomez12/llama.cpp/tree/master/ci
ci	https://github.com/Gomez12/llama.cpp/tree/master/ci
cmake	https://github.com/Gomez12/llama.cpp/tree/master/cmake
cmake	https://github.com/Gomez12/llama.cpp/tree/master/cmake
common	https://github.com/Gomez12/llama.cpp/tree/master/common
common	https://github.com/Gomez12/llama.cpp/tree/master/common
docs	https://github.com/Gomez12/llama.cpp/tree/master/docs
docs	https://github.com/Gomez12/llama.cpp/tree/master/docs
examples	https://github.com/Gomez12/llama.cpp/tree/master/examples
examples	https://github.com/Gomez12/llama.cpp/tree/master/examples
ggml	https://github.com/Gomez12/llama.cpp/tree/master/ggml
ggml	https://github.com/Gomez12/llama.cpp/tree/master/ggml
gguf-py	https://github.com/Gomez12/llama.cpp/tree/master/gguf-py
gguf-py	https://github.com/Gomez12/llama.cpp/tree/master/gguf-py
grammars	https://github.com/Gomez12/llama.cpp/tree/master/grammars
grammars	https://github.com/Gomez12/llama.cpp/tree/master/grammars
include	https://github.com/Gomez12/llama.cpp/tree/master/include
include	https://github.com/Gomez12/llama.cpp/tree/master/include
licenses	https://github.com/Gomez12/llama.cpp/tree/master/licenses
licenses	https://github.com/Gomez12/llama.cpp/tree/master/licenses
media	https://github.com/Gomez12/llama.cpp/tree/master/media
media	https://github.com/Gomez12/llama.cpp/tree/master/media
models	https://github.com/Gomez12/llama.cpp/tree/master/models
models	https://github.com/Gomez12/llama.cpp/tree/master/models
pocs	https://github.com/Gomez12/llama.cpp/tree/master/pocs
pocs	https://github.com/Gomez12/llama.cpp/tree/master/pocs
requirements	https://github.com/Gomez12/llama.cpp/tree/master/requirements
requirements	https://github.com/Gomez12/llama.cpp/tree/master/requirements
scripts	https://github.com/Gomez12/llama.cpp/tree/master/scripts
scripts	https://github.com/Gomez12/llama.cpp/tree/master/scripts
src	https://github.com/Gomez12/llama.cpp/tree/master/src
src	https://github.com/Gomez12/llama.cpp/tree/master/src
tests	https://github.com/Gomez12/llama.cpp/tree/master/tests
tests	https://github.com/Gomez12/llama.cpp/tree/master/tests
tools	https://github.com/Gomez12/llama.cpp/tree/master/tools
tools	https://github.com/Gomez12/llama.cpp/tree/master/tools
vendor	https://github.com/Gomez12/llama.cpp/tree/master/vendor
vendor	https://github.com/Gomez12/llama.cpp/tree/master/vendor
.clang-format	https://github.com/Gomez12/llama.cpp/blob/master/.clang-format
.clang-format	https://github.com/Gomez12/llama.cpp/blob/master/.clang-format
.clang-tidy	https://github.com/Gomez12/llama.cpp/blob/master/.clang-tidy
.clang-tidy	https://github.com/Gomez12/llama.cpp/blob/master/.clang-tidy
.dockerignore	https://github.com/Gomez12/llama.cpp/blob/master/.dockerignore
.dockerignore	https://github.com/Gomez12/llama.cpp/blob/master/.dockerignore
.ecrc	https://github.com/Gomez12/llama.cpp/blob/master/.ecrc
.ecrc	https://github.com/Gomez12/llama.cpp/blob/master/.ecrc
.editorconfig	https://github.com/Gomez12/llama.cpp/blob/master/.editorconfig
.editorconfig	https://github.com/Gomez12/llama.cpp/blob/master/.editorconfig
.flake8	https://github.com/Gomez12/llama.cpp/blob/master/.flake8
.flake8	https://github.com/Gomez12/llama.cpp/blob/master/.flake8
.gitignore	https://github.com/Gomez12/llama.cpp/blob/master/.gitignore
.gitignore	https://github.com/Gomez12/llama.cpp/blob/master/.gitignore
.gitmodules	https://github.com/Gomez12/llama.cpp/blob/master/.gitmodules
.gitmodules	https://github.com/Gomez12/llama.cpp/blob/master/.gitmodules
.pre-commit-config.yaml	https://github.com/Gomez12/llama.cpp/blob/master/.pre-commit-config.yaml
.pre-commit-config.yaml	https://github.com/Gomez12/llama.cpp/blob/master/.pre-commit-config.yaml
AGENTS.md	https://github.com/Gomez12/llama.cpp/blob/master/AGENTS.md
AGENTS.md	https://github.com/Gomez12/llama.cpp/blob/master/AGENTS.md
AUTHORS	https://github.com/Gomez12/llama.cpp/blob/master/AUTHORS
AUTHORS	https://github.com/Gomez12/llama.cpp/blob/master/AUTHORS
CLAUDE.md	https://github.com/Gomez12/llama.cpp/blob/master/CLAUDE.md
CLAUDE.md	https://github.com/Gomez12/llama.cpp/blob/master/CLAUDE.md
CMakeLists.txt	https://github.com/Gomez12/llama.cpp/blob/master/CMakeLists.txt
CMakeLists.txt	https://github.com/Gomez12/llama.cpp/blob/master/CMakeLists.txt
CMakePresets.json	https://github.com/Gomez12/llama.cpp/blob/master/CMakePresets.json
CMakePresets.json	https://github.com/Gomez12/llama.cpp/blob/master/CMakePresets.json
CODEOWNERS	https://github.com/Gomez12/llama.cpp/blob/master/CODEOWNERS
CODEOWNERS	https://github.com/Gomez12/llama.cpp/blob/master/CODEOWNERS
CONTRIBUTING.md	https://github.com/Gomez12/llama.cpp/blob/master/CONTRIBUTING.md
CONTRIBUTING.md	https://github.com/Gomez12/llama.cpp/blob/master/CONTRIBUTING.md
LICENSE	https://github.com/Gomez12/llama.cpp/blob/master/LICENSE
LICENSE	https://github.com/Gomez12/llama.cpp/blob/master/LICENSE
Makefile	https://github.com/Gomez12/llama.cpp/blob/master/Makefile
Makefile	https://github.com/Gomez12/llama.cpp/blob/master/Makefile
README.md	https://github.com/Gomez12/llama.cpp/blob/master/README.md
README.md	https://github.com/Gomez12/llama.cpp/blob/master/README.md
SECURITY.md	https://github.com/Gomez12/llama.cpp/blob/master/SECURITY.md
SECURITY.md	https://github.com/Gomez12/llama.cpp/blob/master/SECURITY.md
build-xcframework.sh	https://github.com/Gomez12/llama.cpp/blob/master/build-xcframework.sh
build-xcframework.sh	https://github.com/Gomez12/llama.cpp/blob/master/build-xcframework.sh
convert_hf_to_gguf.py	https://github.com/Gomez12/llama.cpp/blob/master/convert_hf_to_gguf.py
convert_hf_to_gguf.py	https://github.com/Gomez12/llama.cpp/blob/master/convert_hf_to_gguf.py
convert_hf_to_gguf_update.py	https://github.com/Gomez12/llama.cpp/blob/master/convert_hf_to_gguf_update.py
convert_hf_to_gguf_update.py	https://github.com/Gomez12/llama.cpp/blob/master/convert_hf_to_gguf_update.py
convert_llama_ggml_to_gguf.py	https://github.com/Gomez12/llama.cpp/blob/master/convert_llama_ggml_to_gguf.py
convert_llama_ggml_to_gguf.py	https://github.com/Gomez12/llama.cpp/blob/master/convert_llama_ggml_to_gguf.py
convert_lora_to_gguf.py	https://github.com/Gomez12/llama.cpp/blob/master/convert_lora_to_gguf.py
convert_lora_to_gguf.py	https://github.com/Gomez12/llama.cpp/blob/master/convert_lora_to_gguf.py
flake.lock	https://github.com/Gomez12/llama.cpp/blob/master/flake.lock
flake.lock	https://github.com/Gomez12/llama.cpp/blob/master/flake.lock
flake.nix	https://github.com/Gomez12/llama.cpp/blob/master/flake.nix
flake.nix	https://github.com/Gomez12/llama.cpp/blob/master/flake.nix
mypy.ini	https://github.com/Gomez12/llama.cpp/blob/master/mypy.ini
mypy.ini	https://github.com/Gomez12/llama.cpp/blob/master/mypy.ini
poetry.lock	https://github.com/Gomez12/llama.cpp/blob/master/poetry.lock
poetry.lock	https://github.com/Gomez12/llama.cpp/blob/master/poetry.lock
pyproject.toml	https://github.com/Gomez12/llama.cpp/blob/master/pyproject.toml
pyproject.toml	https://github.com/Gomez12/llama.cpp/blob/master/pyproject.toml
pyrightconfig.json	https://github.com/Gomez12/llama.cpp/blob/master/pyrightconfig.json
pyrightconfig.json	https://github.com/Gomez12/llama.cpp/blob/master/pyrightconfig.json
requirements.txt	https://github.com/Gomez12/llama.cpp/blob/master/requirements.txt
requirements.txt	https://github.com/Gomez12/llama.cpp/blob/master/requirements.txt
README	https://github.com/Gomez12/llama.cpp
Contributing	https://github.com/Gomez12/llama.cpp
License	https://github.com/Gomez12/llama.cpp
Security	https://github.com/Gomez12/llama.cpp
	https://github.com/Gomez12/llama.cpp#llamacpp
	https://user-images.githubusercontent.com/1991296/230134379-7181e485-c521-4d23-a0d6-f7b3b61ba524.png
	https://opensource.org/licenses/MIT
	https://github.com/ggml-org/llama.cpp/releases
	https://github.com/ggml-org/llama.cpp/actions/workflows/server.yml
Manifesto	https://github.com/ggml-org/llama.cpp/discussions/205
ggml	https://github.com/ggml-org/ggml
ops	https://github.com/ggml-org/llama.cpp/blob/master/docs/ops.md
	https://github.com/Gomez12/llama.cpp#recent-api-changes
Changelog for libllama API	https://github.com/ggml-org/llama.cpp/issues/9289
Changelog for llama-server REST API	https://github.com/ggml-org/llama.cpp/issues/9291
	https://github.com/Gomez12/llama.cpp#hot-topics
guide : using the new WebUI of llama.cpp	https://github.com/ggml-org/llama.cpp/discussions/16938
guide : running gpt-oss with llama.cpp	https://github.com/ggml-org/llama.cpp/discussions/15396
[FEEDBACK] Better packaging for llama.cpp to support downstream consumers 🤗	https://github.com/ggml-org/llama.cpp/discussions/15313
PR	https://github.com/ggml-org/llama.cpp/pull/15091
Collaboration with NVIDIA	https://blogs.nvidia.com/blog/rtx-ai-garage-openai-oss
Comment	https://github.com/ggml-org/llama.cpp/discussions/15095
#12898	https://github.com/ggml-org/llama.cpp/pull/12898
documentation	https://github.com/Gomez12/llama.cpp/blob/master/docs/multimodal.md
https://github.com/ggml-org/llama.vscode	https://github.com/ggml-org/llama.vscode
https://github.com/ggml-org/llama.vim	https://github.com/ggml-org/llama.vim
ggml-org#9669	https://github.com/ggml-org/llama.cpp/discussions/9669
discussion	https://github.com/ggml-org/llama.cpp/discussions/9268
tool	https://huggingface.co/spaces/CISCai/gguf-editor
	https://github.com/Gomez12/llama.cpp#quick-start
brew, nix or winget	https://github.com/Gomez12/llama.cpp/blob/master/docs/install.md
Docker documentation	https://github.com/Gomez12/llama.cpp/blob/master/docs/docker.md
releases page	https://github.com/ggml-org/llama.cpp/releases
our build guide	https://github.com/Gomez12/llama.cpp/blob/master/docs/build.md
Obtaining and quantizing models	https://github.com/Gomez12/llama.cpp#obtaining-and-quantizing-models
	https://github.com/Gomez12/llama.cpp#description
ggml	https://github.com/ggml-org/ggml
HOWTO-add-model.md	https://github.com/Gomez12/llama.cpp/blob/master/docs/development/HOWTO-add-model.md
	https://github.com/Gomez12/llama.cpp#text-only
Mistral 7B	https://huggingface.co/mistralai/Mistral-7B-v0.1
Mixtral MoE	https://huggingface.co/models?search=mistral-ai/Mixtral
DBRX	https://huggingface.co/databricks/dbrx-instruct
Jamba	https://huggingface.co/ai21labs
Falcon	https://huggingface.co/models?search=tiiuae/falcon
Chinese LLaMA / Alpaca	https://github.com/ymcui/Chinese-LLaMA-Alpaca
Chinese LLaMA-2 / Alpaca-2	https://github.com/ymcui/Chinese-LLaMA-Alpaca-2
Vigogne (French)	https://github.com/bofenghuang/vigogne
BERT	https://github.com/ggml-org/llama.cpp/pull/5423
Koala	https://bair.berkeley.edu/blog/2023/04/03/koala/
Baichuan 1 & 2	https://huggingface.co/models?search=baichuan-inc/Baichuan
derivations	https://huggingface.co/hiyouga/baichuan-7b-sft
Aquila 1 & 2	https://huggingface.co/models?search=BAAI/Aquila
Starcoder models	https://github.com/ggml-org/llama.cpp/pull/3187
Refact	https://huggingface.co/smallcloudai/Refact-1_6B-fim
MPT	https://github.com/ggml-org/llama.cpp/pull/3417
Bloom	https://github.com/ggml-org/llama.cpp/pull/3553
Yi models	https://huggingface.co/models?search=01-ai/Yi
StableLM models	https://huggingface.co/stabilityai
Deepseek models	https://huggingface.co/models?search=deepseek-ai/deepseek
Qwen models	https://huggingface.co/models?search=Qwen/Qwen
PLaMo-13B	https://github.com/ggml-org/llama.cpp/pull/3557
Phi models	https://huggingface.co/models?search=microsoft/phi
PhiMoE	https://github.com/ggml-org/llama.cpp/pull/11003
GPT-2	https://huggingface.co/gpt2
Orion 14B	https://github.com/ggml-org/llama.cpp/pull/5118
InternLM2	https://huggingface.co/models?search=internlm2
CodeShell	https://github.com/WisdomShell/codeshell
Gemma	https://ai.google.dev/gemma
Mamba	https://github.com/state-spaces/mamba
Grok-1	https://huggingface.co/keyfan/grok-1-hf
Xverse	https://huggingface.co/models?search=xverse
Command-R models	https://huggingface.co/models?search=CohereForAI/c4ai-command-r
SEA-LION	https://huggingface.co/models?search=sea-lion
GritLM-7B	https://huggingface.co/GritLM/GritLM-7B
GritLM-8x7B	https://huggingface.co/GritLM/GritLM-8x7B
OLMo	https://allenai.org/olmo
OLMo 2	https://allenai.org/olmo
OLMoE	https://huggingface.co/allenai/OLMoE-1B-7B-0924
Granite models	https://huggingface.co/collections/ibm-granite/granite-code-models-6624c5cec322e4c148c8b330
GPT-NeoX	https://github.com/EleutherAI/gpt-neox
Pythia	https://github.com/EleutherAI/pythia
Snowflake-Arctic MoE	https://huggingface.co/collections/Snowflake/arctic-66290090abe542894a5ac520
Smaug	https://huggingface.co/models?search=Smaug
Poro 34B	https://huggingface.co/LumiOpen/Poro-34B
Bitnet b1.58 models	https://huggingface.co/1bitLLM
Flan T5	https://huggingface.co/models?search=flan-t5
Open Elm models	https://huggingface.co/collections/apple/openelm-instruct-models-6619ad295d7ae9f868b759ca
ChatGLM3-6b	https://huggingface.co/THUDM/chatglm3-6b
ChatGLM4-9b	https://huggingface.co/THUDM/glm-4-9b
GLMEdge-1.5b	https://huggingface.co/THUDM/glm-edge-1.5b-chat
GLMEdge-4b	https://huggingface.co/THUDM/glm-edge-4b-chat
GLM-4-0414	https://huggingface.co/collections/THUDM/glm-4-0414-67f3cbcb34dd9d252707cb2e
SmolLM	https://huggingface.co/collections/HuggingFaceTB/smollm-6695016cad7167254ce15966
EXAONE-3.0-7.8B-Instruct	https://huggingface.co/LGAI-EXAONE/EXAONE-3.0-7.8B-Instruct
FalconMamba Models	https://huggingface.co/collections/tiiuae/falconmamba-7b-66b9a580324dd1598b0f6d4a
Jais	https://huggingface.co/inceptionai/jais-13b-chat
Bielik-11B-v2.3	https://huggingface.co/collections/speakleash/bielik-11b-v23-66ee813238d9b526a072408a
RWKV-6	https://github.com/BlinkDL/RWKV-LM
QRWKV-6	https://huggingface.co/recursal/QRWKV6-32B-Instruct-Preview-v0.1
GigaChat-20B-A3B	https://huggingface.co/ai-sage/GigaChat-20B-A3B-instruct
Trillion-7B-preview	https://huggingface.co/trillionlabs/Trillion-7B-preview
Ling models	https://huggingface.co/collections/inclusionAI/ling-67c51c85b34a7ea0aba94c32
LFM2 models	https://huggingface.co/collections/LiquidAI/lfm2-686d721927015b2ad73eaa38
Hunyuan models	https://huggingface.co/collections/tencent/hunyuan-dense-model-6890632cda26b19119c9c5e7
BailingMoeV2 (Ring/Ling 2.0) models	https://huggingface.co/collections/inclusionAI/ling-v2-68bf1dd2fc34c306c1fa6f86
	https://github.com/Gomez12/llama.cpp#multimodal
LLaVA 1.5 models	https://huggingface.co/collections/liuhaotian/llava-15-653aac15d994e992e2677a7e
LLaVA 1.6 models	https://huggingface.co/collections/liuhaotian/llava-16-65b9e40155f60fd046a5ccf2
BakLLaVA	https://huggingface.co/models?search=SkunkworksAI/Bakllava
Obsidian	https://huggingface.co/NousResearch/Obsidian-3B-V0.5
ShareGPT4V	https://huggingface.co/models?search=Lin-Chen/ShareGPT4V
MobileVLM 1.7B/3B models	https://huggingface.co/models?search=mobileVLM
Yi-VL	https://huggingface.co/models?search=Yi-VL
Mini CPM	https://huggingface.co/models?search=MiniCPM
Moondream	https://huggingface.co/vikhyatk/moondream2
Bunny	https://github.com/BAAI-DCAI/Bunny
GLM-EDGE	https://huggingface.co/models?search=glm-edge
Qwen2-VL	https://huggingface.co/collections/Qwen/qwen2-vl-66cee7455501d7126940800d
LFM2-VL	https://huggingface.co/collections/LiquidAI/lfm2-vl-68963bbc84a610f7638d5ffa
ddh0/easy-llama	https://github.com/ddh0/easy-llama
abetlen/llama-cpp-python	https://github.com/abetlen/llama-cpp-python
go-skynet/go-llama.cpp	https://github.com/go-skynet/go-llama.cpp
withcatai/node-llama-cpp	https://github.com/withcatai/node-llama-cpp
lgrammel/modelfusion	https://modelfusion.dev/integration/model-provider/llamacpp
offline-ai/cli	https://github.com/offline-ai/cli
tangledgroup/llama-cpp-wasm	https://github.com/tangledgroup/llama-cpp-wasm
ngxson/wllama	https://github.com/ngxson/wllama
yoshoku/llama_cpp.rb	https://github.com/yoshoku/llama_cpp.rb
edgenai/llama_cpp-rs	https://github.com/edgenai/llama_cpp-rs
mdrokz/rust-llama.cpp	https://github.com/mdrokz/rust-llama.cpp
utilityai/llama-cpp-rs	https://github.com/utilityai/llama-cpp-rs
ShelbyJenkins/llm_client	https://github.com/ShelbyJenkins/llm_client
SciSharp/LLamaSharp	https://github.com/SciSharp/LLamaSharp
LM-Kit.NET	https://docs.lm-kit.com/lm-kit-net/index.html
donderom/llm4s	https://github.com/donderom/llm4s
phronmophobic/llama.clj	https://github.com/phronmophobic/llama.clj
mybigday/llama.rn	https://github.com/mybigday/llama.rn
kherud/java-llama.cpp	https://github.com/kherud/java-llama.cpp
QuasarByte/llama-cpp-jna	https://github.com/QuasarByte/llama-cpp-jna
deins/llama.cpp.zig	https://github.com/Deins/llama.cpp.zig
netdur/llama_cpp_dart	https://github.com/netdur/llama_cpp_dart
xuegao-tzx/Fllama	https://github.com/xuegao-tzx/Fllama
distantmagic/resonance	https://github.com/distantmagic/resonance
(more info)	https://github.com/ggml-org/llama.cpp/pull/6326
guile_llama_cpp	https://savannah.nongnu.org/projects/guile-llama-cpp
srgtuszy/llama-cpp-swift	https://github.com/srgtuszy/llama-cpp-swift
ShenghaiWang/SwiftLlama	https://github.com/ShenghaiWang/SwiftLlama
Embarcadero/llama-cpp-delphi	https://github.com/Embarcadero/llama-cpp-delphi
hybridgroup/yzma	https://github.com/hybridgroup/yzma
llama.android	https://github.com/Gomez12/llama.cpp/blob/master/examples/llama.android
AI Sublime Text plugin	https://github.com/yaroslavyaroslav/OpenAI-sublime-text
BonzAI App	https://apps.apple.com/us/app/bonzai-your-local-ai-agent/id6752847988
cztomsik/ava	https://github.com/cztomsik/ava
Dot	https://github.com/alexpinel/Dot
eva	https://github.com/ylsdamxssjxxdd/eva
iohub/collama	https://github.com/iohub/coLLaMA
janhq/jan	https://github.com/janhq/jan
johnbean393/Sidekick	https://github.com/johnbean393/Sidekick
KanTV	https://github.com/zhouwg/kantv?tab=readme-ov-file
KodiBot	https://github.com/firatkiral/kodibot
llama.vim	https://github.com/ggml-org/llama.vim
LARS	https://github.com/abgulati/LARS
Llama Assistant	https://github.com/vietanhdev/llama-assistant
LLMFarm	https://github.com/guinmoon/LLMFarm?tab=readme-ov-file
LLMUnity	https://github.com/undreamai/LLMUnity
LMStudio	https://lmstudio.ai/
LocalAI	https://github.com/mudler/LocalAI
LostRuins/koboldcpp	https://github.com/LostRuins/koboldcpp
MindMac	https://mindmac.app
MindWorkAI/AI-Studio	https://github.com/MindWorkAI/AI-Studio
Mobile-Artificial-Intelligence/maid	https://github.com/Mobile-Artificial-Intelligence/maid
Mozilla-Ocho/llamafile	https://github.com/Mozilla-Ocho/llamafile
nat/openplayground	https://github.com/nat/openplayground
nomic-ai/gpt4all	https://github.com/nomic-ai/gpt4all
ollama/ollama	https://github.com/ollama/ollama
oobabooga/text-generation-webui	https://github.com/oobabooga/text-generation-webui
PocketPal AI	https://github.com/a-ghorbani/pocketpal-ai
psugihara/FreeChat	https://github.com/psugihara/FreeChat
ptsochantaris/emeltal	https://github.com/ptsochantaris/emeltal
pythops/tenere	https://github.com/pythops/tenere
ramalama	https://github.com/containers/ramalama
semperai/amica	https://github.com/semperai/amica
withcatai/catai	https://github.com/withcatai/catai
Autopen	https://github.com/blackhole89/autopen
akx/ggify	https://github.com/akx/ggify
akx/ollama-dl	https://github.com/akx/ollama-dl
crashr/gppm	https://github.com/crashr/gppm
gpustack/gguf-parser	https://github.com/gpustack/gguf-parser-go/tree/main/cmd/gguf-parser
Styled Lines	https://marketplace.unity.com/packages/tools/generative-ai/styled-lines-llama-cpp-model-292902
unslothai/unsloth	https://github.com/unslothai/unsloth
Paddler	https://github.com/intentee/paddler
GPUStack	https://github.com/gpustack/gpustack
llama_cpp_canister	https://github.com/onicai/llama_cpp_canister
llama-swap	https://github.com/mostlygeek/llama-swap
Kalavai	https://github.com/kalavai-net/kalavai-client
llmaz	https://github.com/InftyAI/llmaz
Lucy's Labyrinth	https://github.com/MorganRO8/Lucys_Labyrinth
	https://github.com/Gomez12/llama.cpp#supported-backends
Metal	https://github.com/Gomez12/llama.cpp/blob/master/docs/build.md#metal-build
BLAS	https://github.com/Gomez12/llama.cpp/blob/master/docs/build.md#blas-build
BLIS	https://github.com/Gomez12/llama.cpp/blob/master/docs/backend/BLIS.md
SYCL	https://github.com/Gomez12/llama.cpp/blob/master/docs/backend/SYCL.md
MUSA	https://github.com/Gomez12/llama.cpp/blob/master/docs/build.md#musa
CUDA	https://github.com/Gomez12/llama.cpp/blob/master/docs/build.md#cuda
HIP	https://github.com/Gomez12/llama.cpp/blob/master/docs/build.md#hip
ZenDNN	https://github.com/Gomez12/llama.cpp/blob/master/docs/build.md#zendnn
Vulkan	https://github.com/Gomez12/llama.cpp/blob/master/docs/build.md#vulkan
CANN	https://github.com/Gomez12/llama.cpp/blob/master/docs/build.md#cann
OpenCL	https://github.com/Gomez12/llama.cpp/blob/master/docs/backend/OPENCL.md
IBM zDNN	https://github.com/Gomez12/llama.cpp/blob/master/docs/backend/zDNN.md
WebGPU [In Progress]	https://github.com/Gomez12/llama.cpp/blob/master/docs/build.md#webgpu
RPC	https://github.com/ggml-org/llama.cpp/tree/master/tools/rpc
Hexagon [In Progress]	https://github.com/Gomez12/llama.cpp/blob/master/docs/backend/hexagon/README.md
	https://github.com/Gomez12/llama.cpp#obtaining-and-quantizing-models
Hugging Face	https://huggingface.co
number of LLMs	https://huggingface.co/models?library=gguf&sort=trending
Trending	https://huggingface.co/models?library=gguf&sort=trending
LLaMA	https://huggingface.co/models?sort=trending&search=llama+gguf
Hugging Face	https://huggingface.co/
ModelScope	https://modelscope.cn/
GGUF	https://github.com/ggml-org/ggml/blob/master/docs/gguf.md
GGUF-my-repo space	https://huggingface.co/spaces/ggml-org/gguf-my-repo
GGUF-my-LoRA space	https://huggingface.co/spaces/ggml-org/gguf-my-lora
ggml-org#10123	https://github.com/ggml-org/llama.cpp/discussions/10123
GGUF-editor space	https://huggingface.co/spaces/CISCai/gguf-editor
ggml-org#9268	https://github.com/ggml-org/llama.cpp/discussions/9268
Inference Endpoints	https://ui.endpoints.huggingface.co/
ggml-org#9669	https://github.com/ggml-org/llama.cpp/discussions/9669
read this documentation	https://github.com/Gomez12/llama.cpp/blob/master/tools/quantize/README.md
llama-cli	https://github.com/Gomez12/llama.cpp/blob/master/tools/cli
	https://github.com/Gomez12/llama.cpp#llama-cli
	https://github.com/Gomez12/llama.cpp#a-cli-tool-for-accessing-and-experimenting-with-most-of-llamacpps-functionality
grammars/	https://github.com/Gomez12/llama.cpp/blob/master/grammars
GBNF Guide	https://github.com/Gomez12/llama.cpp/blob/master/grammars/README.md
https://grammar.intrinsiclabs.ai/	https://grammar.intrinsiclabs.ai/
llama-server	https://github.com/Gomez12/llama.cpp/blob/master/tools/server
	https://github.com/Gomez12/llama.cpp#llama-server
OpenAI API	https://github.com/openai/openai-openapi
	https://github.com/Gomez12/llama.cpp#a-lightweight-openai-api-compatible-http-server-for-serving-llms
llama-perplexity	https://github.com/Gomez12/llama.cpp/blob/master/tools/perplexity
	https://github.com/Gomez12/llama.cpp#llama-perplexity
perplexity	https://github.com/Gomez12/llama.cpp/blob/master/tools/perplexity/README.md
1	https://github.com/Gomez12/llama.cpp#user-content-fn-1-048d91990ad10561a76ae941167d0901
	https://github.com/Gomez12/llama.cpp#a-tool-for-measuring-the-perplexity-1-and-other-quality-metrics-of-a-model-over-a-given-text
llama-bench	https://github.com/Gomez12/llama.cpp/blob/master/tools/llama-bench
	https://github.com/Gomez12/llama.cpp#llama-bench
	https://github.com/Gomez12/llama.cpp#benchmark-the-performance-of-the-inference-for-various-parameters
llama-simple	https://github.com/Gomez12/llama.cpp/blob/master/examples/simple
	https://github.com/Gomez12/llama.cpp#llama-simple
	https://github.com/Gomez12/llama.cpp#a-minimal-example-for-implementing-apps-with-llamacpp-useful-for-developers
	https://github.com/Gomez12/llama.cpp#contributing
good first issues	https://github.com/ggml-org/llama.cpp/issues?q=is%3Aissue+is%3Aopen+label%3A%22good+first+issue%22
CONTRIBUTING.md	https://github.com/Gomez12/llama.cpp/blob/master/CONTRIBUTING.md
Inference at the edge	https://github.com/ggml-org/llama.cpp/discussions/205
Changelog podcast	https://changelog.com/podcast/532
	https://github.com/Gomez12/llama.cpp#other-documentation
cli	https://github.com/Gomez12/llama.cpp/blob/master/tools/cli/README.md
completion	https://github.com/Gomez12/llama.cpp/blob/master/tools/completion/README.md
server	https://github.com/Gomez12/llama.cpp/blob/master/tools/server/README.md
GBNF grammars	https://github.com/Gomez12/llama.cpp/blob/master/grammars/README.md
	https://github.com/Gomez12/llama.cpp#development-documentation
How to build	https://github.com/Gomez12/llama.cpp/blob/master/docs/build.md
Running on Docker	https://github.com/Gomez12/llama.cpp/blob/master/docs/docker.md
Build on Android	https://github.com/Gomez12/llama.cpp/blob/master/docs/android.md
Performance troubleshooting	https://github.com/Gomez12/llama.cpp/blob/master/docs/development/token_generation_performance_tips.md
GGML tips & tricks	https://github.com/ggml-org/llama.cpp/wiki/GGML-Tips-&-Tricks
	https://github.com/Gomez12/llama.cpp#seminal-papers-and-background-on-the-models
Introducing LLaMA: A foundational, 65-billion-parameter large language model	https://ai.facebook.com/blog/large-language-model-llama-meta-ai/
LLaMA: Open and Efficient Foundation Language Models	https://arxiv.org/abs/2302.13971
Language Models are Few-Shot Learners	https://arxiv.org/abs/2005.14165
Aligning language models to follow instructions	https://openai.com/research/instruction-following
Training language models to follow instructions with human feedback	https://arxiv.org/abs/2203.02155
	https://github.com/Gomez12/llama.cpp#xcframework
	https://github.com/Gomez12/llama.cpp#completions
	https://github.com/Gomez12/llama.cpp#bash-completion
	https://github.com/Gomez12/llama.cpp#dependencies
yhirose/cpp-httplib	https://github.com/yhirose/cpp-httplib
stb-image	https://github.com/nothings/stb
nlohmann/json	https://github.com/nlohmann/json
minja	https://github.com/google/minja
curl	https://curl.se/
CURL License	https://curl.se/docs/copyright.html
miniaudio.h	https://github.com/mackron/miniaudio
subprocess.h	https://github.com/sheredom/subprocess.h
https://huggingface.co/docs/transformers/perplexity	https://huggingface.co/docs/transformers/perplexity
↩	https://github.com/Gomez12/llama.cpp#user-content-fnref-1-048d91990ad10561a76ae941167d0901
Readme	https://github.com/Gomez12/llama.cpp#readme-ov-file
MIT license	https://github.com/Gomez12/llama.cpp#MIT-1-ov-file
Contributing	https://github.com/Gomez12/llama.cpp#contributing-ov-file
Security policy	https://github.com/Gomez12/llama.cpp#security-ov-file
Please reload this page	https://github.com/Gomez12/llama.cpp
Activity	https://github.com/Gomez12/llama.cpp/activity
0 stars	https://github.com/Gomez12/llama.cpp/stargazers
0 watching	https://github.com/Gomez12/llama.cpp/watchers
0 forks	https://github.com/Gomez12/llama.cpp/forks
Report repository	https://github.com/contact/report-content?content_url=https%3A%2F%2Fgithub.com%2FGomez12%2Fllama.cpp&report=Gomez12+%28user%29
Releases	https://github.com/Gomez12/llama.cpp/releases
3 tags	https://github.com/Gomez12/llama.cpp/tags
Packages 0	https://github.com/users/Gomez12/packages?repo_name=llama.cpp
Please reload this page	https://github.com/Gomez12/llama.cpp
Please reload this page	https://github.com/Gomez12/llama.cpp
	https://github.com
Terms	https://docs.github.com/site-policy/github-terms/github-terms-of-service
Privacy	https://docs.github.com/site-policy/privacy-policies/github-privacy-statement
Security	https://github.com/security
Status	https://www.githubstatus.com/
Community	https://github.community/
Docs	https://docs.github.com/
Contact	https://support.github.com?tags=dotcom-footer

Viewport: width=device-width

URLs of crawlers that visited me.