René's URL Explorer Experiment


Title: GitHub - Gomez12/llama.cpp: LLM inference in C/C++

Open Graph Title: GitHub - Gomez12/llama.cpp: LLM inference in C/C++

X Title: GitHub - Gomez12/llama.cpp: LLM inference in C/C++

Description: LLM inference in C/C++. Contribute to Gomez12/llama.cpp development by creating an account on GitHub.

Open Graph Description: LLM inference in C/C++. Contribute to Gomez12/llama.cpp development by creating an account on GitHub.

X Description: LLM inference in C/C++. Contribute to Gomez12/llama.cpp development by creating an account on GitHub.

Opengraph URL: https://github.com/Gomez12/llama.cpp

X: @github

direct link

Domain: github.com

route-pattern/:user_id/:repository
route-controllerfiles
route-actiondisambiguate
fetch-noncev2:acf23ad5-bea9-554c-144d-f6101aa7186c
current-catalog-service-hashf3abb0cc802f3d7b95fc8762b94bdcb13bf39634c40c357301c4aa1d67a256fb
request-idA094:23CDC6:DF0B45:12A6412:69698AA8
html-safe-noncec38244b1ec0c4a2ce0695e614248aa2cedf1578ed6f292e0936ae3beaa60d0ea
visitor-payloadeyJyZWZlcnJlciI6IiIsInJlcXVlc3RfaWQiOiJBMDk0OjIzQ0RDNjpERjBCNDU6MTJBNjQxMjo2OTY5OEFBOCIsInZpc2l0b3JfaWQiOiIxNjk4MzI2MDk3NDg1NzI4NDI1IiwicmVnaW9uX2VkZ2UiOiJpYWQiLCJyZWdpb25fcmVuZGVyIjoiaWFkIn0=
visitor-hmaca1ac91aa7592b519bc15643dc5a2a3e4637c76bc4758bb2074c0e758735f3bce
hovercard-subject-tagrepository:1111661254
github-keyboard-shortcutsrepository,copilot
google-site-verificationApib7-x98H0j5cPqHWwSMm6dNU4GmODRoqxLiDzdx9I
octolytics-urlhttps://collector.github.com/github/collect
analytics-location//
fb:app_id1401488693436528
apple-itunes-appapp-id=1477376905, app-argument=https://github.com/Gomez12/llama.cpp
twitter:imagehttps://opengraph.githubassets.com/36a8963e07770e3e0c1cd6363c0f7e016cae98af51a797f6842cbee495ab82dd/Gomez12/llama.cpp
twitter:cardsummary_large_image
og:imagehttps://opengraph.githubassets.com/36a8963e07770e3e0c1cd6363c0f7e016cae98af51a797f6842cbee495ab82dd/Gomez12/llama.cpp
og:image:altLLM inference in C/C++. Contribute to Gomez12/llama.cpp development by creating an account on GitHub.
og:image:width1200
og:image:height600
og:site_nameGitHub
og:typeobject
hostnamegithub.com
expected-hostnamegithub.com
None533e7cac596c452090972c1150d587fd0b36531b8dc4e8bbfe4ab694aca02408
turbo-cache-controlno-preview
go-importgithub.com/Gomez12/llama.cpp git https://github.com/Gomez12/llama.cpp.git
octolytics-dimension-user_id138985
octolytics-dimension-user_loginGomez12
octolytics-dimension-repository_id1111661254
octolytics-dimension-repository_nwoGomez12/llama.cpp
octolytics-dimension-repository_publictrue
octolytics-dimension-repository_is_forktrue
octolytics-dimension-repository_parent_id612354784
octolytics-dimension-repository_parent_nwoggml-org/llama.cpp
octolytics-dimension-repository_network_root_id612354784
octolytics-dimension-repository_network_root_nwoggml-org/llama.cpp
turbo-body-classeslogged-out env-production page-responsive
disable-turbofalse
browser-stats-urlhttps://api.github.com/_private/browser/stats
browser-errors-urlhttps://api.github.com/_private/browser/errors
release63d27af10eea2ccab520b162530cf6c7b739e767
ui-targetfull
theme-color#1e2327
color-schemelight dark

Links:

Skip to contenthttps://github.com/Gomez12/llama.cpp#start-of-content
https://github.com/
Sign in https://github.com/login?return_to=https%3A%2F%2Fgithub.com%2FGomez12%2Fllama.cpp
GitHub CopilotWrite better code with AIhttps://github.com/features/copilot
GitHub SparkBuild and deploy intelligent appshttps://github.com/features/spark
GitHub ModelsManage and compare promptshttps://github.com/features/models
MCP RegistryNewIntegrate external toolshttps://github.com/mcp
ActionsAutomate any workflowhttps://github.com/features/actions
CodespacesInstant dev environmentshttps://github.com/features/codespaces
IssuesPlan and track workhttps://github.com/features/issues
Code ReviewManage code changeshttps://github.com/features/code-review
GitHub Advanced SecurityFind and fix vulnerabilitieshttps://github.com/security/advanced-security
Code securitySecure your code as you buildhttps://github.com/security/advanced-security/code-security
Secret protectionStop leaks before they starthttps://github.com/security/advanced-security/secret-protection
Why GitHubhttps://github.com/why-github
Documentationhttps://docs.github.com
Bloghttps://github.blog
Changeloghttps://github.blog/changelog
Marketplacehttps://github.com/marketplace
View all featureshttps://github.com/features
Enterpriseshttps://github.com/enterprise
Small and medium teamshttps://github.com/team
Startupshttps://github.com/enterprise/startups
Nonprofitshttps://github.com/solutions/industry/nonprofits
App Modernizationhttps://github.com/solutions/use-case/app-modernization
DevSecOpshttps://github.com/solutions/use-case/devsecops
DevOpshttps://github.com/solutions/use-case/devops
CI/CDhttps://github.com/solutions/use-case/ci-cd
View all use caseshttps://github.com/solutions/use-case
Healthcarehttps://github.com/solutions/industry/healthcare
Financial serviceshttps://github.com/solutions/industry/financial-services
Manufacturinghttps://github.com/solutions/industry/manufacturing
Governmenthttps://github.com/solutions/industry/government
View all industrieshttps://github.com/solutions/industry
View all solutionshttps://github.com/solutions
AIhttps://github.com/resources/articles?topic=ai
Software Developmenthttps://github.com/resources/articles?topic=software-development
DevOpshttps://github.com/resources/articles?topic=devops
Securityhttps://github.com/resources/articles?topic=security
View all topicshttps://github.com/resources/articles
Customer storieshttps://github.com/customer-stories
Events & webinarshttps://github.com/resources/events
Ebooks & reportshttps://github.com/resources/whitepapers
Business insightshttps://github.com/solutions/executive-insights
GitHub Skillshttps://skills.github.com
Documentationhttps://docs.github.com
Customer supporthttps://support.github.com
Community forumhttps://github.com/orgs/community/discussions
Trust centerhttps://github.com/trust-center
Partnershttps://github.com/partners
GitHub SponsorsFund open source developershttps://github.com/sponsors
Security Labhttps://securitylab.github.com
Maintainer Communityhttps://maintainers.github.com
Acceleratorhttps://github.com/accelerator
Archive Programhttps://archiveprogram.github.com
Topicshttps://github.com/topics
Trendinghttps://github.com/trending
Collectionshttps://github.com/collections
Enterprise platformAI-powered developer platformhttps://github.com/enterprise
GitHub Advanced SecurityEnterprise-grade security featureshttps://github.com/security/advanced-security
Copilot for BusinessEnterprise-grade AI featureshttps://github.com/features/copilot/copilot-business
Premium SupportEnterprise-grade 24/7 supporthttps://github.com/premium-support
Pricinghttps://github.com/pricing
Search syntax tipshttps://docs.github.com/search-github/github-code-search/understanding-github-code-search-syntax
documentationhttps://docs.github.com/search-github/github-code-search/understanding-github-code-search-syntax
Sign in https://github.com/login?return_to=https%3A%2F%2Fgithub.com%2FGomez12%2Fllama.cpp
Sign up https://github.com/signup?ref_cta=Sign+up&ref_loc=header+logged+out&ref_page=%2F%3Cuser-name%3E%2F%3Crepo-name%3E&source=header-repo&source_repo=Gomez12%2Fllama.cpp
Reloadhttps://github.com/Gomez12/llama.cpp
Reloadhttps://github.com/Gomez12/llama.cpp
Reloadhttps://github.com/Gomez12/llama.cpp
Gomez12 https://github.com/Gomez12
llama.cpphttps://github.com/Gomez12/llama.cpp
ggml-org/llama.cpphttps://github.com/ggml-org/llama.cpp
Notifications https://github.com/login?return_to=%2FGomez12%2Fllama.cpp
Fork 0 https://github.com/login?return_to=%2FGomez12%2Fllama.cpp
Star 0 https://github.com/login?return_to=%2FGomez12%2Fllama.cpp
MIT license https://github.com/Gomez12/llama.cpp/blob/master/LICENSE
0 stars https://github.com/Gomez12/llama.cpp/stargazers
14.5k forks https://github.com/Gomez12/llama.cpp/forks
Branches https://github.com/Gomez12/llama.cpp/branches
Tags https://github.com/Gomez12/llama.cpp/tags
Activity https://github.com/Gomez12/llama.cpp/activity
Star https://github.com/login?return_to=%2FGomez12%2Fllama.cpp
Notifications https://github.com/login?return_to=%2FGomez12%2Fllama.cpp
Code https://github.com/Gomez12/llama.cpp
Pull requests 2 https://github.com/Gomez12/llama.cpp/pulls
Actions https://github.com/Gomez12/llama.cpp/actions
Projects 0 https://github.com/Gomez12/llama.cpp/projects
Security Uh oh! There was an error while loading. Please reload this page. https://github.com/Gomez12/llama.cpp/security
Please reload this pagehttps://github.com/Gomez12/llama.cpp
Insights https://github.com/Gomez12/llama.cpp/pulse
Code https://github.com/Gomez12/llama.cpp
Pull requests https://github.com/Gomez12/llama.cpp/pulls
Actions https://github.com/Gomez12/llama.cpp/actions
Projects https://github.com/Gomez12/llama.cpp/projects
Security https://github.com/Gomez12/llama.cpp/security
Insights https://github.com/Gomez12/llama.cpp/pulse
Brancheshttps://github.com/Gomez12/llama.cpp/branches
Tagshttps://github.com/Gomez12/llama.cpp/tags
https://github.com/Gomez12/llama.cpp/branches
https://github.com/Gomez12/llama.cpp/tags
7,717 Commitshttps://github.com/Gomez12/llama.cpp/commits/master/
https://github.com/Gomez12/llama.cpp/commits/master/
.devopshttps://github.com/Gomez12/llama.cpp/tree/master/.devops
.devopshttps://github.com/Gomez12/llama.cpp/tree/master/.devops
.geminihttps://github.com/Gomez12/llama.cpp/tree/master/.gemini
.geminihttps://github.com/Gomez12/llama.cpp/tree/master/.gemini
.githubhttps://github.com/Gomez12/llama.cpp/tree/master/.github
.githubhttps://github.com/Gomez12/llama.cpp/tree/master/.github
benches/dgx-sparkhttps://github.com/Gomez12/llama.cpp/tree/master/benches/dgx-spark
benches/dgx-sparkhttps://github.com/Gomez12/llama.cpp/tree/master/benches/dgx-spark
cihttps://github.com/Gomez12/llama.cpp/tree/master/ci
cihttps://github.com/Gomez12/llama.cpp/tree/master/ci
cmakehttps://github.com/Gomez12/llama.cpp/tree/master/cmake
cmakehttps://github.com/Gomez12/llama.cpp/tree/master/cmake
commonhttps://github.com/Gomez12/llama.cpp/tree/master/common
commonhttps://github.com/Gomez12/llama.cpp/tree/master/common
docshttps://github.com/Gomez12/llama.cpp/tree/master/docs
docshttps://github.com/Gomez12/llama.cpp/tree/master/docs
exampleshttps://github.com/Gomez12/llama.cpp/tree/master/examples
exampleshttps://github.com/Gomez12/llama.cpp/tree/master/examples
ggmlhttps://github.com/Gomez12/llama.cpp/tree/master/ggml
ggmlhttps://github.com/Gomez12/llama.cpp/tree/master/ggml
gguf-pyhttps://github.com/Gomez12/llama.cpp/tree/master/gguf-py
gguf-pyhttps://github.com/Gomez12/llama.cpp/tree/master/gguf-py
grammarshttps://github.com/Gomez12/llama.cpp/tree/master/grammars
grammarshttps://github.com/Gomez12/llama.cpp/tree/master/grammars
includehttps://github.com/Gomez12/llama.cpp/tree/master/include
includehttps://github.com/Gomez12/llama.cpp/tree/master/include
licenseshttps://github.com/Gomez12/llama.cpp/tree/master/licenses
licenseshttps://github.com/Gomez12/llama.cpp/tree/master/licenses
mediahttps://github.com/Gomez12/llama.cpp/tree/master/media
mediahttps://github.com/Gomez12/llama.cpp/tree/master/media
modelshttps://github.com/Gomez12/llama.cpp/tree/master/models
modelshttps://github.com/Gomez12/llama.cpp/tree/master/models
pocshttps://github.com/Gomez12/llama.cpp/tree/master/pocs
pocshttps://github.com/Gomez12/llama.cpp/tree/master/pocs
requirementshttps://github.com/Gomez12/llama.cpp/tree/master/requirements
requirementshttps://github.com/Gomez12/llama.cpp/tree/master/requirements
scriptshttps://github.com/Gomez12/llama.cpp/tree/master/scripts
scriptshttps://github.com/Gomez12/llama.cpp/tree/master/scripts
srchttps://github.com/Gomez12/llama.cpp/tree/master/src
srchttps://github.com/Gomez12/llama.cpp/tree/master/src
testshttps://github.com/Gomez12/llama.cpp/tree/master/tests
testshttps://github.com/Gomez12/llama.cpp/tree/master/tests
toolshttps://github.com/Gomez12/llama.cpp/tree/master/tools
toolshttps://github.com/Gomez12/llama.cpp/tree/master/tools
vendorhttps://github.com/Gomez12/llama.cpp/tree/master/vendor
vendorhttps://github.com/Gomez12/llama.cpp/tree/master/vendor
.clang-formathttps://github.com/Gomez12/llama.cpp/blob/master/.clang-format
.clang-formathttps://github.com/Gomez12/llama.cpp/blob/master/.clang-format
.clang-tidyhttps://github.com/Gomez12/llama.cpp/blob/master/.clang-tidy
.clang-tidyhttps://github.com/Gomez12/llama.cpp/blob/master/.clang-tidy
.dockerignorehttps://github.com/Gomez12/llama.cpp/blob/master/.dockerignore
.dockerignorehttps://github.com/Gomez12/llama.cpp/blob/master/.dockerignore
.ecrchttps://github.com/Gomez12/llama.cpp/blob/master/.ecrc
.ecrchttps://github.com/Gomez12/llama.cpp/blob/master/.ecrc
.editorconfighttps://github.com/Gomez12/llama.cpp/blob/master/.editorconfig
.editorconfighttps://github.com/Gomez12/llama.cpp/blob/master/.editorconfig
.flake8https://github.com/Gomez12/llama.cpp/blob/master/.flake8
.flake8https://github.com/Gomez12/llama.cpp/blob/master/.flake8
.gitignorehttps://github.com/Gomez12/llama.cpp/blob/master/.gitignore
.gitignorehttps://github.com/Gomez12/llama.cpp/blob/master/.gitignore
.gitmoduleshttps://github.com/Gomez12/llama.cpp/blob/master/.gitmodules
.gitmoduleshttps://github.com/Gomez12/llama.cpp/blob/master/.gitmodules
.pre-commit-config.yamlhttps://github.com/Gomez12/llama.cpp/blob/master/.pre-commit-config.yaml
.pre-commit-config.yamlhttps://github.com/Gomez12/llama.cpp/blob/master/.pre-commit-config.yaml
AGENTS.mdhttps://github.com/Gomez12/llama.cpp/blob/master/AGENTS.md
AGENTS.mdhttps://github.com/Gomez12/llama.cpp/blob/master/AGENTS.md
AUTHORShttps://github.com/Gomez12/llama.cpp/blob/master/AUTHORS
AUTHORShttps://github.com/Gomez12/llama.cpp/blob/master/AUTHORS
CLAUDE.mdhttps://github.com/Gomez12/llama.cpp/blob/master/CLAUDE.md
CLAUDE.mdhttps://github.com/Gomez12/llama.cpp/blob/master/CLAUDE.md
CMakeLists.txthttps://github.com/Gomez12/llama.cpp/blob/master/CMakeLists.txt
CMakeLists.txthttps://github.com/Gomez12/llama.cpp/blob/master/CMakeLists.txt
CMakePresets.jsonhttps://github.com/Gomez12/llama.cpp/blob/master/CMakePresets.json
CMakePresets.jsonhttps://github.com/Gomez12/llama.cpp/blob/master/CMakePresets.json
CODEOWNERShttps://github.com/Gomez12/llama.cpp/blob/master/CODEOWNERS
CODEOWNERShttps://github.com/Gomez12/llama.cpp/blob/master/CODEOWNERS
CONTRIBUTING.mdhttps://github.com/Gomez12/llama.cpp/blob/master/CONTRIBUTING.md
CONTRIBUTING.mdhttps://github.com/Gomez12/llama.cpp/blob/master/CONTRIBUTING.md
LICENSEhttps://github.com/Gomez12/llama.cpp/blob/master/LICENSE
LICENSEhttps://github.com/Gomez12/llama.cpp/blob/master/LICENSE
Makefilehttps://github.com/Gomez12/llama.cpp/blob/master/Makefile
Makefilehttps://github.com/Gomez12/llama.cpp/blob/master/Makefile
README.mdhttps://github.com/Gomez12/llama.cpp/blob/master/README.md
README.mdhttps://github.com/Gomez12/llama.cpp/blob/master/README.md
SECURITY.mdhttps://github.com/Gomez12/llama.cpp/blob/master/SECURITY.md
SECURITY.mdhttps://github.com/Gomez12/llama.cpp/blob/master/SECURITY.md
build-xcframework.shhttps://github.com/Gomez12/llama.cpp/blob/master/build-xcframework.sh
build-xcframework.shhttps://github.com/Gomez12/llama.cpp/blob/master/build-xcframework.sh
convert_hf_to_gguf.pyhttps://github.com/Gomez12/llama.cpp/blob/master/convert_hf_to_gguf.py
convert_hf_to_gguf.pyhttps://github.com/Gomez12/llama.cpp/blob/master/convert_hf_to_gguf.py
convert_hf_to_gguf_update.pyhttps://github.com/Gomez12/llama.cpp/blob/master/convert_hf_to_gguf_update.py
convert_hf_to_gguf_update.pyhttps://github.com/Gomez12/llama.cpp/blob/master/convert_hf_to_gguf_update.py
convert_llama_ggml_to_gguf.pyhttps://github.com/Gomez12/llama.cpp/blob/master/convert_llama_ggml_to_gguf.py
convert_llama_ggml_to_gguf.pyhttps://github.com/Gomez12/llama.cpp/blob/master/convert_llama_ggml_to_gguf.py
convert_lora_to_gguf.pyhttps://github.com/Gomez12/llama.cpp/blob/master/convert_lora_to_gguf.py
convert_lora_to_gguf.pyhttps://github.com/Gomez12/llama.cpp/blob/master/convert_lora_to_gguf.py
flake.lockhttps://github.com/Gomez12/llama.cpp/blob/master/flake.lock
flake.lockhttps://github.com/Gomez12/llama.cpp/blob/master/flake.lock
flake.nixhttps://github.com/Gomez12/llama.cpp/blob/master/flake.nix
flake.nixhttps://github.com/Gomez12/llama.cpp/blob/master/flake.nix
mypy.inihttps://github.com/Gomez12/llama.cpp/blob/master/mypy.ini
mypy.inihttps://github.com/Gomez12/llama.cpp/blob/master/mypy.ini
poetry.lockhttps://github.com/Gomez12/llama.cpp/blob/master/poetry.lock
poetry.lockhttps://github.com/Gomez12/llama.cpp/blob/master/poetry.lock
pyproject.tomlhttps://github.com/Gomez12/llama.cpp/blob/master/pyproject.toml
pyproject.tomlhttps://github.com/Gomez12/llama.cpp/blob/master/pyproject.toml
pyrightconfig.jsonhttps://github.com/Gomez12/llama.cpp/blob/master/pyrightconfig.json
pyrightconfig.jsonhttps://github.com/Gomez12/llama.cpp/blob/master/pyrightconfig.json
requirements.txthttps://github.com/Gomez12/llama.cpp/blob/master/requirements.txt
requirements.txthttps://github.com/Gomez12/llama.cpp/blob/master/requirements.txt
READMEhttps://github.com/Gomez12/llama.cpp
Contributinghttps://github.com/Gomez12/llama.cpp
Licensehttps://github.com/Gomez12/llama.cpp
Securityhttps://github.com/Gomez12/llama.cpp
https://github.com/Gomez12/llama.cpp#llamacpp
https://user-images.githubusercontent.com/1991296/230134379-7181e485-c521-4d23-a0d6-f7b3b61ba524.png
https://opensource.org/licenses/MIT
https://github.com/ggml-org/llama.cpp/releases
https://github.com/ggml-org/llama.cpp/actions/workflows/server.yml
Manifestohttps://github.com/ggml-org/llama.cpp/discussions/205
ggmlhttps://github.com/ggml-org/ggml
opshttps://github.com/ggml-org/llama.cpp/blob/master/docs/ops.md
https://github.com/Gomez12/llama.cpp#recent-api-changes
Changelog for libllama APIhttps://github.com/ggml-org/llama.cpp/issues/9289
Changelog for llama-server REST APIhttps://github.com/ggml-org/llama.cpp/issues/9291
https://github.com/Gomez12/llama.cpp#hot-topics
guide : using the new WebUI of llama.cpphttps://github.com/ggml-org/llama.cpp/discussions/16938
guide : running gpt-oss with llama.cpphttps://github.com/ggml-org/llama.cpp/discussions/15396
[FEEDBACK] Better packaging for llama.cpp to support downstream consumers 🤗https://github.com/ggml-org/llama.cpp/discussions/15313
PRhttps://github.com/ggml-org/llama.cpp/pull/15091
Collaboration with NVIDIAhttps://blogs.nvidia.com/blog/rtx-ai-garage-openai-oss
Commenthttps://github.com/ggml-org/llama.cpp/discussions/15095
#12898https://github.com/ggml-org/llama.cpp/pull/12898
documentationhttps://github.com/Gomez12/llama.cpp/blob/master/docs/multimodal.md
https://github.com/ggml-org/llama.vscodehttps://github.com/ggml-org/llama.vscode
https://github.com/ggml-org/llama.vimhttps://github.com/ggml-org/llama.vim
ggml-org#9669https://github.com/ggml-org/llama.cpp/discussions/9669
discussionhttps://github.com/ggml-org/llama.cpp/discussions/9268
toolhttps://huggingface.co/spaces/CISCai/gguf-editor
https://github.com/Gomez12/llama.cpp#quick-start
brew, nix or wingethttps://github.com/Gomez12/llama.cpp/blob/master/docs/install.md
Docker documentationhttps://github.com/Gomez12/llama.cpp/blob/master/docs/docker.md
releases pagehttps://github.com/ggml-org/llama.cpp/releases
our build guidehttps://github.com/Gomez12/llama.cpp/blob/master/docs/build.md
Obtaining and quantizing modelshttps://github.com/Gomez12/llama.cpp#obtaining-and-quantizing-models
https://github.com/Gomez12/llama.cpp#description
ggmlhttps://github.com/ggml-org/ggml
HOWTO-add-model.mdhttps://github.com/Gomez12/llama.cpp/blob/master/docs/development/HOWTO-add-model.md
https://github.com/Gomez12/llama.cpp#text-only
Mistral 7Bhttps://huggingface.co/mistralai/Mistral-7B-v0.1
Mixtral MoEhttps://huggingface.co/models?search=mistral-ai/Mixtral
DBRXhttps://huggingface.co/databricks/dbrx-instruct
Jambahttps://huggingface.co/ai21labs
Falconhttps://huggingface.co/models?search=tiiuae/falcon
Chinese LLaMA / Alpacahttps://github.com/ymcui/Chinese-LLaMA-Alpaca
Chinese LLaMA-2 / Alpaca-2https://github.com/ymcui/Chinese-LLaMA-Alpaca-2
Vigogne (French)https://github.com/bofenghuang/vigogne
BERThttps://github.com/ggml-org/llama.cpp/pull/5423
Koalahttps://bair.berkeley.edu/blog/2023/04/03/koala/
Baichuan 1 & 2https://huggingface.co/models?search=baichuan-inc/Baichuan
derivationshttps://huggingface.co/hiyouga/baichuan-7b-sft
Aquila 1 & 2https://huggingface.co/models?search=BAAI/Aquila
Starcoder modelshttps://github.com/ggml-org/llama.cpp/pull/3187
Refacthttps://huggingface.co/smallcloudai/Refact-1_6B-fim
MPThttps://github.com/ggml-org/llama.cpp/pull/3417
Bloomhttps://github.com/ggml-org/llama.cpp/pull/3553
Yi modelshttps://huggingface.co/models?search=01-ai/Yi
StableLM modelshttps://huggingface.co/stabilityai
Deepseek modelshttps://huggingface.co/models?search=deepseek-ai/deepseek
Qwen modelshttps://huggingface.co/models?search=Qwen/Qwen
PLaMo-13Bhttps://github.com/ggml-org/llama.cpp/pull/3557
Phi modelshttps://huggingface.co/models?search=microsoft/phi
PhiMoEhttps://github.com/ggml-org/llama.cpp/pull/11003
GPT-2https://huggingface.co/gpt2
Orion 14Bhttps://github.com/ggml-org/llama.cpp/pull/5118
InternLM2https://huggingface.co/models?search=internlm2
CodeShellhttps://github.com/WisdomShell/codeshell
Gemmahttps://ai.google.dev/gemma
Mambahttps://github.com/state-spaces/mamba
Grok-1https://huggingface.co/keyfan/grok-1-hf
Xversehttps://huggingface.co/models?search=xverse
Command-R modelshttps://huggingface.co/models?search=CohereForAI/c4ai-command-r
SEA-LIONhttps://huggingface.co/models?search=sea-lion
GritLM-7Bhttps://huggingface.co/GritLM/GritLM-7B
GritLM-8x7Bhttps://huggingface.co/GritLM/GritLM-8x7B
OLMohttps://allenai.org/olmo
OLMo 2https://allenai.org/olmo
OLMoEhttps://huggingface.co/allenai/OLMoE-1B-7B-0924
Granite modelshttps://huggingface.co/collections/ibm-granite/granite-code-models-6624c5cec322e4c148c8b330
GPT-NeoXhttps://github.com/EleutherAI/gpt-neox
Pythiahttps://github.com/EleutherAI/pythia
Snowflake-Arctic MoEhttps://huggingface.co/collections/Snowflake/arctic-66290090abe542894a5ac520
Smaughttps://huggingface.co/models?search=Smaug
Poro 34Bhttps://huggingface.co/LumiOpen/Poro-34B
Bitnet b1.58 modelshttps://huggingface.co/1bitLLM
Flan T5https://huggingface.co/models?search=flan-t5
Open Elm modelshttps://huggingface.co/collections/apple/openelm-instruct-models-6619ad295d7ae9f868b759ca
ChatGLM3-6bhttps://huggingface.co/THUDM/chatglm3-6b
ChatGLM4-9bhttps://huggingface.co/THUDM/glm-4-9b
GLMEdge-1.5bhttps://huggingface.co/THUDM/glm-edge-1.5b-chat
GLMEdge-4bhttps://huggingface.co/THUDM/glm-edge-4b-chat
GLM-4-0414https://huggingface.co/collections/THUDM/glm-4-0414-67f3cbcb34dd9d252707cb2e
SmolLMhttps://huggingface.co/collections/HuggingFaceTB/smollm-6695016cad7167254ce15966
EXAONE-3.0-7.8B-Instructhttps://huggingface.co/LGAI-EXAONE/EXAONE-3.0-7.8B-Instruct
FalconMamba Modelshttps://huggingface.co/collections/tiiuae/falconmamba-7b-66b9a580324dd1598b0f6d4a
Jaishttps://huggingface.co/inceptionai/jais-13b-chat
Bielik-11B-v2.3https://huggingface.co/collections/speakleash/bielik-11b-v23-66ee813238d9b526a072408a
RWKV-6https://github.com/BlinkDL/RWKV-LM
QRWKV-6https://huggingface.co/recursal/QRWKV6-32B-Instruct-Preview-v0.1
GigaChat-20B-A3Bhttps://huggingface.co/ai-sage/GigaChat-20B-A3B-instruct
Trillion-7B-previewhttps://huggingface.co/trillionlabs/Trillion-7B-preview
Ling modelshttps://huggingface.co/collections/inclusionAI/ling-67c51c85b34a7ea0aba94c32
LFM2 modelshttps://huggingface.co/collections/LiquidAI/lfm2-686d721927015b2ad73eaa38
Hunyuan modelshttps://huggingface.co/collections/tencent/hunyuan-dense-model-6890632cda26b19119c9c5e7
BailingMoeV2 (Ring/Ling 2.0) modelshttps://huggingface.co/collections/inclusionAI/ling-v2-68bf1dd2fc34c306c1fa6f86
https://github.com/Gomez12/llama.cpp#multimodal
LLaVA 1.5 modelshttps://huggingface.co/collections/liuhaotian/llava-15-653aac15d994e992e2677a7e
LLaVA 1.6 modelshttps://huggingface.co/collections/liuhaotian/llava-16-65b9e40155f60fd046a5ccf2
BakLLaVAhttps://huggingface.co/models?search=SkunkworksAI/Bakllava
Obsidianhttps://huggingface.co/NousResearch/Obsidian-3B-V0.5
ShareGPT4Vhttps://huggingface.co/models?search=Lin-Chen/ShareGPT4V
MobileVLM 1.7B/3B modelshttps://huggingface.co/models?search=mobileVLM
Yi-VLhttps://huggingface.co/models?search=Yi-VL
Mini CPMhttps://huggingface.co/models?search=MiniCPM
Moondreamhttps://huggingface.co/vikhyatk/moondream2
Bunnyhttps://github.com/BAAI-DCAI/Bunny
GLM-EDGEhttps://huggingface.co/models?search=glm-edge
Qwen2-VLhttps://huggingface.co/collections/Qwen/qwen2-vl-66cee7455501d7126940800d
LFM2-VLhttps://huggingface.co/collections/LiquidAI/lfm2-vl-68963bbc84a610f7638d5ffa
ddh0/easy-llamahttps://github.com/ddh0/easy-llama
abetlen/llama-cpp-pythonhttps://github.com/abetlen/llama-cpp-python
go-skynet/go-llama.cpphttps://github.com/go-skynet/go-llama.cpp
withcatai/node-llama-cpphttps://github.com/withcatai/node-llama-cpp
lgrammel/modelfusionhttps://modelfusion.dev/integration/model-provider/llamacpp
offline-ai/clihttps://github.com/offline-ai/cli
tangledgroup/llama-cpp-wasmhttps://github.com/tangledgroup/llama-cpp-wasm
ngxson/wllamahttps://github.com/ngxson/wllama
yoshoku/llama_cpp.rbhttps://github.com/yoshoku/llama_cpp.rb
edgenai/llama_cpp-rshttps://github.com/edgenai/llama_cpp-rs
mdrokz/rust-llama.cpphttps://github.com/mdrokz/rust-llama.cpp
utilityai/llama-cpp-rshttps://github.com/utilityai/llama-cpp-rs
ShelbyJenkins/llm_clienthttps://github.com/ShelbyJenkins/llm_client
SciSharp/LLamaSharphttps://github.com/SciSharp/LLamaSharp
LM-Kit.NEThttps://docs.lm-kit.com/lm-kit-net/index.html
donderom/llm4shttps://github.com/donderom/llm4s
phronmophobic/llama.cljhttps://github.com/phronmophobic/llama.clj
mybigday/llama.rnhttps://github.com/mybigday/llama.rn
kherud/java-llama.cpphttps://github.com/kherud/java-llama.cpp
QuasarByte/llama-cpp-jnahttps://github.com/QuasarByte/llama-cpp-jna
deins/llama.cpp.zighttps://github.com/Deins/llama.cpp.zig
netdur/llama_cpp_darthttps://github.com/netdur/llama_cpp_dart
xuegao-tzx/Fllamahttps://github.com/xuegao-tzx/Fllama
distantmagic/resonancehttps://github.com/distantmagic/resonance
(more info)https://github.com/ggml-org/llama.cpp/pull/6326
guile_llama_cpphttps://savannah.nongnu.org/projects/guile-llama-cpp
srgtuszy/llama-cpp-swifthttps://github.com/srgtuszy/llama-cpp-swift
ShenghaiWang/SwiftLlamahttps://github.com/ShenghaiWang/SwiftLlama
Embarcadero/llama-cpp-delphihttps://github.com/Embarcadero/llama-cpp-delphi
hybridgroup/yzmahttps://github.com/hybridgroup/yzma
llama.androidhttps://github.com/Gomez12/llama.cpp/blob/master/examples/llama.android
AI Sublime Text pluginhttps://github.com/yaroslavyaroslav/OpenAI-sublime-text
BonzAI Apphttps://apps.apple.com/us/app/bonzai-your-local-ai-agent/id6752847988
cztomsik/avahttps://github.com/cztomsik/ava
Dothttps://github.com/alexpinel/Dot
evahttps://github.com/ylsdamxssjxxdd/eva
iohub/collamahttps://github.com/iohub/coLLaMA
janhq/janhttps://github.com/janhq/jan
johnbean393/Sidekickhttps://github.com/johnbean393/Sidekick
KanTVhttps://github.com/zhouwg/kantv?tab=readme-ov-file
KodiBothttps://github.com/firatkiral/kodibot
llama.vimhttps://github.com/ggml-org/llama.vim
LARShttps://github.com/abgulati/LARS
Llama Assistanthttps://github.com/vietanhdev/llama-assistant
LLMFarmhttps://github.com/guinmoon/LLMFarm?tab=readme-ov-file
LLMUnityhttps://github.com/undreamai/LLMUnity
LMStudiohttps://lmstudio.ai/
LocalAIhttps://github.com/mudler/LocalAI
LostRuins/koboldcpphttps://github.com/LostRuins/koboldcpp
MindMachttps://mindmac.app
MindWorkAI/AI-Studiohttps://github.com/MindWorkAI/AI-Studio
Mobile-Artificial-Intelligence/maidhttps://github.com/Mobile-Artificial-Intelligence/maid
Mozilla-Ocho/llamafilehttps://github.com/Mozilla-Ocho/llamafile
nat/openplaygroundhttps://github.com/nat/openplayground
nomic-ai/gpt4allhttps://github.com/nomic-ai/gpt4all
ollama/ollamahttps://github.com/ollama/ollama
oobabooga/text-generation-webuihttps://github.com/oobabooga/text-generation-webui
PocketPal AIhttps://github.com/a-ghorbani/pocketpal-ai
psugihara/FreeChathttps://github.com/psugihara/FreeChat
ptsochantaris/emeltalhttps://github.com/ptsochantaris/emeltal
pythops/tenerehttps://github.com/pythops/tenere
ramalamahttps://github.com/containers/ramalama
semperai/amicahttps://github.com/semperai/amica
withcatai/cataihttps://github.com/withcatai/catai
Autopenhttps://github.com/blackhole89/autopen
akx/ggifyhttps://github.com/akx/ggify
akx/ollama-dlhttps://github.com/akx/ollama-dl
crashr/gppmhttps://github.com/crashr/gppm
gpustack/gguf-parserhttps://github.com/gpustack/gguf-parser-go/tree/main/cmd/gguf-parser
Styled Lineshttps://marketplace.unity.com/packages/tools/generative-ai/styled-lines-llama-cpp-model-292902
unslothai/unslothhttps://github.com/unslothai/unsloth
Paddlerhttps://github.com/intentee/paddler
GPUStackhttps://github.com/gpustack/gpustack
llama_cpp_canisterhttps://github.com/onicai/llama_cpp_canister
llama-swaphttps://github.com/mostlygeek/llama-swap
Kalavaihttps://github.com/kalavai-net/kalavai-client
llmazhttps://github.com/InftyAI/llmaz
Lucy's Labyrinthhttps://github.com/MorganRO8/Lucys_Labyrinth
https://github.com/Gomez12/llama.cpp#supported-backends
Metalhttps://github.com/Gomez12/llama.cpp/blob/master/docs/build.md#metal-build
BLAShttps://github.com/Gomez12/llama.cpp/blob/master/docs/build.md#blas-build
BLIShttps://github.com/Gomez12/llama.cpp/blob/master/docs/backend/BLIS.md
SYCLhttps://github.com/Gomez12/llama.cpp/blob/master/docs/backend/SYCL.md
MUSAhttps://github.com/Gomez12/llama.cpp/blob/master/docs/build.md#musa
CUDAhttps://github.com/Gomez12/llama.cpp/blob/master/docs/build.md#cuda
HIPhttps://github.com/Gomez12/llama.cpp/blob/master/docs/build.md#hip
ZenDNNhttps://github.com/Gomez12/llama.cpp/blob/master/docs/build.md#zendnn
Vulkanhttps://github.com/Gomez12/llama.cpp/blob/master/docs/build.md#vulkan
CANNhttps://github.com/Gomez12/llama.cpp/blob/master/docs/build.md#cann
OpenCLhttps://github.com/Gomez12/llama.cpp/blob/master/docs/backend/OPENCL.md
IBM zDNNhttps://github.com/Gomez12/llama.cpp/blob/master/docs/backend/zDNN.md
WebGPU [In Progress]https://github.com/Gomez12/llama.cpp/blob/master/docs/build.md#webgpu
RPChttps://github.com/ggml-org/llama.cpp/tree/master/tools/rpc
Hexagon [In Progress]https://github.com/Gomez12/llama.cpp/blob/master/docs/backend/hexagon/README.md
https://github.com/Gomez12/llama.cpp#obtaining-and-quantizing-models
Hugging Facehttps://huggingface.co
number of LLMshttps://huggingface.co/models?library=gguf&sort=trending
Trendinghttps://huggingface.co/models?library=gguf&sort=trending
LLaMAhttps://huggingface.co/models?sort=trending&search=llama+gguf
Hugging Facehttps://huggingface.co/
ModelScopehttps://modelscope.cn/
GGUFhttps://github.com/ggml-org/ggml/blob/master/docs/gguf.md
GGUF-my-repo spacehttps://huggingface.co/spaces/ggml-org/gguf-my-repo
GGUF-my-LoRA spacehttps://huggingface.co/spaces/ggml-org/gguf-my-lora
ggml-org#10123https://github.com/ggml-org/llama.cpp/discussions/10123
GGUF-editor spacehttps://huggingface.co/spaces/CISCai/gguf-editor
ggml-org#9268https://github.com/ggml-org/llama.cpp/discussions/9268
Inference Endpointshttps://ui.endpoints.huggingface.co/
ggml-org#9669https://github.com/ggml-org/llama.cpp/discussions/9669
read this documentationhttps://github.com/Gomez12/llama.cpp/blob/master/tools/quantize/README.md
llama-clihttps://github.com/Gomez12/llama.cpp/blob/master/tools/cli
https://github.com/Gomez12/llama.cpp#llama-cli
https://github.com/Gomez12/llama.cpp#a-cli-tool-for-accessing-and-experimenting-with-most-of-llamacpps-functionality
grammars/https://github.com/Gomez12/llama.cpp/blob/master/grammars
GBNF Guidehttps://github.com/Gomez12/llama.cpp/blob/master/grammars/README.md
https://grammar.intrinsiclabs.ai/https://grammar.intrinsiclabs.ai/
llama-serverhttps://github.com/Gomez12/llama.cpp/blob/master/tools/server
https://github.com/Gomez12/llama.cpp#llama-server
OpenAI APIhttps://github.com/openai/openai-openapi
https://github.com/Gomez12/llama.cpp#a-lightweight-openai-api-compatible-http-server-for-serving-llms
llama-perplexityhttps://github.com/Gomez12/llama.cpp/blob/master/tools/perplexity
https://github.com/Gomez12/llama.cpp#llama-perplexity
perplexityhttps://github.com/Gomez12/llama.cpp/blob/master/tools/perplexity/README.md
1https://github.com/Gomez12/llama.cpp#user-content-fn-1-048d91990ad10561a76ae941167d0901
https://github.com/Gomez12/llama.cpp#a-tool-for-measuring-the-perplexity-1-and-other-quality-metrics-of-a-model-over-a-given-text
llama-benchhttps://github.com/Gomez12/llama.cpp/blob/master/tools/llama-bench
https://github.com/Gomez12/llama.cpp#llama-bench
https://github.com/Gomez12/llama.cpp#benchmark-the-performance-of-the-inference-for-various-parameters
llama-simplehttps://github.com/Gomez12/llama.cpp/blob/master/examples/simple
https://github.com/Gomez12/llama.cpp#llama-simple
https://github.com/Gomez12/llama.cpp#a-minimal-example-for-implementing-apps-with-llamacpp-useful-for-developers
https://github.com/Gomez12/llama.cpp#contributing
good first issueshttps://github.com/ggml-org/llama.cpp/issues?q=is%3Aissue+is%3Aopen+label%3A%22good+first+issue%22
CONTRIBUTING.mdhttps://github.com/Gomez12/llama.cpp/blob/master/CONTRIBUTING.md
Inference at the edgehttps://github.com/ggml-org/llama.cpp/discussions/205
Changelog podcasthttps://changelog.com/podcast/532
https://github.com/Gomez12/llama.cpp#other-documentation
clihttps://github.com/Gomez12/llama.cpp/blob/master/tools/cli/README.md
completionhttps://github.com/Gomez12/llama.cpp/blob/master/tools/completion/README.md
serverhttps://github.com/Gomez12/llama.cpp/blob/master/tools/server/README.md
GBNF grammarshttps://github.com/Gomez12/llama.cpp/blob/master/grammars/README.md
https://github.com/Gomez12/llama.cpp#development-documentation
How to buildhttps://github.com/Gomez12/llama.cpp/blob/master/docs/build.md
Running on Dockerhttps://github.com/Gomez12/llama.cpp/blob/master/docs/docker.md
Build on Androidhttps://github.com/Gomez12/llama.cpp/blob/master/docs/android.md
Performance troubleshootinghttps://github.com/Gomez12/llama.cpp/blob/master/docs/development/token_generation_performance_tips.md
GGML tips & trickshttps://github.com/ggml-org/llama.cpp/wiki/GGML-Tips-&-Tricks
https://github.com/Gomez12/llama.cpp#seminal-papers-and-background-on-the-models
Introducing LLaMA: A foundational, 65-billion-parameter large language modelhttps://ai.facebook.com/blog/large-language-model-llama-meta-ai/
LLaMA: Open and Efficient Foundation Language Modelshttps://arxiv.org/abs/2302.13971
Language Models are Few-Shot Learnershttps://arxiv.org/abs/2005.14165
Aligning language models to follow instructionshttps://openai.com/research/instruction-following
Training language models to follow instructions with human feedbackhttps://arxiv.org/abs/2203.02155
https://github.com/Gomez12/llama.cpp#xcframework
https://github.com/Gomez12/llama.cpp#completions
https://github.com/Gomez12/llama.cpp#bash-completion
https://github.com/Gomez12/llama.cpp#dependencies
yhirose/cpp-httplibhttps://github.com/yhirose/cpp-httplib
stb-imagehttps://github.com/nothings/stb
nlohmann/jsonhttps://github.com/nlohmann/json
minjahttps://github.com/google/minja
curlhttps://curl.se/
CURL Licensehttps://curl.se/docs/copyright.html
miniaudio.hhttps://github.com/mackron/miniaudio
subprocess.hhttps://github.com/sheredom/subprocess.h
https://huggingface.co/docs/transformers/perplexityhttps://huggingface.co/docs/transformers/perplexity
https://github.com/Gomez12/llama.cpp#user-content-fnref-1-048d91990ad10561a76ae941167d0901
Readme https://github.com/Gomez12/llama.cpp#readme-ov-file
MIT license https://github.com/Gomez12/llama.cpp#MIT-1-ov-file
Contributing https://github.com/Gomez12/llama.cpp#contributing-ov-file
Security policy https://github.com/Gomez12/llama.cpp#security-ov-file
Please reload this pagehttps://github.com/Gomez12/llama.cpp
Activityhttps://github.com/Gomez12/llama.cpp/activity
0 starshttps://github.com/Gomez12/llama.cpp/stargazers
0 watchinghttps://github.com/Gomez12/llama.cpp/watchers
0 forkshttps://github.com/Gomez12/llama.cpp/forks
Report repository https://github.com/contact/report-content?content_url=https%3A%2F%2Fgithub.com%2FGomez12%2Fllama.cpp&report=Gomez12+%28user%29
Releaseshttps://github.com/Gomez12/llama.cpp/releases
3 tags https://github.com/Gomez12/llama.cpp/tags
Packages 0https://github.com/users/Gomez12/packages?repo_name=llama.cpp
Please reload this pagehttps://github.com/Gomez12/llama.cpp
Please reload this pagehttps://github.com/Gomez12/llama.cpp
https://github.com
Termshttps://docs.github.com/site-policy/github-terms/github-terms-of-service
Privacyhttps://docs.github.com/site-policy/privacy-policies/github-privacy-statement
Securityhttps://github.com/security
Statushttps://www.githubstatus.com/
Communityhttps://github.community/
Docshttps://docs.github.com/
Contacthttps://support.github.com?tags=dotcom-footer

Viewport: width=device-width


URLs of crawlers that visited me.