René's URL Explorer Experiment


Title: GitHub - vproxy-tools/llama.cpp: LLM inference in C/C++

Open Graph Title: GitHub - vproxy-tools/llama.cpp: LLM inference in C/C++

X Title: GitHub - vproxy-tools/llama.cpp: LLM inference in C/C++

Description: LLM inference in C/C++. Contribute to vproxy-tools/llama.cpp development by creating an account on GitHub.

Open Graph Description: LLM inference in C/C++. Contribute to vproxy-tools/llama.cpp development by creating an account on GitHub.

X Description: LLM inference in C/C++. Contribute to vproxy-tools/llama.cpp development by creating an account on GitHub.

Opengraph URL: https://github.com/vproxy-tools/llama.cpp

X: @github

direct link

Domain: github.com

route-pattern/:user_id/:repository
route-controllerfiles
route-actiondisambiguate
fetch-noncev2:29b69ce6-7058-95a7-b957-40cf7ea1e1d4
current-catalog-service-hashf3abb0cc802f3d7b95fc8762b94bdcb13bf39634c40c357301c4aa1d67a256fb
request-id85D4:24B668:1818082:2047756:6969ADD1
html-safe-nonce948b8ba6f4776eadbabc5905f30a40094d8b3d667d6ad3dccc92945cf2d36af2
visitor-payloadeyJyZWZlcnJlciI6IiIsInJlcXVlc3RfaWQiOiI4NUQ0OjI0QjY2ODoxODE4MDgyOjIwNDc3NTY6Njk2OUFERDEiLCJ2aXNpdG9yX2lkIjoiODMwODA1MjgxNDkzNTQ2OTUyMiIsInJlZ2lvbl9lZGdlIjoiaWFkIiwicmVnaW9uX3JlbmRlciI6ImlhZCJ9
visitor-hmaced08e034e5640870f98f2f19348c7b15d8367c46126034bbfc36a0e445979765
hovercard-subject-tagrepository:945475012
github-keyboard-shortcutsrepository,copilot
google-site-verificationApib7-x98H0j5cPqHWwSMm6dNU4GmODRoqxLiDzdx9I
octolytics-urlhttps://collector.github.com/github/collect
analytics-location//
fb:app_id1401488693436528
apple-itunes-appapp-id=1477376905, app-argument=https://github.com/vproxy-tools/llama.cpp
twitter:imagehttps://opengraph.githubassets.com/8982bb9ac728e9dfae646388556bb652835bb91c41548103be51c4d3c6d234f5/vproxy-tools/llama.cpp
twitter:cardsummary_large_image
og:imagehttps://opengraph.githubassets.com/8982bb9ac728e9dfae646388556bb652835bb91c41548103be51c4d3c6d234f5/vproxy-tools/llama.cpp
og:image:altLLM inference in C/C++. Contribute to vproxy-tools/llama.cpp development by creating an account on GitHub.
og:image:width1200
og:image:height600
og:site_nameGitHub
og:typeobject
hostnamegithub.com
expected-hostnamegithub.com
None24c4c97a2d520cb286b35e1a4c22d7a4df3c26a2fa28dd7cdf0e65db327b4de7
turbo-cache-controlno-preview
go-importgithub.com/vproxy-tools/llama.cpp git https://github.com/vproxy-tools/llama.cpp.git
octolytics-dimension-user_id59661155
octolytics-dimension-user_loginvproxy-tools
octolytics-dimension-repository_id945475012
octolytics-dimension-repository_nwovproxy-tools/llama.cpp
octolytics-dimension-repository_publictrue
octolytics-dimension-repository_is_forktrue
octolytics-dimension-repository_parent_id612354784
octolytics-dimension-repository_parent_nwoggml-org/llama.cpp
octolytics-dimension-repository_network_root_id612354784
octolytics-dimension-repository_network_root_nwoggml-org/llama.cpp
turbo-body-classeslogged-out env-production page-responsive
disable-turbofalse
browser-stats-urlhttps://api.github.com/_private/browser/stats
browser-errors-urlhttps://api.github.com/_private/browser/errors
release124667f43168afb6c9c03b7c02eb5b1d2e1be3d9
ui-targetfull
theme-color#1e2327
color-schemelight dark

Links:

Skip to contenthttps://github.com/vproxy-tools/llama.cpp#start-of-content
https://github.com/
Sign in https://github.com/login?return_to=https%3A%2F%2Fgithub.com%2Fvproxy-tools%2Fllama.cpp
GitHub CopilotWrite better code with AIhttps://github.com/features/copilot
GitHub SparkBuild and deploy intelligent appshttps://github.com/features/spark
GitHub ModelsManage and compare promptshttps://github.com/features/models
MCP RegistryNewIntegrate external toolshttps://github.com/mcp
ActionsAutomate any workflowhttps://github.com/features/actions
CodespacesInstant dev environmentshttps://github.com/features/codespaces
IssuesPlan and track workhttps://github.com/features/issues
Code ReviewManage code changeshttps://github.com/features/code-review
GitHub Advanced SecurityFind and fix vulnerabilitieshttps://github.com/security/advanced-security
Code securitySecure your code as you buildhttps://github.com/security/advanced-security/code-security
Secret protectionStop leaks before they starthttps://github.com/security/advanced-security/secret-protection
Why GitHubhttps://github.com/why-github
Documentationhttps://docs.github.com
Bloghttps://github.blog
Changeloghttps://github.blog/changelog
Marketplacehttps://github.com/marketplace
View all featureshttps://github.com/features
Enterpriseshttps://github.com/enterprise
Small and medium teamshttps://github.com/team
Startupshttps://github.com/enterprise/startups
Nonprofitshttps://github.com/solutions/industry/nonprofits
App Modernizationhttps://github.com/solutions/use-case/app-modernization
DevSecOpshttps://github.com/solutions/use-case/devsecops
DevOpshttps://github.com/solutions/use-case/devops
CI/CDhttps://github.com/solutions/use-case/ci-cd
View all use caseshttps://github.com/solutions/use-case
Healthcarehttps://github.com/solutions/industry/healthcare
Financial serviceshttps://github.com/solutions/industry/financial-services
Manufacturinghttps://github.com/solutions/industry/manufacturing
Governmenthttps://github.com/solutions/industry/government
View all industrieshttps://github.com/solutions/industry
View all solutionshttps://github.com/solutions
AIhttps://github.com/resources/articles?topic=ai
Software Developmenthttps://github.com/resources/articles?topic=software-development
DevOpshttps://github.com/resources/articles?topic=devops
Securityhttps://github.com/resources/articles?topic=security
View all topicshttps://github.com/resources/articles
Customer storieshttps://github.com/customer-stories
Events & webinarshttps://github.com/resources/events
Ebooks & reportshttps://github.com/resources/whitepapers
Business insightshttps://github.com/solutions/executive-insights
GitHub Skillshttps://skills.github.com
Documentationhttps://docs.github.com
Customer supporthttps://support.github.com
Community forumhttps://github.com/orgs/community/discussions
Trust centerhttps://github.com/trust-center
Partnershttps://github.com/partners
GitHub SponsorsFund open source developershttps://github.com/sponsors
Security Labhttps://securitylab.github.com
Maintainer Communityhttps://maintainers.github.com
Acceleratorhttps://github.com/accelerator
Archive Programhttps://archiveprogram.github.com
Topicshttps://github.com/topics
Trendinghttps://github.com/trending
Collectionshttps://github.com/collections
Enterprise platformAI-powered developer platformhttps://github.com/enterprise
GitHub Advanced SecurityEnterprise-grade security featureshttps://github.com/security/advanced-security
Copilot for BusinessEnterprise-grade AI featureshttps://github.com/features/copilot/copilot-business
Premium SupportEnterprise-grade 24/7 supporthttps://github.com/premium-support
Pricinghttps://github.com/pricing
Search syntax tipshttps://docs.github.com/search-github/github-code-search/understanding-github-code-search-syntax
documentationhttps://docs.github.com/search-github/github-code-search/understanding-github-code-search-syntax
Sign in https://github.com/login?return_to=https%3A%2F%2Fgithub.com%2Fvproxy-tools%2Fllama.cpp
Sign up https://github.com/signup?ref_cta=Sign+up&ref_loc=header+logged+out&ref_page=%2F%3Cuser-name%3E%2F%3Crepo-name%3E&source=header-repo&source_repo=vproxy-tools%2Fllama.cpp
Reloadhttps://github.com/vproxy-tools/llama.cpp
Reloadhttps://github.com/vproxy-tools/llama.cpp
Reloadhttps://github.com/vproxy-tools/llama.cpp
vproxy-tools https://github.com/vproxy-tools
llama.cpphttps://github.com/vproxy-tools/llama.cpp
ggml-org/llama.cpphttps://github.com/ggml-org/llama.cpp
Notifications https://github.com/login?return_to=%2Fvproxy-tools%2Fllama.cpp
Fork 0 https://github.com/login?return_to=%2Fvproxy-tools%2Fllama.cpp
Star 2 https://github.com/login?return_to=%2Fvproxy-tools%2Fllama.cpp
MIT license https://github.com/vproxy-tools/llama.cpp/blob/will-force-push/LICENSE
2 stars https://github.com/vproxy-tools/llama.cpp/stargazers
14.5k forks https://github.com/vproxy-tools/llama.cpp/forks
Branches https://github.com/vproxy-tools/llama.cpp/branches
Tags https://github.com/vproxy-tools/llama.cpp/tags
Activity https://github.com/vproxy-tools/llama.cpp/activity
Star https://github.com/login?return_to=%2Fvproxy-tools%2Fllama.cpp
Notifications https://github.com/login?return_to=%2Fvproxy-tools%2Fllama.cpp
Code https://github.com/vproxy-tools/llama.cpp
Issues 0 https://github.com/vproxy-tools/llama.cpp/issues
Pull requests 0 https://github.com/vproxy-tools/llama.cpp/pulls
Actions https://github.com/vproxy-tools/llama.cpp/actions
Projects 0 https://github.com/vproxy-tools/llama.cpp/projects
Security Uh oh! There was an error while loading. Please reload this page. https://github.com/vproxy-tools/llama.cpp/security
Please reload this pagehttps://github.com/vproxy-tools/llama.cpp
Insights https://github.com/vproxy-tools/llama.cpp/pulse
Code https://github.com/vproxy-tools/llama.cpp
Issues https://github.com/vproxy-tools/llama.cpp/issues
Pull requests https://github.com/vproxy-tools/llama.cpp/pulls
Actions https://github.com/vproxy-tools/llama.cpp/actions
Projects https://github.com/vproxy-tools/llama.cpp/projects
Security https://github.com/vproxy-tools/llama.cpp/security
Insights https://github.com/vproxy-tools/llama.cpp/pulse
Brancheshttps://github.com/vproxy-tools/llama.cpp/branches
Tagshttps://github.com/vproxy-tools/llama.cpp/tags
https://github.com/vproxy-tools/llama.cpp/branches
https://github.com/vproxy-tools/llama.cpp/tags
4,888 Commitshttps://github.com/vproxy-tools/llama.cpp/commits/will-force-push/
https://github.com/vproxy-tools/llama.cpp/commits/will-force-push/
.devopshttps://github.com/vproxy-tools/llama.cpp/tree/will-force-push/.devops
.devopshttps://github.com/vproxy-tools/llama.cpp/tree/will-force-push/.devops
.githubhttps://github.com/vproxy-tools/llama.cpp/tree/will-force-push/.github
.githubhttps://github.com/vproxy-tools/llama.cpp/tree/will-force-push/.github
cihttps://github.com/vproxy-tools/llama.cpp/tree/will-force-push/ci
cihttps://github.com/vproxy-tools/llama.cpp/tree/will-force-push/ci
cmakehttps://github.com/vproxy-tools/llama.cpp/tree/will-force-push/cmake
cmakehttps://github.com/vproxy-tools/llama.cpp/tree/will-force-push/cmake
commonhttps://github.com/vproxy-tools/llama.cpp/tree/will-force-push/common
commonhttps://github.com/vproxy-tools/llama.cpp/tree/will-force-push/common
docshttps://github.com/vproxy-tools/llama.cpp/tree/will-force-push/docs
docshttps://github.com/vproxy-tools/llama.cpp/tree/will-force-push/docs
exampleshttps://github.com/vproxy-tools/llama.cpp/tree/will-force-push/examples
exampleshttps://github.com/vproxy-tools/llama.cpp/tree/will-force-push/examples
ggmlhttps://github.com/vproxy-tools/llama.cpp/tree/will-force-push/ggml
ggmlhttps://github.com/vproxy-tools/llama.cpp/tree/will-force-push/ggml
gguf-pyhttps://github.com/vproxy-tools/llama.cpp/tree/will-force-push/gguf-py
gguf-pyhttps://github.com/vproxy-tools/llama.cpp/tree/will-force-push/gguf-py
grammarshttps://github.com/vproxy-tools/llama.cpp/tree/will-force-push/grammars
grammarshttps://github.com/vproxy-tools/llama.cpp/tree/will-force-push/grammars
includehttps://github.com/vproxy-tools/llama.cpp/tree/will-force-push/include
includehttps://github.com/vproxy-tools/llama.cpp/tree/will-force-push/include
mediahttps://github.com/vproxy-tools/llama.cpp/tree/will-force-push/media
mediahttps://github.com/vproxy-tools/llama.cpp/tree/will-force-push/media
modelshttps://github.com/vproxy-tools/llama.cpp/tree/will-force-push/models
modelshttps://github.com/vproxy-tools/llama.cpp/tree/will-force-push/models
pocshttps://github.com/vproxy-tools/llama.cpp/tree/will-force-push/pocs
pocshttps://github.com/vproxy-tools/llama.cpp/tree/will-force-push/pocs
promptshttps://github.com/vproxy-tools/llama.cpp/tree/will-force-push/prompts
promptshttps://github.com/vproxy-tools/llama.cpp/tree/will-force-push/prompts
requirementshttps://github.com/vproxy-tools/llama.cpp/tree/will-force-push/requirements
requirementshttps://github.com/vproxy-tools/llama.cpp/tree/will-force-push/requirements
scriptshttps://github.com/vproxy-tools/llama.cpp/tree/will-force-push/scripts
scriptshttps://github.com/vproxy-tools/llama.cpp/tree/will-force-push/scripts
srchttps://github.com/vproxy-tools/llama.cpp/tree/will-force-push/src
srchttps://github.com/vproxy-tools/llama.cpp/tree/will-force-push/src
testshttps://github.com/vproxy-tools/llama.cpp/tree/will-force-push/tests
testshttps://github.com/vproxy-tools/llama.cpp/tree/will-force-push/tests
.clang-formathttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/.clang-format
.clang-formathttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/.clang-format
.clang-tidyhttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/.clang-tidy
.clang-tidyhttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/.clang-tidy
.dockerignorehttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/.dockerignore
.dockerignorehttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/.dockerignore
.ecrchttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/.ecrc
.ecrchttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/.ecrc
.editorconfighttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/.editorconfig
.editorconfighttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/.editorconfig
.flake8https://github.com/vproxy-tools/llama.cpp/blob/will-force-push/.flake8
.flake8https://github.com/vproxy-tools/llama.cpp/blob/will-force-push/.flake8
.gitignorehttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/.gitignore
.gitignorehttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/.gitignore
.gitmoduleshttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/.gitmodules
.gitmoduleshttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/.gitmodules
.pre-commit-config.yamlhttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/.pre-commit-config.yaml
.pre-commit-config.yamlhttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/.pre-commit-config.yaml
AUTHORShttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/AUTHORS
AUTHORShttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/AUTHORS
CMakeLists.txthttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/CMakeLists.txt
CMakeLists.txthttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/CMakeLists.txt
CMakePresets.jsonhttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/CMakePresets.json
CMakePresets.jsonhttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/CMakePresets.json
CODEOWNERShttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/CODEOWNERS
CODEOWNERShttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/CODEOWNERS
CONTRIBUTING.mdhttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/CONTRIBUTING.md
CONTRIBUTING.mdhttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/CONTRIBUTING.md
LICENSEhttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/LICENSE
LICENSEhttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/LICENSE
Makefilehttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/Makefile
Makefilehttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/Makefile
README.mdhttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/README.md
README.mdhttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/README.md
SECURITY.mdhttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/SECURITY.md
SECURITY.mdhttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/SECURITY.md
build-xcframework.shhttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/build-xcframework.sh
build-xcframework.shhttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/build-xcframework.sh
convert_hf_to_gguf.pyhttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/convert_hf_to_gguf.py
convert_hf_to_gguf.pyhttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/convert_hf_to_gguf.py
convert_hf_to_gguf_update.pyhttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/convert_hf_to_gguf_update.py
convert_hf_to_gguf_update.pyhttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/convert_hf_to_gguf_update.py
convert_llama_ggml_to_gguf.pyhttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/convert_llama_ggml_to_gguf.py
convert_llama_ggml_to_gguf.pyhttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/convert_llama_ggml_to_gguf.py
convert_lora_to_gguf.pyhttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/convert_lora_to_gguf.py
convert_lora_to_gguf.pyhttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/convert_lora_to_gguf.py
flake.lockhttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/flake.lock
flake.lockhttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/flake.lock
flake.nixhttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/flake.nix
flake.nixhttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/flake.nix
mypy.inihttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/mypy.ini
mypy.inihttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/mypy.ini
poetry.lockhttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/poetry.lock
poetry.lockhttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/poetry.lock
pyproject.tomlhttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/pyproject.toml
pyproject.tomlhttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/pyproject.toml
pyrightconfig.jsonhttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/pyrightconfig.json
pyrightconfig.jsonhttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/pyrightconfig.json
requirements.txthttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/requirements.txt
requirements.txthttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/requirements.txt
READMEhttps://github.com/vproxy-tools/llama.cpp
Contributinghttps://github.com/vproxy-tools/llama.cpp
MIT licensehttps://github.com/vproxy-tools/llama.cpp
Securityhttps://github.com/vproxy-tools/llama.cpp
https://github.com/vproxy-tools/llama.cpp#llamacpp
https://user-images.githubusercontent.com/1991296/230134379-7181e485-c521-4d23-a0d6-f7b3b61ba524.png
https://opensource.org/licenses/MIT
https://github.com/ggml-org/llama.cpp/actions/workflows/server.yml
Roadmaphttps://github.com/users/ggerganov/projects/7
Project statushttps://github.com/ggml-org/llama.cpp/discussions/3471
Manifestohttps://github.com/ggml-org/llama.cpp/discussions/205
ggmlhttps://github.com/ggml-org/ggml
LLaMAhttps://arxiv.org/abs/2302.13971
ggml-org/llama.cpphttps://github.com/ggml-org/llama.cpp/pkgs/container/llama.cpp
ggml-org#11801https://github.com/ggml-org/llama.cpp/discussions/11801
https://github.com/vproxy-tools/llama.cpp#recent-api-changes
Changelog for libllama APIhttps://github.com/ggml-org/llama.cpp/issues/9289
Changelog for llama-server REST APIhttps://github.com/ggml-org/llama.cpp/issues/9291
https://github.com/vproxy-tools/llama.cpp#hot-topics
MTLResidencySethttps://developer.apple.com/documentation/metal/mtlresidencyset?language=objc
ggml-org#11427https://github.com/ggml-org/llama.cpp/pull/11427
https://github.com/ggml-org/llama.vscodehttps://github.com/ggml-org/llama.vscode
tool call supporthttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/docs/function-calling.md
ggml-org#9639https://github.com/ggml-org/llama.cpp/pull/9639
https://github.com/ggml-org/llama.vimhttps://github.com/ggml-org/llama.vim
ggml-org#10123https://github.com/ggml-org/llama.cpp/discussions/10123
ggml-org#9669https://github.com/ggml-org/llama.cpp/discussions/9669
discussionhttps://github.com/ggml-org/llama.cpp/discussions/9268
toolhttps://huggingface.co/spaces/CISCai/gguf-editor
https://github.com/vproxy-tools/llama.cpp#description
ggmlhttps://github.com/ggml-org/ggml
HOWTO-add-model.mdhttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/docs/development/HOWTO-add-model.md
https://github.com/vproxy-tools/llama.cpp#text-only
Mistral 7Bhttps://huggingface.co/mistralai/Mistral-7B-v0.1
Mixtral MoEhttps://huggingface.co/models?search=mistral-ai/Mixtral
DBRXhttps://huggingface.co/databricks/dbrx-instruct
Falconhttps://huggingface.co/models?search=tiiuae/falcon
Chinese LLaMA / Alpacahttps://github.com/ymcui/Chinese-LLaMA-Alpaca
Chinese LLaMA-2 / Alpaca-2https://github.com/ymcui/Chinese-LLaMA-Alpaca-2
Vigogne (French)https://github.com/bofenghuang/vigogne
BERThttps://github.com/ggml-org/llama.cpp/pull/5423
Koalahttps://bair.berkeley.edu/blog/2023/04/03/koala/
Baichuan 1 & 2https://huggingface.co/models?search=baichuan-inc/Baichuan
derivationshttps://huggingface.co/hiyouga/baichuan-7b-sft
Aquila 1 & 2https://huggingface.co/models?search=BAAI/Aquila
Starcoder modelshttps://github.com/ggml-org/llama.cpp/pull/3187
Refacthttps://huggingface.co/smallcloudai/Refact-1_6B-fim
MPThttps://github.com/ggml-org/llama.cpp/pull/3417
Bloomhttps://github.com/ggml-org/llama.cpp/pull/3553
Yi modelshttps://huggingface.co/models?search=01-ai/Yi
StableLM modelshttps://huggingface.co/stabilityai
Deepseek modelshttps://huggingface.co/models?search=deepseek-ai/deepseek
Qwen modelshttps://huggingface.co/models?search=Qwen/Qwen
PLaMo-13Bhttps://github.com/ggml-org/llama.cpp/pull/3557
Phi modelshttps://huggingface.co/models?search=microsoft/phi
PhiMoEhttps://github.com/ggml-org/llama.cpp/pull/11003
GPT-2https://huggingface.co/gpt2
Orion 14Bhttps://github.com/ggml-org/llama.cpp/pull/5118
InternLM2https://huggingface.co/models?search=internlm2
CodeShellhttps://github.com/WisdomShell/codeshell
Gemmahttps://ai.google.dev/gemma
Mambahttps://github.com/state-spaces/mamba
Grok-1https://huggingface.co/keyfan/grok-1-hf
Xversehttps://huggingface.co/models?search=xverse
Command-R modelshttps://huggingface.co/models?search=CohereForAI/c4ai-command-r
SEA-LIONhttps://huggingface.co/models?search=sea-lion
GritLM-7Bhttps://huggingface.co/GritLM/GritLM-7B
GritLM-8x7Bhttps://huggingface.co/GritLM/GritLM-8x7B
OLMohttps://allenai.org/olmo
OLMo 2https://allenai.org/olmo
OLMoEhttps://huggingface.co/allenai/OLMoE-1B-7B-0924
Granite modelshttps://huggingface.co/collections/ibm-granite/granite-code-models-6624c5cec322e4c148c8b330
GPT-NeoXhttps://github.com/EleutherAI/gpt-neox
Pythiahttps://github.com/EleutherAI/pythia
Snowflake-Arctic MoEhttps://huggingface.co/collections/Snowflake/arctic-66290090abe542894a5ac520
Smaughttps://huggingface.co/models?search=Smaug
Poro 34Bhttps://huggingface.co/LumiOpen/Poro-34B
Bitnet b1.58 modelshttps://huggingface.co/1bitLLM
Flan T5https://huggingface.co/models?search=flan-t5
Open Elm modelshttps://huggingface.co/collections/apple/openelm-instruct-models-6619ad295d7ae9f868b759ca
ChatGLM3-6bhttps://huggingface.co/THUDM/chatglm3-6b
ChatGLM4-9bhttps://huggingface.co/THUDM/glm-4-9b
GLMEdge-1.5bhttps://huggingface.co/THUDM/glm-edge-1.5b-chat
GLMEdge-4bhttps://huggingface.co/THUDM/glm-edge-4b-chat
SmolLMhttps://huggingface.co/collections/HuggingFaceTB/smollm-6695016cad7167254ce15966
EXAONE-3.0-7.8B-Instructhttps://huggingface.co/LGAI-EXAONE/EXAONE-3.0-7.8B-Instruct
FalconMamba Modelshttps://huggingface.co/collections/tiiuae/falconmamba-7b-66b9a580324dd1598b0f6d4a
Jaishttps://huggingface.co/inceptionai/jais-13b-chat
Bielik-11B-v2.3https://huggingface.co/collections/speakleash/bielik-11b-v23-66ee813238d9b526a072408a
RWKV-6https://github.com/BlinkDL/RWKV-LM
QRWKV-6https://huggingface.co/recursal/QRWKV6-32B-Instruct-Preview-v0.1
GigaChat-20B-A3Bhttps://huggingface.co/ai-sage/GigaChat-20B-A3B-instruct
https://github.com/vproxy-tools/llama.cpp#multimodal
LLaVA 1.5 modelshttps://huggingface.co/collections/liuhaotian/llava-15-653aac15d994e992e2677a7e
LLaVA 1.6 modelshttps://huggingface.co/collections/liuhaotian/llava-16-65b9e40155f60fd046a5ccf2
BakLLaVAhttps://huggingface.co/models?search=SkunkworksAI/Bakllava
Obsidianhttps://huggingface.co/NousResearch/Obsidian-3B-V0.5
ShareGPT4Vhttps://huggingface.co/models?search=Lin-Chen/ShareGPT4V
MobileVLM 1.7B/3B modelshttps://huggingface.co/models?search=mobileVLM
Yi-VLhttps://huggingface.co/models?search=Yi-VL
Mini CPMhttps://huggingface.co/models?search=MiniCPM
Moondreamhttps://huggingface.co/vikhyatk/moondream2
Bunnyhttps://github.com/BAAI-DCAI/Bunny
GLM-EDGEhttps://huggingface.co/models?search=glm-edge
Qwen2-VLhttps://huggingface.co/collections/Qwen/qwen2-vl-66cee7455501d7126940800d
abetlen/llama-cpp-pythonhttps://github.com/abetlen/llama-cpp-python
go-skynet/go-llama.cpphttps://github.com/go-skynet/go-llama.cpp
withcatai/node-llama-cpphttps://github.com/withcatai/node-llama-cpp
lgrammel/modelfusionhttps://modelfusion.dev/integration/model-provider/llamacpp
offline-ai/clihttps://github.com/offline-ai/cli
tangledgroup/llama-cpp-wasmhttps://github.com/tangledgroup/llama-cpp-wasm
ngxson/wllamahttps://github.com/ngxson/wllama
yoshoku/llama_cpp.rbhttps://github.com/yoshoku/llama_cpp.rb
edgenai/llama_cpp-rshttps://github.com/edgenai/llama_cpp-rs
mdrokz/rust-llama.cpphttps://github.com/mdrokz/rust-llama.cpp
utilityai/llama-cpp-rshttps://github.com/utilityai/llama-cpp-rs
ShelbyJenkins/llm_clienthttps://github.com/ShelbyJenkins/llm_client
SciSharp/LLamaSharphttps://github.com/SciSharp/LLamaSharp
LM-Kit.NEThttps://docs.lm-kit.com/lm-kit-net/index.html
donderom/llm4shttps://github.com/donderom/llm4s
phronmophobic/llama.cljhttps://github.com/phronmophobic/llama.clj
mybigday/llama.rnhttps://github.com/mybigday/llama.rn
kherud/java-llama.cpphttps://github.com/kherud/java-llama.cpp
deins/llama.cpp.zighttps://github.com/Deins/llama.cpp.zig
netdur/llama_cpp_darthttps://github.com/netdur/llama_cpp_dart
xuegao-tzx/Fllamahttps://github.com/xuegao-tzx/Fllama
distantmagic/resonancehttps://github.com/distantmagic/resonance
(more info)https://github.com/ggml-org/llama.cpp/pull/6326
guile_llama_cpphttps://savannah.nongnu.org/projects/guile-llama-cpp
srgtuszy/llama-cpp-swifthttps://github.com/srgtuszy/llama-cpp-swift
ShenghaiWang/SwiftLlamahttps://github.com/ShenghaiWang/SwiftLlama
Embarcadero/llama-cpp-delphihttps://github.com/Embarcadero/llama-cpp-delphi
AI Sublime Text pluginhttps://github.com/yaroslavyaroslav/OpenAI-sublime-text
cztomsik/avahttps://github.com/cztomsik/ava
Dothttps://github.com/alexpinel/Dot
evahttps://github.com/ylsdamxssjxxdd/eva
iohub/collamahttps://github.com/iohub/coLLaMA
janhq/janhttps://github.com/janhq/jan
johnbean393/Sidekickhttps://github.com/johnbean393/Sidekick
KanTVhttps://github.com/zhouwg/kantv?tab=readme-ov-file
KodiBothttps://github.com/firatkiral/kodibot
llama.vimhttps://github.com/ggml-org/llama.vim
LARShttps://github.com/abgulati/LARS
Llama Assistanthttps://github.com/vietanhdev/llama-assistant
LLMFarmhttps://github.com/guinmoon/LLMFarm?tab=readme-ov-file
LLMUnityhttps://github.com/undreamai/LLMUnity
LMStudiohttps://lmstudio.ai/
LocalAIhttps://github.com/mudler/LocalAI
LostRuins/koboldcpphttps://github.com/LostRuins/koboldcpp
MindMachttps://mindmac.app
MindWorkAI/AI-Studiohttps://github.com/MindWorkAI/AI-Studio
Mobile-Artificial-Intelligence/maidhttps://github.com/Mobile-Artificial-Intelligence/maid
Mozilla-Ocho/llamafilehttps://github.com/Mozilla-Ocho/llamafile
nat/openplaygroundhttps://github.com/nat/openplayground
nomic-ai/gpt4allhttps://github.com/nomic-ai/gpt4all
ollama/ollamahttps://github.com/ollama/ollama
oobabooga/text-generation-webuihttps://github.com/oobabooga/text-generation-webui
PocketPal AIhttps://github.com/a-ghorbani/pocketpal-ai
psugihara/FreeChathttps://github.com/psugihara/FreeChat
ptsochantaris/emeltalhttps://github.com/ptsochantaris/emeltal
pythops/tenerehttps://github.com/pythops/tenere
ramalamahttps://github.com/containers/ramalama
semperai/amicahttps://github.com/semperai/amica
withcatai/cataihttps://github.com/withcatai/catai
Autopenhttps://github.com/blackhole89/autopen
akx/ggifyhttps://github.com/akx/ggify
akx/ollama-dlhttps://github.com/akx/ollama-dl
crashr/gppmhttps://github.com/crashr/gppm
gpustack/gguf-parserhttps://github.com/gpustack/gguf-parser-go/tree/main/cmd/gguf-parser
Styled Lineshttps://marketplace.unity.com/packages/tools/generative-ai/styled-lines-llama-cpp-model-292902
Paddlerhttps://github.com/distantmagic/paddler
GPUStackhttps://github.com/gpustack/gpustack
llama_cpp_canisterhttps://github.com/onicai/llama_cpp_canister
llama-swaphttps://github.com/mostlygeek/llama-swap
Kalavaihttps://github.com/kalavai-net/kalavai-client
llmazhttps://github.com/InftyAI/llmaz
Lucy's Labyrinthhttps://github.com/MorganRO8/Lucys_Labyrinth
https://github.com/vproxy-tools/llama.cpp#supported-backends
Metalhttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/docs/build.md#metal-build
BLAShttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/docs/build.md#blas-build
BLIShttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/docs/backend/BLIS.md
SYCLhttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/docs/backend/SYCL.md
MUSAhttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/docs/build.md#musa
CUDAhttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/docs/build.md#cuda
HIPhttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/docs/build.md#hip
Vulkanhttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/docs/build.md#vulkan
CANNhttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/docs/build.md#cann
OpenCLhttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/docs/backend/OPENCL.md
https://github.com/vproxy-tools/llama.cpp#building-the-project
include/llama.hhttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/include/llama.h
how to buildhttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/docs/build.md
brew, flox or nixhttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/docs/install.md
documentation for Dockerhttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/docs/docker.md
releaseshttps://github.com/ggml-org/llama.cpp/releases
https://github.com/vproxy-tools/llama.cpp#obtaining-and-quantizing-models
Hugging Facehttps://huggingface.co
number of LLMshttps://huggingface.co/models?library=gguf&sort=trending
Trendinghttps://huggingface.co/models?library=gguf&sort=trending
LLaMAhttps://huggingface.co/models?sort=trending&search=llama+gguf
GGUFhttps://github.com/ggml-org/ggml/blob/master/docs/gguf.md
GGUF-my-repo spacehttps://huggingface.co/spaces/ggml-org/gguf-my-repo
GGUF-my-LoRA spacehttps://huggingface.co/spaces/ggml-org/gguf-my-lora
ggml-org#10123https://github.com/ggml-org/llama.cpp/discussions/10123
GGUF-editor spacehttps://huggingface.co/spaces/CISCai/gguf-editor
ggml-org#9268https://github.com/ggml-org/llama.cpp/discussions/9268
Inference Endpointshttps://ui.endpoints.huggingface.co/
ggml-org#9669https://github.com/ggml-org/llama.cpp/discussions/9669
read this documentationhttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/examples/quantize/README.md
llama-clihttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/examples/main
https://github.com/vproxy-tools/llama.cpp#llama-cli
https://github.com/vproxy-tools/llama.cpp#a-cli-tool-for-accessing-and-experimenting-with-most-of-llamacpps-functionality
grammars/https://github.com/vproxy-tools/llama.cpp/blob/will-force-push/grammars
GBNF Guidehttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/grammars/README.md
https://grammar.intrinsiclabs.ai/https://grammar.intrinsiclabs.ai/
llama-serverhttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/examples/server
https://github.com/vproxy-tools/llama.cpp#llama-server
OpenAI APIhttps://github.com/openai/openai-openapi
https://github.com/vproxy-tools/llama.cpp#a-lightweight-openai-api-compatible-http-server-for-serving-llms
llama-perplexityhttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/examples/perplexity
https://github.com/vproxy-tools/llama.cpp#llama-perplexity
1https://github.com/vproxy-tools/llama.cpp#user-content-fn-1-48ada2fcc2c3ab39d4b3f2751f60e325
2https://github.com/vproxy-tools/llama.cpp#user-content-fn-2-48ada2fcc2c3ab39d4b3f2751f60e325
https://github.com/vproxy-tools/llama.cpp#a-tool-for-measuring-the-perplexity-12-and-other-quality-metrics-of-a-model-over-a-given-text
llama-benchhttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/examples/llama-bench
https://github.com/vproxy-tools/llama.cpp#llama-bench
https://github.com/vproxy-tools/llama.cpp#benchmark-the-performance-of-the-inference-for-various-parameters
llama-runhttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/examples/run
https://github.com/vproxy-tools/llama.cpp#llama-run
3https://github.com/vproxy-tools/llama.cpp#user-content-fn-3-48ada2fcc2c3ab39d4b3f2751f60e325
https://github.com/vproxy-tools/llama.cpp#a-comprehensive-example-for-running-llamacpp-models-useful-for-inferencing-used-with-ramalama-3
llama-simplehttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/examples/simple
https://github.com/vproxy-tools/llama.cpp#llama-simple
https://github.com/vproxy-tools/llama.cpp#a-minimal-example-for-implementing-apps-with-llamacpp-useful-for-developers
https://github.com/vproxy-tools/llama.cpp#contributing
good first issueshttps://github.com/ggml-org/llama.cpp/issues?q=is%3Aissue+is%3Aopen+label%3A%22good+first+issue%22
CONTRIBUTING.mdhttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/CONTRIBUTING.md
Inference at the edgehttps://github.com/ggml-org/llama.cpp/discussions/205
Changelog podcasthttps://changelog.com/podcast/532
https://github.com/vproxy-tools/llama.cpp#other-documentation
main (cli)https://github.com/vproxy-tools/llama.cpp/blob/will-force-push/examples/main/README.md
serverhttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/examples/server/README.md
GBNF grammarshttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/grammars/README.md
https://github.com/vproxy-tools/llama.cpp#development-documentation
How to buildhttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/docs/build.md
Running on Dockerhttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/docs/docker.md
Build on Androidhttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/docs/android.md
Performance troubleshootinghttps://github.com/vproxy-tools/llama.cpp/blob/will-force-push/docs/development/token_generation_performance_tips.md
GGML tips & trickshttps://github.com/ggml-org/llama.cpp/wiki/GGML-Tips-&-Tricks
https://github.com/vproxy-tools/llama.cpp#seminal-papers-and-background-on-the-models
Introducing LLaMA: A foundational, 65-billion-parameter large language modelhttps://ai.facebook.com/blog/large-language-model-llama-meta-ai/
LLaMA: Open and Efficient Foundation Language Modelshttps://arxiv.org/abs/2302.13971
Language Models are Few-Shot Learnershttps://arxiv.org/abs/2005.14165
Aligning language models to follow instructionshttps://openai.com/research/instruction-following
Training language models to follow instructions with human feedbackhttps://arxiv.org/abs/2203.02155
https://github.com/vproxy-tools/llama.cpp#completions
https://github.com/vproxy-tools/llama.cpp#bash-completion
https://github.com/vproxy-tools/llama.cpp#references
examples/perplexity/README.mdhttps://github.com/vproxy-tools/examples/perplexity/README.md
https://github.com/vproxy-tools/llama.cpp#user-content-fnref-1-48ada2fcc2c3ab39d4b3f2751f60e325
https://huggingface.co/docs/transformers/perplexityhttps://huggingface.co/docs/transformers/perplexity
https://github.com/vproxy-tools/llama.cpp#user-content-fnref-2-48ada2fcc2c3ab39d4b3f2751f60e325
RamaLamahttps://github.com/containers/ramalama
https://github.com/vproxy-tools/llama.cpp#user-content-fnref-3-48ada2fcc2c3ab39d4b3f2751f60e325
Readme https://github.com/vproxy-tools/llama.cpp#readme-ov-file
MIT license https://github.com/vproxy-tools/llama.cpp#MIT-1-ov-file
Contributing https://github.com/vproxy-tools/llama.cpp#contributing-ov-file
Security policy https://github.com/vproxy-tools/llama.cpp#security-ov-file
Please reload this pagehttps://github.com/vproxy-tools/llama.cpp
Activityhttps://github.com/vproxy-tools/llama.cpp/activity
Custom propertieshttps://github.com/vproxy-tools/llama.cpp/custom-properties
2 starshttps://github.com/vproxy-tools/llama.cpp/stargazers
0 watchinghttps://github.com/vproxy-tools/llama.cpp/watchers
0 forkshttps://github.com/vproxy-tools/llama.cpp/forks
Report repository https://github.com/contact/report-content?content_url=https%3A%2F%2Fgithub.com%2Fvproxy-tools%2Fllama.cpp&report=vproxy-tools+%28user%29
Releaseshttps://github.com/vproxy-tools/llama.cpp/releases
Packages 0https://github.com/orgs/vproxy-tools/packages?repo_name=llama.cpp
https://github.com
Termshttps://docs.github.com/site-policy/github-terms/github-terms-of-service
Privacyhttps://docs.github.com/site-policy/privacy-policies/github-privacy-statement
Securityhttps://github.com/security
Statushttps://www.githubstatus.com/
Communityhttps://github.community/
Docshttps://docs.github.com/
Contacthttps://support.github.com?tags=dotcom-footer

Viewport: width=device-width


URLs of crawlers that visited me.