René's URL Explorer Experiment

Title: GitHub - guyvdb/llama.cpp: Port of Facebook's LLaMA model in C/C++

Open Graph Title: GitHub - guyvdb/llama.cpp: Port of Facebook's LLaMA model in C/C++

X Title: GitHub - guyvdb/llama.cpp: Port of Facebook's LLaMA model in C/C++

Description: Port of Facebook's LLaMA model in C/C++. Contribute to guyvdb/llama.cpp development by creating an account on GitHub.

Open Graph Description: Port of Facebook's LLaMA model in C/C++. Contribute to guyvdb/llama.cpp development by creating an account on GitHub.

X Description: Port of Facebook's LLaMA model in C/C++. Contribute to guyvdb/llama.cpp development by creating an account on GitHub.

Opengraph URL: https://github.com/guyvdb/llama.cpp

X: @github

direct link

Domain: patch-diff.githubusercontent.com

route-pattern	/:user_id/:repository
route-controller	files
route-action	disambiguate
fetch-nonce	v2:6b3e472a-83d6-7a55-d7e6-78c31287922c
current-catalog-service-hash	f3abb0cc802f3d7b95fc8762b94bdcb13bf39634c40c357301c4aa1d67a256fb
request-id	8C22:340321:BD56DA4:F4BEC0E:6976AC30
html-safe-nonce	67b3cd8b35f24b98ccf59b57b094cc2e2b07d565c704cf88c759b5da50b0c81b
visitor-payload	eyJyZWZlcnJlciI6IiIsInJlcXVlc3RfaWQiOiI4QzIyOjM0MDMyMTpCRDU2REE0OkY0QkVDMEU6Njk3NkFDMzAiLCJ2aXNpdG9yX2lkIjoiODk2OTExMDgzNzk4OTM4NzMxMiIsInJlZ2lvbl9lZGdlIjoiaWFkIiwicmVnaW9uX3JlbmRlciI6ImlhZCJ9
visitor-hmac	0fdb7a6a9a753b91f06ec28908473e2d815f4fca68513cc744a5b87180452454
hovercard-subject-tag	repository:625166367
github-keyboard-shortcuts	repository,copilot
google-site-verification	Apib7-x98H0j5cPqHWwSMm6dNU4GmODRoqxLiDzdx9I
octolytics-url	https://collector.github.com/github/collect
analytics-location	//
fb:app_id	1401488693436528
apple-itunes-app	app-id=1477376905, app-argument=https://github.com/guyvdb/llama.cpp
twitter:image	https://opengraph.githubassets.com/7e3ee2ac26e88481a4bf2821c5aa969fc8de70ef1b359a8b724d36a10b5d3503/guyvdb/llama.cpp
twitter:card	summary_large_image
og:image	https://opengraph.githubassets.com/7e3ee2ac26e88481a4bf2821c5aa969fc8de70ef1b359a8b724d36a10b5d3503/guyvdb/llama.cpp
og:image:alt	Port of Facebook's LLaMA model in C/C++. Contribute to guyvdb/llama.cpp development by creating an account on GitHub.
og:image:width	1200
og:image:height	600
og:site_name	GitHub
og:type	object
hostname	github.com
expected-hostname	github.com
None	032152924a283b83384255d9489e7b93b54ba01da8d380b05ecd3953b3212411
turbo-cache-control	no-preview
go-import	github.com/guyvdb/llama.cpp git https://github.com/guyvdb/llama.cpp.git
octolytics-dimension-user_id	3418
octolytics-dimension-user_login	guyvdb
octolytics-dimension-repository_id	625166367
octolytics-dimension-repository_nwo	guyvdb/llama.cpp
octolytics-dimension-repository_public	true
octolytics-dimension-repository_is_fork	true
octolytics-dimension-repository_parent_id	612354784
octolytics-dimension-repository_parent_nwo	ggml-org/llama.cpp
octolytics-dimension-repository_network_root_id	612354784
octolytics-dimension-repository_network_root_nwo	ggml-org/llama.cpp
turbo-body-classes	logged-out env-production page-responsive
disable-turbo	false
browser-stats-url	https://api.github.com/_private/browser/stats
browser-errors-url	https://api.github.com/_private/browser/errors
release	5b577f6be6482e336e3c30e8daefa30144947b17
ui-target	full
theme-color	#1e2327
color-scheme	light dark

Links:

Skip to content	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#start-of-content
	https://patch-diff.githubusercontent.com/
Sign in	https://patch-diff.githubusercontent.com/login?return_to=https%3A%2F%2Fgithub.com%2Fguyvdb%2Fllama.cpp
GitHub CopilotWrite better code with AI	https://github.com/features/copilot
GitHub SparkBuild and deploy intelligent apps	https://github.com/features/spark
GitHub ModelsManage and compare prompts	https://github.com/features/models
MCP RegistryNewIntegrate external tools	https://github.com/mcp
ActionsAutomate any workflow	https://github.com/features/actions
CodespacesInstant dev environments	https://github.com/features/codespaces
IssuesPlan and track work	https://github.com/features/issues
Code ReviewManage code changes	https://github.com/features/code-review
GitHub Advanced SecurityFind and fix vulnerabilities	https://github.com/security/advanced-security
Code securitySecure your code as you build	https://github.com/security/advanced-security/code-security
Secret protectionStop leaks before they start	https://github.com/security/advanced-security/secret-protection
Why GitHub	https://github.com/why-github
Documentation	https://docs.github.com
Blog	https://github.blog
Changelog	https://github.blog/changelog
Marketplace	https://github.com/marketplace
View all features	https://github.com/features
Enterprises	https://github.com/enterprise
Small and medium teams	https://github.com/team
Startups	https://github.com/enterprise/startups
Nonprofits	https://github.com/solutions/industry/nonprofits
App Modernization	https://github.com/solutions/use-case/app-modernization
DevSecOps	https://github.com/solutions/use-case/devsecops
DevOps	https://github.com/solutions/use-case/devops
CI/CD	https://github.com/solutions/use-case/ci-cd
View all use cases	https://github.com/solutions/use-case
Healthcare	https://github.com/solutions/industry/healthcare
Financial services	https://github.com/solutions/industry/financial-services
Manufacturing	https://github.com/solutions/industry/manufacturing
Government	https://github.com/solutions/industry/government
View all industries	https://github.com/solutions/industry
View all solutions	https://github.com/solutions
AI	https://github.com/resources/articles?topic=ai
Software Development	https://github.com/resources/articles?topic=software-development
DevOps	https://github.com/resources/articles?topic=devops
Security	https://github.com/resources/articles?topic=security
View all topics	https://github.com/resources/articles
Customer stories	https://github.com/customer-stories
Events & webinars	https://github.com/resources/events
Ebooks & reports	https://github.com/resources/whitepapers
Business insights	https://github.com/solutions/executive-insights
GitHub Skills	https://skills.github.com
Documentation	https://docs.github.com
Customer support	https://support.github.com
Community forum	https://github.com/orgs/community/discussions
Trust center	https://github.com/trust-center
Partners	https://github.com/partners
GitHub SponsorsFund open source developers	https://github.com/sponsors
Security Lab	https://securitylab.github.com
Maintainer Community	https://maintainers.github.com
Accelerator	https://github.com/accelerator
Archive Program	https://archiveprogram.github.com
Topics	https://github.com/topics
Trending	https://github.com/trending
Collections	https://github.com/collections
Enterprise platformAI-powered developer platform	https://github.com/enterprise
GitHub Advanced SecurityEnterprise-grade security features	https://github.com/security/advanced-security
Copilot for BusinessEnterprise-grade AI features	https://github.com/features/copilot/copilot-business
Premium SupportEnterprise-grade 24/7 support	https://github.com/premium-support
Pricing	https://github.com/pricing
Search syntax tips	https://docs.github.com/search-github/github-code-search/understanding-github-code-search-syntax
documentation	https://docs.github.com/search-github/github-code-search/understanding-github-code-search-syntax
Sign in	https://patch-diff.githubusercontent.com/login?return_to=https%3A%2F%2Fgithub.com%2Fguyvdb%2Fllama.cpp
Sign up	https://patch-diff.githubusercontent.com/signup?ref_cta=Sign+up&ref_loc=header+logged+out&ref_page=%2F%3Cuser-name%3E%2F%3Crepo-name%3E&source=header-repo&source_repo=guyvdb%2Fllama.cpp
Reload	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp
Reload	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp
Reload	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp
guyvdb	https://patch-diff.githubusercontent.com/guyvdb
llama.cpp	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp
ggml-org/llama.cpp	https://patch-diff.githubusercontent.com/ggml-org/llama.cpp
Notifications	https://patch-diff.githubusercontent.com/login?return_to=%2Fguyvdb%2Fllama.cpp
Fork 0	https://patch-diff.githubusercontent.com/login?return_to=%2Fguyvdb%2Fllama.cpp
Star 0	https://patch-diff.githubusercontent.com/login?return_to=%2Fguyvdb%2Fllama.cpp
MIT license	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/LICENSE
0 stars	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/stargazers
14.6k forks	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/forks
Branches	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/branches
Tags	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/tags
Activity	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/activity
Star	https://patch-diff.githubusercontent.com/login?return_to=%2Fguyvdb%2Fllama.cpp
Notifications	https://patch-diff.githubusercontent.com/login?return_to=%2Fguyvdb%2Fllama.cpp
Code	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp
Pull requests 0	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/pulls
Actions	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/actions
Projects 0	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/projects
Security 0	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/security
Insights	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/pulse
Code	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp
Pull requests	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/pulls
Actions	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/actions
Projects	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/projects
Security	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/security
Insights	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/pulse
Branches	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/branches
Tags	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/tags
	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/branches
	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/tags
2,611 Commits	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/commits/master/
	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/commits/master/
.devops	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/tree/master/.devops
.devops	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/tree/master/.devops
.github	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/tree/master/.github
.github	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/tree/master/.github
ci	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/tree/master/ci
ci	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/tree/master/ci
cmake	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/tree/master/cmake
cmake	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/tree/master/cmake
common	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/tree/master/common
common	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/tree/master/common
docs	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/tree/master/docs
docs	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/tree/master/docs
examples	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/tree/master/examples
examples	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/tree/master/examples
ggml-cuda	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/tree/master/ggml-cuda
ggml-cuda	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/tree/master/ggml-cuda
gguf-py	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/tree/master/gguf-py
gguf-py	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/tree/master/gguf-py
grammars	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/tree/master/grammars
grammars	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/tree/master/grammars
kompute @ 4565194	https://patch-diff.githubusercontent.com/nomic-ai/kompute/tree/4565194ed7c32d1d2efa32ceab4d3c6cae006306
kompute @ 4565194	https://patch-diff.githubusercontent.com/nomic-ai/kompute/tree/4565194ed7c32d1d2efa32ceab4d3c6cae006306
kompute-shaders	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/tree/master/kompute-shaders
kompute-shaders	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/tree/master/kompute-shaders
media	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/tree/master/media
media	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/tree/master/media
models	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/tree/master/models
models	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/tree/master/models
pocs	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/tree/master/pocs
pocs	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/tree/master/pocs
prompts	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/tree/master/prompts
prompts	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/tree/master/prompts
requirements	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/tree/master/requirements
requirements	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/tree/master/requirements
scripts	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/tree/master/scripts
scripts	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/tree/master/scripts
spm-headers	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/tree/master/spm-headers
spm-headers	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/tree/master/spm-headers
tests	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/tree/master/tests
tests	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/tree/master/tests
.clang-tidy	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/.clang-tidy
.clang-tidy	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/.clang-tidy
.dockerignore	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/.dockerignore
.dockerignore	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/.dockerignore
.ecrc	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/.ecrc
.ecrc	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/.ecrc
.editorconfig	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/.editorconfig
.editorconfig	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/.editorconfig
.flake8	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/.flake8
.flake8	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/.flake8
.gitignore	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/.gitignore
.gitignore	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/.gitignore
.gitmodules	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/.gitmodules
.gitmodules	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/.gitmodules
.pre-commit-config.yaml	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/.pre-commit-config.yaml
.pre-commit-config.yaml	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/.pre-commit-config.yaml
CMakeLists.txt	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/CMakeLists.txt
CMakeLists.txt	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/CMakeLists.txt
LICENSE	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/LICENSE
LICENSE	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/LICENSE
Makefile	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/Makefile
Makefile	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/Makefile
Package.swift	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/Package.swift
Package.swift	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/Package.swift
README-sycl.md	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/README-sycl.md
README-sycl.md	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/README-sycl.md
README.md	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/README.md
README.md	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/README.md
SECURITY.md	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/SECURITY.md
SECURITY.md	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/SECURITY.md
build.zig	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/build.zig
build.zig	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/build.zig
codecov.yml	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/codecov.yml
codecov.yml	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/codecov.yml
convert-hf-to-gguf.py	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/convert-hf-to-gguf.py
convert-hf-to-gguf.py	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/convert-hf-to-gguf.py
convert-llama-ggml-to-gguf.py	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/convert-llama-ggml-to-gguf.py
convert-llama-ggml-to-gguf.py	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/convert-llama-ggml-to-gguf.py
convert-lora-to-ggml.py	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/convert-lora-to-ggml.py
convert-lora-to-ggml.py	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/convert-lora-to-ggml.py
convert-persimmon-to-gguf.py	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/convert-persimmon-to-gguf.py
convert-persimmon-to-gguf.py	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/convert-persimmon-to-gguf.py
convert.py	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/convert.py
convert.py	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/convert.py
flake.lock	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/flake.lock
flake.lock	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/flake.lock
flake.nix	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/flake.nix
flake.nix	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/flake.nix
ggml-alloc.c	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-alloc.c
ggml-alloc.c	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-alloc.c
ggml-alloc.h	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-alloc.h
ggml-alloc.h	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-alloc.h
ggml-backend-impl.h	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-backend-impl.h
ggml-backend-impl.h	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-backend-impl.h
ggml-backend.c	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-backend.c
ggml-backend.c	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-backend.c
ggml-backend.h	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-backend.h
ggml-backend.h	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-backend.h
ggml-common.h	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-common.h
ggml-common.h	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-common.h
ggml-cuda.cu	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-cuda.cu
ggml-cuda.cu	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-cuda.cu
ggml-cuda.h	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-cuda.h
ggml-cuda.h	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-cuda.h
ggml-impl.h	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-impl.h
ggml-impl.h	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-impl.h
ggml-kompute.cpp	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-kompute.cpp
ggml-kompute.cpp	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-kompute.cpp
ggml-kompute.h	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-kompute.h
ggml-kompute.h	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-kompute.h
ggml-metal.h	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-metal.h
ggml-metal.h	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-metal.h
ggml-metal.m	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-metal.m
ggml-metal.m	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-metal.m
ggml-metal.metal	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-metal.metal
ggml-metal.metal	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-metal.metal
ggml-mpi.c	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-mpi.c
ggml-mpi.c	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-mpi.c
ggml-mpi.h	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-mpi.h
ggml-mpi.h	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-mpi.h
ggml-opencl.cpp	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-opencl.cpp
ggml-opencl.cpp	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-opencl.cpp
ggml-opencl.h	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-opencl.h
ggml-opencl.h	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-opencl.h
ggml-quants.c	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-quants.c
ggml-quants.c	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-quants.c
ggml-quants.h	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-quants.h
ggml-quants.h	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-quants.h
ggml-sycl.cpp	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-sycl.cpp
ggml-sycl.cpp	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-sycl.cpp
ggml-sycl.h	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-sycl.h
ggml-sycl.h	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-sycl.h
ggml-vulkan-shaders.hpp	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-vulkan-shaders.hpp
ggml-vulkan-shaders.hpp	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-vulkan-shaders.hpp
ggml-vulkan.cpp	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-vulkan.cpp
ggml-vulkan.cpp	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-vulkan.cpp
ggml-vulkan.h	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-vulkan.h
ggml-vulkan.h	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-vulkan.h
ggml.c	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml.c
ggml.c	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml.c
ggml.h	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml.h
ggml.h	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml.h
ggml_vk_generate_shaders.py	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml_vk_generate_shaders.py
ggml_vk_generate_shaders.py	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml_vk_generate_shaders.py
llama.cpp	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/llama.cpp
llama.cpp	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/llama.cpp
llama.h	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/llama.h
llama.h	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/llama.h
mypy.ini	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/mypy.ini
mypy.ini	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/mypy.ini
requirements.txt	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/requirements.txt
requirements.txt	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/requirements.txt
unicode-data.cpp	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/unicode-data.cpp
unicode-data.cpp	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/unicode-data.cpp
unicode-data.h	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/unicode-data.h
unicode-data.h	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/unicode-data.h
unicode.cpp	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/unicode.cpp
unicode.cpp	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/unicode.cpp
unicode.h	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/unicode.h
unicode.h	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/unicode.h
README	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp
License	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp
Security	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp
	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#llamacpp
	https://user-images.githubusercontent.com/1991296/230134379-7181e485-c521-4d23-a0d6-f7b3b61ba524.png
	https://opensource.org/licenses/MIT
Roadmap	https://github.com/users/ggerganov/projects/7
Project status	https://github.com/ggerganov/llama.cpp/discussions/3471
Manifesto	https://github.com/ggerganov/llama.cpp/discussions/205
ggml	https://github.com/ggerganov/ggml
LLaMA	https://arxiv.org/abs/2302.13971
	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#recent-api-changes
ggml-org#6122	https://github.com/ggml-org/llama.cpp/pull/6122
ggml-org#6017	https://github.com/ggml-org/llama.cpp/pull/6017
ggml-org#5328	https://github.com/ggml-org/llama.cpp/pull/5328
ggml-org#5796	https://github.com/ggml-org/llama.cpp/pull/5796
ggml-org#5849	https://github.com/ggml-org/llama.cpp/pull/5849
	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#hot-topics
ggml-org#6387	https://github.com/ggml-org/llama.cpp/pull/6387
ggml-org#6404	https://github.com/ggml-org/llama.cpp/discussions/6404
ggml-org#6225	https://github.com/ggml-org/llama.cpp/pull/6225
ggml-org#6017	https://github.com/ggml-org/llama.cpp/pull/6017
ggml-org#5981	https://github.com/ggml-org/llama.cpp/issues/5981
ggml-org#5962	https://github.com/ggml-org/llama.cpp/discussions/5962
ggml-org#5328	https://github.com/ggml-org/llama.cpp/pull/5328
Description	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#description
Usage	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#usage
Get the Code	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#get-the-code
Build	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#build
BLAS Build	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#blas-build
Prepare and Quantize	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#prepare-and-quantize
Run the quantized model	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#run-the-quantized-model
Memory/Disk Requirements	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#memorydisk-requirements
Quantization	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#quantization
Interactive mode	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#interactive-mode
Constrained output with grammars	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#constrained-output-with-grammars
Instruct mode	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#instruct-mode
Obtaining and using the Facebook LLaMA 2 model	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#obtaining-and-using-the-facebook-llama-2-model
Seminal papers and background on the models	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#seminal-papers-and-background-on-the-models
Perplexity (measuring model quality)	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#perplexity-measuring-model-quality
Android	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#android
Docker	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#docker
Contributing	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#contributing
Coding guidelines	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#coding-guidelines
Docs	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#docs
	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#description
inception	https://github.com/ggerganov/llama.cpp/issues/33#issuecomment-1465108022
ggml	https://github.com/ggerganov/ggml
Mistral 7B	https://huggingface.co/mistralai/Mistral-7B-v0.1
Mixtral MoE	https://huggingface.co/models?search=mistral-ai/Mixtral
Chinese LLaMA / Alpaca	https://github.com/ymcui/Chinese-LLaMA-Alpaca
Chinese LLaMA-2 / Alpaca-2	https://github.com/ymcui/Chinese-LLaMA-Alpaca-2
Vigogne (French)	https://github.com/bofenghuang/vigogne
Koala	https://bair.berkeley.edu/blog/2023/04/03/koala/
Baichuan 1 & 2	https://huggingface.co/models?search=baichuan-inc/Baichuan
derivations	https://huggingface.co/hiyouga/baichuan-7b-sft
Aquila 1 & 2	https://huggingface.co/models?search=BAAI/Aquila
Starcoder models	https://github.com/ggerganov/llama.cpp/pull/3187
Refact	https://huggingface.co/smallcloudai/Refact-1_6B-fim
Persimmon 8B	https://github.com/ggerganov/llama.cpp/pull/3410
MPT	https://github.com/ggerganov/llama.cpp/pull/3417
Bloom	https://github.com/ggerganov/llama.cpp/pull/3553
Yi models	https://huggingface.co/models?search=01-ai/Yi
StableLM models	https://huggingface.co/stabilityai
Deepseek models	https://huggingface.co/models?search=deepseek-ai/deepseek
Qwen models	https://huggingface.co/models?search=Qwen/Qwen
PLaMo-13B	https://github.com/ggerganov/llama.cpp/pull/3557
Phi models	https://huggingface.co/models?search=microsoft/phi
GPT-2	https://huggingface.co/gpt2
Orion 14B	https://github.com/ggerganov/llama.cpp/pull/5118
InternLM2	https://huggingface.co/models?search=internlm2
CodeShell	https://github.com/WisdomShell/codeshell
Gemma	https://ai.google.dev/gemma
Mamba	https://github.com/state-spaces/mamba
Xverse	https://huggingface.co/models?search=xverse
Command-R	https://huggingface.co/CohereForAI/c4ai-command-r-v01
SEA-LION	https://huggingface.co/models?search=sea-lion
LLaVA 1.5 models	https://huggingface.co/collections/liuhaotian/llava-15-653aac15d994e992e2677a7e
LLaVA 1.6 models	https://huggingface.co/collections/liuhaotian/llava-16-65b9e40155f60fd046a5ccf2
BakLLaVA	https://huggingface.co/models?search=SkunkworksAI/Bakllava
Obsidian	https://huggingface.co/NousResearch/Obsidian-3B-V0.5
ShareGPT4V	https://huggingface.co/models?search=Lin-Chen/ShareGPT4V
MobileVLM 1.7B/3B models	https://huggingface.co/models?search=mobileVLM
Yi-VL	https://huggingface.co/models?search=Yi-VL
llama.cpp web server	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/examples/server
OpenAI API	https://github.com/openai/openai-openapi
abetlen/llama-cpp-python	https://github.com/abetlen/llama-cpp-python
go-skynet/go-llama.cpp	https://github.com/go-skynet/go-llama.cpp
withcatai/node-llama-cpp	https://github.com/withcatai/node-llama-cpp
lgrammel/modelfusion	https://modelfusion.dev/integration/model-provider/llamacpp
tangledgroup/llama-cpp-wasm	https://github.com/tangledgroup/llama-cpp-wasm
ngxson/wllama	https://github.com/ngxson/wllama
yoshoku/llama_cpp.rb	https://github.com/yoshoku/llama_cpp.rb
edgenai/llama_cpp-rs	https://github.com/edgenai/llama_cpp-rs
mdrokz/rust-llama.cpp	https://github.com/mdrokz/rust-llama.cpp
utilityai/llama-cpp-rs	https://github.com/utilityai/llama-cpp-rs
SciSharp/LLamaSharp	https://github.com/SciSharp/LLamaSharp
donderom/llm4s	https://github.com/donderom/llm4s
phronmophobic/llama.clj	https://github.com/phronmophobic/llama.clj
mybigday/llama.rn	https://github.com/mybigday/llama.rn
kherud/java-llama.cpp	https://github.com/kherud/java-llama.cpp
deins/llama.cpp.zig	https://github.com/Deins/llama.cpp.zig
netdur/llama_cpp_dart	https://github.com/netdur/llama_cpp_dart
distantmagic/resonance	https://github.com/distantmagic/resonance
(more info)	https://github.com/ggerganov/llama.cpp/pull/6326
iohub/collama	https://github.com/iohub/coLLaMA
janhq/jan	https://github.com/janhq/jan
nat/openplayground	https://github.com/nat/openplayground
Faraday	https://faraday.dev/
LMStudio	https://lmstudio.ai/
LocalAI	https://github.com/mudler/LocalAI
LostRuins/koboldcpp	https://github.com/LostRuins/koboldcpp
Mozilla-Ocho/llamafile	https://github.com/Mozilla-Ocho/llamafile
nomic-ai/gpt4all	https://github.com/nomic-ai/gpt4all
ollama/ollama	https://github.com/ollama/ollama
oobabooga/text-generation-webui	https://github.com/oobabooga/text-generation-webui
psugihara/FreeChat	https://github.com/psugihara/FreeChat
cztomsik/ava	https://github.com/cztomsik/ava
ptsochantaris/emeltal	https://github.com/ptsochantaris/emeltal
pythops/tenere	https://github.com/pythops/tenere
RecurseChat	https://recurse.chat/
semperai/amica	https://github.com/semperai/amica
withcatai/catai	https://github.com/withcatai/catai
Mobile-Artificial-Intelligence/maid	https://github.com/Mobile-Artificial-Intelligence/maid
Msty	https://msty.app
LLMFarm	https://github.com/guinmoon/LLMFarm?tab=readme-ov-file
KanTV	https://github.com/zhouwg/kantv?tab=readme-ov-file
Dot	https://github.com/alexpinel/Dot
whisper.cpp	https://github.com/ggerganov/whisper.cpp
	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#usage
	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#get-the-code
	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#build
w64devkit	https://github.com/skeeto/w64devkit/releases
DRM in FreeBSD	https://wiki.freebsd.org/Graphics
	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#metal-build
	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#mpi-build
MPICH	https://www.mpich.org
OpenMPI	https://www.open-mpi.org
	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#blas-build
	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#accelerate-framework
	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#openblas
w64devkit	https://github.com/skeeto/w64devkit/releases
OpenBLAS for Windows	https://github.com/xianyi/OpenBLAS/releases
	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#blis
BLIS.md	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/docs/BLIS.md
	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#sycl
llama.cpp for SYCL	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/README-sycl.md
	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#intel-onemkl
llama.cpp for SYCL	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/README-sycl.md
oneAPI-basekit	https://hub.docker.com/r/intel/oneapi-basekit
Optimizing and Running LLaMA2 on Intel® CPU	https://www.intel.com/content/www/us/en/content-details/791610/optimizing-and-running-llama2-on-intel-cpu.html
	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#cuda
CUDA Toolkit	https://developer.nvidia.com/cuda-downloads
Offical Support	https://www.jetson-ai-lab.com/tutorial_text-generation.html
CUDA_VISIBLE_DEVICES	https://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html#env-vars
	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#hipblas
ROCm Quick Start (Linux)	https://rocm.docs.amd.com/en/latest/deploy/linux/quick_start.html
here	https://llvm.org/docs/AMDGPUUsage.html#processors
HIP_VISIBLE_DEVICES	https://rocm.docs.amd.com/en/latest/understand/gpu_isolation.html#hip-visible-devices
	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#clblast
CLBlast	https://github.com/CNugteren/CLBlast
OpenCL SDK	https://github.com/KhronosGroup/OpenCL-SDK
OpenCL Releases	https://github.com/KhronosGroup/OpenCL-SDK/releases
	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#installing-clblast
CLBlast Releases	https://github.com/CNugteren/CLBlast/releases
	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#building-llama-with-clblast
	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#running-llama-with-clblast
	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#vulkan
Vulkan SDK	https://vulkan.lunarg.com/doc/view/latest/linux/getting_started_ubuntu.html
	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#prepare-and-quantize
Obtaining and using the Facebook LLaMA 2 model	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#obtaining-and-using-the-facebook-llama-2-model
	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#run-the-quantized-model
	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#running-on-windows-with-prebuilt-binaries
	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#memorydisk-requirements
	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#quantization
k-quants	https://github.com/ggerganov/llama.cpp/pull/1684
#2707	https://github.com/ggerganov/llama.cpp/pull/2707
#2807	https://github.com/ggerganov/llama.cpp/pull/2807
#4773 - 2-bit i-quants (inference)	https://github.com/ggerganov/llama.cpp/pull/4773
#4856 - 2-bit i-quants (inference)	https://github.com/ggerganov/llama.cpp/pull/4856
#4861 - importance matrix	https://github.com/ggerganov/llama.cpp/pull/4861
#4872 - MoE models	https://github.com/ggerganov/llama.cpp/pull/4872
#4897 - 2-bit quantization	https://github.com/ggerganov/llama.cpp/pull/4897
#4930 - imatrix for all k-quants	https://github.com/ggerganov/llama.cpp/pull/4930
#4951 - imatrix on the GPU	https://github.com/ggerganov/llama.cpp/pull/4957
#4969 - imatrix for legacy quants	https://github.com/ggerganov/llama.cpp/pull/4969
#4996 - k-qunats tuning	https://github.com/ggerganov/llama.cpp/pull/4996
#5060 - Q3_K_XS	https://github.com/ggerganov/llama.cpp/pull/5060
#5196 - 3-bit i-quants	https://github.com/ggerganov/llama.cpp/pull/5196
quantization tuning	https://github.com/ggerganov/llama.cpp/pull/5320
another one	https://github.com/ggerganov/llama.cpp/pull/5334
another one	https://github.com/ggerganov/llama.cpp/pull/5361
	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#perplexity-measuring-model-quality
https://huggingface.co/docs/transformers/perplexity	https://huggingface.co/docs/transformers/perplexity
https://paperswithcode.com/dataset/wikitext-2	https://paperswithcode.com/dataset/wikitext-2
	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#how-to-run
https://huggingface.co/datasets/ggml-org/ci/resolve/main/wikitext-2-raw-v1.zip	https://huggingface.co/datasets/ggml-org/ci/resolve/main/wikitext-2-raw-v1.zip
	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#interactive-mode
README	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/examples/main/README.md
	https://user-images.githubusercontent.com/1991296/224575029-2af3c7dc-5a65-4f64-a6bb-517a532aea38.png
	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#persistent-interaction
	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#constrained-output-with-grammars
GBNF Guide	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/grammars/README.md
https://grammar.intrinsiclabs.ai/	https://grammar.intrinsiclabs.ai/
its repo	http://github.com/intrinsiclabsai/gbnfgen
	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#instruct-mode
	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#obtaining-and-using-the-facebook-llama-2-model
Facebook's LLaMA download page	https://ai.meta.com/resources/models-and-libraries/llama-downloads/
TheBloke	https://huggingface.co/TheBloke
LLaMA 2 7B base	https://huggingface.co/TheBloke/Llama-2-7B-GGUF
LLaMA 2 13B base	https://huggingface.co/TheBloke/Llama-2-13B-GGUF
LLaMA 2 70B base	https://huggingface.co/TheBloke/Llama-2-70B-GGUF
LLaMA 2 7B chat	https://huggingface.co/TheBloke/Llama-2-7B-chat-GGUF
LLaMA 2 13B chat	https://huggingface.co/TheBloke/Llama-2-13B-chat-GGUF
LLaMA 2 70B chat	https://huggingface.co/TheBloke/Llama-2-70B-chat-GGUF
	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#seminal-papers-and-background-on-the-models
Introducing LLaMA: A foundational, 65-billion-parameter large language model	https://ai.facebook.com/blog/large-language-model-llama-meta-ai/
LLaMA: Open and Efficient Foundation Language Models	https://arxiv.org/abs/2302.13971
Language Models are Few-Shot Learners	https://arxiv.org/abs/2005.14165
Aligning language models to follow instructions	https://openai.com/research/instruction-following
Training language models to follow instructions with human feedback	https://arxiv.org/abs/2203.02155
	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#android
	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#building-the-project-using-android-ndk
termux	https://termux.dev/
Android NDK	https://developer.android.com/ndk
termux	https://termux.dev/
llama-2-7b-chat.Q4_K_M.gguf	https://huggingface.co/TheBloke/Llama-2-7B-Chat-GGUF/blob/main/llama-2-7b-chat.Q4_K_M.gguf
	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#building-the-project-using-termux-f-droid
https://github.com/CNugteren/CLBlast	https://github.com/CNugteren/CLBlast
https://www.reddit.com/r/termux/comments/kc3ynp/opencl_working_in_termux_more_in_comments/	https://www.reddit.com/r/termux/comments/kc3ynp/opencl_working_in_termux_more_in_comments/
	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#docker
	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#prerequisites
	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#images
.devops/	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/.devops
.github/workflows/docker.yml	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/.github/workflows/docker.yml
	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#usage-1
	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#docker-with-cuda
nvidia-container-toolkit	https://github.com/NVIDIA/nvidia-container-toolkit
	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#building-locally
	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#usage-2
	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#contributing
Inference at the edge	https://github.com/ggerganov/llama.cpp/discussions/205
Changelog podcast	https://changelog.com/podcast/532
	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#coding-guidelines
good first issues	https://github.com/ggerganov/llama.cpp/issues?q=is%3Aissue+is%3Aopen+label%3A%22good+first+issue%22
z = ggml_mul_mat(ctx, x, y)	https://github.com/ggerganov/llama.cpp/blob/880e352277fc017df4d5794f0c21c44e1eae2b84/ggml.h#L1058-L1064
	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#docs
main	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/examples/main/README.md
server	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/examples/server/README.md
jeopardy	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/examples/jeopardy/README.md
BLIS	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/docs/BLIS.md
Performance troubleshooting	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/docs/token_generation_performance_tips.md
GGML tips & tricks	https://github.com/ggerganov/llama.cpp/wiki/GGML-Tips-&-Tricks
GBNF grammars	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/grammars/README.md
Readme	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#readme-ov-file
MIT license	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#MIT-1-ov-file
Security policy	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#security-ov-file
Please reload this page	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp
Activity	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/activity
0 stars	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/stargazers
0 watching	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/watchers
0 forks	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/forks
Report repository	https://patch-diff.githubusercontent.com/contact/report-content?content_url=https%3A%2F%2Fgithub.com%2Fguyvdb%2Fllama.cpp&report=guyvdb+%28user%29
Releases	https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/releases
Packages 0	https://patch-diff.githubusercontent.com/users/guyvdb/packages?repo_name=llama.cpp
	https://github.com
Terms	https://docs.github.com/site-policy/github-terms/github-terms-of-service
Privacy	https://docs.github.com/site-policy/privacy-policies/github-privacy-statement
Security	https://github.com/security
Status	https://www.githubstatus.com/
Community	https://github.community/
Docs	https://docs.github.com/
Contact	https://support.github.com?tags=dotcom-footer

Viewport: width=device-width

URLs of crawlers that visited me.