René's URL Explorer Experiment


Title: GitHub - guyvdb/llama.cpp: Port of Facebook's LLaMA model in C/C++

Open Graph Title: GitHub - guyvdb/llama.cpp: Port of Facebook's LLaMA model in C/C++

X Title: GitHub - guyvdb/llama.cpp: Port of Facebook's LLaMA model in C/C++

Description: Port of Facebook's LLaMA model in C/C++. Contribute to guyvdb/llama.cpp development by creating an account on GitHub.

Open Graph Description: Port of Facebook's LLaMA model in C/C++. Contribute to guyvdb/llama.cpp development by creating an account on GitHub.

X Description: Port of Facebook's LLaMA model in C/C++. Contribute to guyvdb/llama.cpp development by creating an account on GitHub.

Opengraph URL: https://github.com/guyvdb/llama.cpp

X: @github

direct link

Domain: patch-diff.githubusercontent.com

route-pattern/:user_id/:repository
route-controllerfiles
route-actiondisambiguate
fetch-noncev2:6b3e472a-83d6-7a55-d7e6-78c31287922c
current-catalog-service-hashf3abb0cc802f3d7b95fc8762b94bdcb13bf39634c40c357301c4aa1d67a256fb
request-id8C22:340321:BD56DA4:F4BEC0E:6976AC30
html-safe-nonce67b3cd8b35f24b98ccf59b57b094cc2e2b07d565c704cf88c759b5da50b0c81b
visitor-payloadeyJyZWZlcnJlciI6IiIsInJlcXVlc3RfaWQiOiI4QzIyOjM0MDMyMTpCRDU2REE0OkY0QkVDMEU6Njk3NkFDMzAiLCJ2aXNpdG9yX2lkIjoiODk2OTExMDgzNzk4OTM4NzMxMiIsInJlZ2lvbl9lZGdlIjoiaWFkIiwicmVnaW9uX3JlbmRlciI6ImlhZCJ9
visitor-hmac0fdb7a6a9a753b91f06ec28908473e2d815f4fca68513cc744a5b87180452454
hovercard-subject-tagrepository:625166367
github-keyboard-shortcutsrepository,copilot
google-site-verificationApib7-x98H0j5cPqHWwSMm6dNU4GmODRoqxLiDzdx9I
octolytics-urlhttps://collector.github.com/github/collect
analytics-location//
fb:app_id1401488693436528
apple-itunes-appapp-id=1477376905, app-argument=https://github.com/guyvdb/llama.cpp
twitter:imagehttps://opengraph.githubassets.com/7e3ee2ac26e88481a4bf2821c5aa969fc8de70ef1b359a8b724d36a10b5d3503/guyvdb/llama.cpp
twitter:cardsummary_large_image
og:imagehttps://opengraph.githubassets.com/7e3ee2ac26e88481a4bf2821c5aa969fc8de70ef1b359a8b724d36a10b5d3503/guyvdb/llama.cpp
og:image:altPort of Facebook's LLaMA model in C/C++. Contribute to guyvdb/llama.cpp development by creating an account on GitHub.
og:image:width1200
og:image:height600
og:site_nameGitHub
og:typeobject
hostnamegithub.com
expected-hostnamegithub.com
None032152924a283b83384255d9489e7b93b54ba01da8d380b05ecd3953b3212411
turbo-cache-controlno-preview
go-importgithub.com/guyvdb/llama.cpp git https://github.com/guyvdb/llama.cpp.git
octolytics-dimension-user_id3418
octolytics-dimension-user_loginguyvdb
octolytics-dimension-repository_id625166367
octolytics-dimension-repository_nwoguyvdb/llama.cpp
octolytics-dimension-repository_publictrue
octolytics-dimension-repository_is_forktrue
octolytics-dimension-repository_parent_id612354784
octolytics-dimension-repository_parent_nwoggml-org/llama.cpp
octolytics-dimension-repository_network_root_id612354784
octolytics-dimension-repository_network_root_nwoggml-org/llama.cpp
turbo-body-classeslogged-out env-production page-responsive
disable-turbofalse
browser-stats-urlhttps://api.github.com/_private/browser/stats
browser-errors-urlhttps://api.github.com/_private/browser/errors
release5b577f6be6482e336e3c30e8daefa30144947b17
ui-targetfull
theme-color#1e2327
color-schemelight dark

Links:

Skip to contenthttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp#start-of-content
https://patch-diff.githubusercontent.com/
Sign in https://patch-diff.githubusercontent.com/login?return_to=https%3A%2F%2Fgithub.com%2Fguyvdb%2Fllama.cpp
GitHub CopilotWrite better code with AIhttps://github.com/features/copilot
GitHub SparkBuild and deploy intelligent appshttps://github.com/features/spark
GitHub ModelsManage and compare promptshttps://github.com/features/models
MCP RegistryNewIntegrate external toolshttps://github.com/mcp
ActionsAutomate any workflowhttps://github.com/features/actions
CodespacesInstant dev environmentshttps://github.com/features/codespaces
IssuesPlan and track workhttps://github.com/features/issues
Code ReviewManage code changeshttps://github.com/features/code-review
GitHub Advanced SecurityFind and fix vulnerabilitieshttps://github.com/security/advanced-security
Code securitySecure your code as you buildhttps://github.com/security/advanced-security/code-security
Secret protectionStop leaks before they starthttps://github.com/security/advanced-security/secret-protection
Why GitHubhttps://github.com/why-github
Documentationhttps://docs.github.com
Bloghttps://github.blog
Changeloghttps://github.blog/changelog
Marketplacehttps://github.com/marketplace
View all featureshttps://github.com/features
Enterpriseshttps://github.com/enterprise
Small and medium teamshttps://github.com/team
Startupshttps://github.com/enterprise/startups
Nonprofitshttps://github.com/solutions/industry/nonprofits
App Modernizationhttps://github.com/solutions/use-case/app-modernization
DevSecOpshttps://github.com/solutions/use-case/devsecops
DevOpshttps://github.com/solutions/use-case/devops
CI/CDhttps://github.com/solutions/use-case/ci-cd
View all use caseshttps://github.com/solutions/use-case
Healthcarehttps://github.com/solutions/industry/healthcare
Financial serviceshttps://github.com/solutions/industry/financial-services
Manufacturinghttps://github.com/solutions/industry/manufacturing
Governmenthttps://github.com/solutions/industry/government
View all industrieshttps://github.com/solutions/industry
View all solutionshttps://github.com/solutions
AIhttps://github.com/resources/articles?topic=ai
Software Developmenthttps://github.com/resources/articles?topic=software-development
DevOpshttps://github.com/resources/articles?topic=devops
Securityhttps://github.com/resources/articles?topic=security
View all topicshttps://github.com/resources/articles
Customer storieshttps://github.com/customer-stories
Events & webinarshttps://github.com/resources/events
Ebooks & reportshttps://github.com/resources/whitepapers
Business insightshttps://github.com/solutions/executive-insights
GitHub Skillshttps://skills.github.com
Documentationhttps://docs.github.com
Customer supporthttps://support.github.com
Community forumhttps://github.com/orgs/community/discussions
Trust centerhttps://github.com/trust-center
Partnershttps://github.com/partners
GitHub SponsorsFund open source developershttps://github.com/sponsors
Security Labhttps://securitylab.github.com
Maintainer Communityhttps://maintainers.github.com
Acceleratorhttps://github.com/accelerator
Archive Programhttps://archiveprogram.github.com
Topicshttps://github.com/topics
Trendinghttps://github.com/trending
Collectionshttps://github.com/collections
Enterprise platformAI-powered developer platformhttps://github.com/enterprise
GitHub Advanced SecurityEnterprise-grade security featureshttps://github.com/security/advanced-security
Copilot for BusinessEnterprise-grade AI featureshttps://github.com/features/copilot/copilot-business
Premium SupportEnterprise-grade 24/7 supporthttps://github.com/premium-support
Pricinghttps://github.com/pricing
Search syntax tipshttps://docs.github.com/search-github/github-code-search/understanding-github-code-search-syntax
documentationhttps://docs.github.com/search-github/github-code-search/understanding-github-code-search-syntax
Sign in https://patch-diff.githubusercontent.com/login?return_to=https%3A%2F%2Fgithub.com%2Fguyvdb%2Fllama.cpp
Sign up https://patch-diff.githubusercontent.com/signup?ref_cta=Sign+up&ref_loc=header+logged+out&ref_page=%2F%3Cuser-name%3E%2F%3Crepo-name%3E&source=header-repo&source_repo=guyvdb%2Fllama.cpp
Reloadhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp
Reloadhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp
Reloadhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp
guyvdb https://patch-diff.githubusercontent.com/guyvdb
llama.cpphttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp
ggml-org/llama.cpphttps://patch-diff.githubusercontent.com/ggml-org/llama.cpp
Notifications https://patch-diff.githubusercontent.com/login?return_to=%2Fguyvdb%2Fllama.cpp
Fork 0 https://patch-diff.githubusercontent.com/login?return_to=%2Fguyvdb%2Fllama.cpp
Star 0 https://patch-diff.githubusercontent.com/login?return_to=%2Fguyvdb%2Fllama.cpp
MIT license https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/LICENSE
0 stars https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/stargazers
14.6k forks https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/forks
Branches https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/branches
Tags https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/tags
Activity https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/activity
Star https://patch-diff.githubusercontent.com/login?return_to=%2Fguyvdb%2Fllama.cpp
Notifications https://patch-diff.githubusercontent.com/login?return_to=%2Fguyvdb%2Fllama.cpp
Code https://patch-diff.githubusercontent.com/guyvdb/llama.cpp
Pull requests 0 https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/pulls
Actions https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/actions
Projects 0 https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/projects
Security 0 https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/security
Insights https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/pulse
Code https://patch-diff.githubusercontent.com/guyvdb/llama.cpp
Pull requests https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/pulls
Actions https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/actions
Projects https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/projects
Security https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/security
Insights https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/pulse
Brancheshttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/branches
Tagshttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/tags
https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/branches
https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/tags
2,611 Commitshttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/commits/master/
https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/commits/master/
.devopshttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/tree/master/.devops
.devopshttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/tree/master/.devops
.githubhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/tree/master/.github
.githubhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/tree/master/.github
cihttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/tree/master/ci
cihttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/tree/master/ci
cmakehttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/tree/master/cmake
cmakehttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/tree/master/cmake
commonhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/tree/master/common
commonhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/tree/master/common
docshttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/tree/master/docs
docshttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/tree/master/docs
exampleshttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/tree/master/examples
exampleshttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/tree/master/examples
ggml-cudahttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/tree/master/ggml-cuda
ggml-cudahttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/tree/master/ggml-cuda
gguf-pyhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/tree/master/gguf-py
gguf-pyhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/tree/master/gguf-py
grammarshttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/tree/master/grammars
grammarshttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/tree/master/grammars
kompute @ 4565194https://patch-diff.githubusercontent.com/nomic-ai/kompute/tree/4565194ed7c32d1d2efa32ceab4d3c6cae006306
kompute @ 4565194https://patch-diff.githubusercontent.com/nomic-ai/kompute/tree/4565194ed7c32d1d2efa32ceab4d3c6cae006306
kompute-shadershttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/tree/master/kompute-shaders
kompute-shadershttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/tree/master/kompute-shaders
mediahttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/tree/master/media
mediahttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/tree/master/media
modelshttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/tree/master/models
modelshttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/tree/master/models
pocshttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/tree/master/pocs
pocshttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/tree/master/pocs
promptshttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/tree/master/prompts
promptshttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/tree/master/prompts
requirementshttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/tree/master/requirements
requirementshttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/tree/master/requirements
scriptshttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/tree/master/scripts
scriptshttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/tree/master/scripts
spm-headershttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/tree/master/spm-headers
spm-headershttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/tree/master/spm-headers
testshttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/tree/master/tests
testshttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/tree/master/tests
.clang-tidyhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/.clang-tidy
.clang-tidyhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/.clang-tidy
.dockerignorehttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/.dockerignore
.dockerignorehttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/.dockerignore
.ecrchttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/.ecrc
.ecrchttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/.ecrc
.editorconfighttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/.editorconfig
.editorconfighttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/.editorconfig
.flake8https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/.flake8
.flake8https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/.flake8
.gitignorehttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/.gitignore
.gitignorehttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/.gitignore
.gitmoduleshttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/.gitmodules
.gitmoduleshttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/.gitmodules
.pre-commit-config.yamlhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/.pre-commit-config.yaml
.pre-commit-config.yamlhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/.pre-commit-config.yaml
CMakeLists.txthttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/CMakeLists.txt
CMakeLists.txthttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/CMakeLists.txt
LICENSEhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/LICENSE
LICENSEhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/LICENSE
Makefilehttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/Makefile
Makefilehttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/Makefile
Package.swifthttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/Package.swift
Package.swifthttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/Package.swift
README-sycl.mdhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/README-sycl.md
README-sycl.mdhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/README-sycl.md
README.mdhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/README.md
README.mdhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/README.md
SECURITY.mdhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/SECURITY.md
SECURITY.mdhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/SECURITY.md
build.zighttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/build.zig
build.zighttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/build.zig
codecov.ymlhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/codecov.yml
codecov.ymlhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/codecov.yml
convert-hf-to-gguf.pyhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/convert-hf-to-gguf.py
convert-hf-to-gguf.pyhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/convert-hf-to-gguf.py
convert-llama-ggml-to-gguf.pyhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/convert-llama-ggml-to-gguf.py
convert-llama-ggml-to-gguf.pyhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/convert-llama-ggml-to-gguf.py
convert-lora-to-ggml.pyhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/convert-lora-to-ggml.py
convert-lora-to-ggml.pyhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/convert-lora-to-ggml.py
convert-persimmon-to-gguf.pyhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/convert-persimmon-to-gguf.py
convert-persimmon-to-gguf.pyhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/convert-persimmon-to-gguf.py
convert.pyhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/convert.py
convert.pyhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/convert.py
flake.lockhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/flake.lock
flake.lockhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/flake.lock
flake.nixhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/flake.nix
flake.nixhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/flake.nix
ggml-alloc.chttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-alloc.c
ggml-alloc.chttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-alloc.c
ggml-alloc.hhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-alloc.h
ggml-alloc.hhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-alloc.h
ggml-backend-impl.hhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-backend-impl.h
ggml-backend-impl.hhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-backend-impl.h
ggml-backend.chttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-backend.c
ggml-backend.chttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-backend.c
ggml-backend.hhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-backend.h
ggml-backend.hhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-backend.h
ggml-common.hhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-common.h
ggml-common.hhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-common.h
ggml-cuda.cuhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-cuda.cu
ggml-cuda.cuhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-cuda.cu
ggml-cuda.hhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-cuda.h
ggml-cuda.hhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-cuda.h
ggml-impl.hhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-impl.h
ggml-impl.hhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-impl.h
ggml-kompute.cpphttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-kompute.cpp
ggml-kompute.cpphttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-kompute.cpp
ggml-kompute.hhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-kompute.h
ggml-kompute.hhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-kompute.h
ggml-metal.hhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-metal.h
ggml-metal.hhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-metal.h
ggml-metal.mhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-metal.m
ggml-metal.mhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-metal.m
ggml-metal.metalhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-metal.metal
ggml-metal.metalhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-metal.metal
ggml-mpi.chttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-mpi.c
ggml-mpi.chttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-mpi.c
ggml-mpi.hhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-mpi.h
ggml-mpi.hhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-mpi.h
ggml-opencl.cpphttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-opencl.cpp
ggml-opencl.cpphttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-opencl.cpp
ggml-opencl.hhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-opencl.h
ggml-opencl.hhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-opencl.h
ggml-quants.chttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-quants.c
ggml-quants.chttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-quants.c
ggml-quants.hhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-quants.h
ggml-quants.hhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-quants.h
ggml-sycl.cpphttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-sycl.cpp
ggml-sycl.cpphttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-sycl.cpp
ggml-sycl.hhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-sycl.h
ggml-sycl.hhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-sycl.h
ggml-vulkan-shaders.hpphttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-vulkan-shaders.hpp
ggml-vulkan-shaders.hpphttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-vulkan-shaders.hpp
ggml-vulkan.cpphttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-vulkan.cpp
ggml-vulkan.cpphttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-vulkan.cpp
ggml-vulkan.hhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-vulkan.h
ggml-vulkan.hhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml-vulkan.h
ggml.chttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml.c
ggml.chttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml.c
ggml.hhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml.h
ggml.hhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml.h
ggml_vk_generate_shaders.pyhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml_vk_generate_shaders.py
ggml_vk_generate_shaders.pyhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/ggml_vk_generate_shaders.py
llama.cpphttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/llama.cpp
llama.cpphttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/llama.cpp
llama.hhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/llama.h
llama.hhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/llama.h
mypy.inihttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/mypy.ini
mypy.inihttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/mypy.ini
requirements.txthttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/requirements.txt
requirements.txthttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/requirements.txt
unicode-data.cpphttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/unicode-data.cpp
unicode-data.cpphttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/unicode-data.cpp
unicode-data.hhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/unicode-data.h
unicode-data.hhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/unicode-data.h
unicode.cpphttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/unicode.cpp
unicode.cpphttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/unicode.cpp
unicode.hhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/unicode.h
unicode.hhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/unicode.h
READMEhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp
Licensehttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp
Securityhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp
https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#llamacpp
https://user-images.githubusercontent.com/1991296/230134379-7181e485-c521-4d23-a0d6-f7b3b61ba524.png
https://opensource.org/licenses/MIT
Roadmaphttps://github.com/users/ggerganov/projects/7
Project statushttps://github.com/ggerganov/llama.cpp/discussions/3471
Manifestohttps://github.com/ggerganov/llama.cpp/discussions/205
ggmlhttps://github.com/ggerganov/ggml
LLaMAhttps://arxiv.org/abs/2302.13971
https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#recent-api-changes
ggml-org#6122https://github.com/ggml-org/llama.cpp/pull/6122
ggml-org#6017https://github.com/ggml-org/llama.cpp/pull/6017
ggml-org#5328https://github.com/ggml-org/llama.cpp/pull/5328
ggml-org#5796https://github.com/ggml-org/llama.cpp/pull/5796
ggml-org#5849https://github.com/ggml-org/llama.cpp/pull/5849
https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#hot-topics
ggml-org#6387https://github.com/ggml-org/llama.cpp/pull/6387
ggml-org#6404https://github.com/ggml-org/llama.cpp/discussions/6404
ggml-org#6225https://github.com/ggml-org/llama.cpp/pull/6225
ggml-org#6017https://github.com/ggml-org/llama.cpp/pull/6017
ggml-org#5981https://github.com/ggml-org/llama.cpp/issues/5981
ggml-org#5962https://github.com/ggml-org/llama.cpp/discussions/5962
ggml-org#5328https://github.com/ggml-org/llama.cpp/pull/5328
Descriptionhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp#description
Usagehttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp#usage
Get the Codehttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp#get-the-code
Buildhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp#build
BLAS Buildhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp#blas-build
Prepare and Quantizehttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp#prepare-and-quantize
Run the quantized modelhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp#run-the-quantized-model
Memory/Disk Requirementshttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp#memorydisk-requirements
Quantizationhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp#quantization
Interactive modehttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp#interactive-mode
Constrained output with grammarshttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp#constrained-output-with-grammars
Instruct modehttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp#instruct-mode
Obtaining and using the Facebook LLaMA 2 modelhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp#obtaining-and-using-the-facebook-llama-2-model
Seminal papers and background on the modelshttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp#seminal-papers-and-background-on-the-models
Perplexity (measuring model quality)https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#perplexity-measuring-model-quality
Androidhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp#android
Dockerhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp#docker
Contributinghttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp#contributing
Coding guidelineshttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp#coding-guidelines
Docshttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp#docs
https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#description
inceptionhttps://github.com/ggerganov/llama.cpp/issues/33#issuecomment-1465108022
ggmlhttps://github.com/ggerganov/ggml
Mistral 7Bhttps://huggingface.co/mistralai/Mistral-7B-v0.1
Mixtral MoEhttps://huggingface.co/models?search=mistral-ai/Mixtral
Chinese LLaMA / Alpacahttps://github.com/ymcui/Chinese-LLaMA-Alpaca
Chinese LLaMA-2 / Alpaca-2https://github.com/ymcui/Chinese-LLaMA-Alpaca-2
Vigogne (French)https://github.com/bofenghuang/vigogne
Koalahttps://bair.berkeley.edu/blog/2023/04/03/koala/
Baichuan 1 & 2https://huggingface.co/models?search=baichuan-inc/Baichuan
derivationshttps://huggingface.co/hiyouga/baichuan-7b-sft
Aquila 1 & 2https://huggingface.co/models?search=BAAI/Aquila
Starcoder modelshttps://github.com/ggerganov/llama.cpp/pull/3187
Refacthttps://huggingface.co/smallcloudai/Refact-1_6B-fim
Persimmon 8Bhttps://github.com/ggerganov/llama.cpp/pull/3410
MPThttps://github.com/ggerganov/llama.cpp/pull/3417
Bloomhttps://github.com/ggerganov/llama.cpp/pull/3553
Yi modelshttps://huggingface.co/models?search=01-ai/Yi
StableLM modelshttps://huggingface.co/stabilityai
Deepseek modelshttps://huggingface.co/models?search=deepseek-ai/deepseek
Qwen modelshttps://huggingface.co/models?search=Qwen/Qwen
PLaMo-13Bhttps://github.com/ggerganov/llama.cpp/pull/3557
Phi modelshttps://huggingface.co/models?search=microsoft/phi
GPT-2https://huggingface.co/gpt2
Orion 14Bhttps://github.com/ggerganov/llama.cpp/pull/5118
InternLM2https://huggingface.co/models?search=internlm2
CodeShellhttps://github.com/WisdomShell/codeshell
Gemmahttps://ai.google.dev/gemma
Mambahttps://github.com/state-spaces/mamba
Xversehttps://huggingface.co/models?search=xverse
Command-Rhttps://huggingface.co/CohereForAI/c4ai-command-r-v01
SEA-LIONhttps://huggingface.co/models?search=sea-lion
LLaVA 1.5 modelshttps://huggingface.co/collections/liuhaotian/llava-15-653aac15d994e992e2677a7e
LLaVA 1.6 modelshttps://huggingface.co/collections/liuhaotian/llava-16-65b9e40155f60fd046a5ccf2
BakLLaVAhttps://huggingface.co/models?search=SkunkworksAI/Bakllava
Obsidianhttps://huggingface.co/NousResearch/Obsidian-3B-V0.5
ShareGPT4Vhttps://huggingface.co/models?search=Lin-Chen/ShareGPT4V
MobileVLM 1.7B/3B modelshttps://huggingface.co/models?search=mobileVLM
Yi-VLhttps://huggingface.co/models?search=Yi-VL
llama.cpp web serverhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/examples/server
OpenAI APIhttps://github.com/openai/openai-openapi
abetlen/llama-cpp-pythonhttps://github.com/abetlen/llama-cpp-python
go-skynet/go-llama.cpphttps://github.com/go-skynet/go-llama.cpp
withcatai/node-llama-cpphttps://github.com/withcatai/node-llama-cpp
lgrammel/modelfusionhttps://modelfusion.dev/integration/model-provider/llamacpp
tangledgroup/llama-cpp-wasmhttps://github.com/tangledgroup/llama-cpp-wasm
ngxson/wllamahttps://github.com/ngxson/wllama
yoshoku/llama_cpp.rbhttps://github.com/yoshoku/llama_cpp.rb
edgenai/llama_cpp-rshttps://github.com/edgenai/llama_cpp-rs
mdrokz/rust-llama.cpphttps://github.com/mdrokz/rust-llama.cpp
utilityai/llama-cpp-rshttps://github.com/utilityai/llama-cpp-rs
SciSharp/LLamaSharphttps://github.com/SciSharp/LLamaSharp
donderom/llm4shttps://github.com/donderom/llm4s
phronmophobic/llama.cljhttps://github.com/phronmophobic/llama.clj
mybigday/llama.rnhttps://github.com/mybigday/llama.rn
kherud/java-llama.cpphttps://github.com/kherud/java-llama.cpp
deins/llama.cpp.zighttps://github.com/Deins/llama.cpp.zig
netdur/llama_cpp_darthttps://github.com/netdur/llama_cpp_dart
distantmagic/resonancehttps://github.com/distantmagic/resonance
(more info)https://github.com/ggerganov/llama.cpp/pull/6326
iohub/collamahttps://github.com/iohub/coLLaMA
janhq/janhttps://github.com/janhq/jan
nat/openplaygroundhttps://github.com/nat/openplayground
Faradayhttps://faraday.dev/
LMStudiohttps://lmstudio.ai/
LocalAIhttps://github.com/mudler/LocalAI
LostRuins/koboldcpphttps://github.com/LostRuins/koboldcpp
Mozilla-Ocho/llamafilehttps://github.com/Mozilla-Ocho/llamafile
nomic-ai/gpt4allhttps://github.com/nomic-ai/gpt4all
ollama/ollamahttps://github.com/ollama/ollama
oobabooga/text-generation-webuihttps://github.com/oobabooga/text-generation-webui
psugihara/FreeChathttps://github.com/psugihara/FreeChat
cztomsik/avahttps://github.com/cztomsik/ava
ptsochantaris/emeltalhttps://github.com/ptsochantaris/emeltal
pythops/tenerehttps://github.com/pythops/tenere
RecurseChathttps://recurse.chat/
semperai/amicahttps://github.com/semperai/amica
withcatai/cataihttps://github.com/withcatai/catai
Mobile-Artificial-Intelligence/maidhttps://github.com/Mobile-Artificial-Intelligence/maid
Mstyhttps://msty.app
LLMFarmhttps://github.com/guinmoon/LLMFarm?tab=readme-ov-file
KanTVhttps://github.com/zhouwg/kantv?tab=readme-ov-file
Dothttps://github.com/alexpinel/Dot
whisper.cpphttps://github.com/ggerganov/whisper.cpp
https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#usage
https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#get-the-code
https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#build
w64devkithttps://github.com/skeeto/w64devkit/releases
DRM in FreeBSDhttps://wiki.freebsd.org/Graphics
https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#metal-build
https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#mpi-build
MPICHhttps://www.mpich.org
OpenMPIhttps://www.open-mpi.org
https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#blas-build
https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#accelerate-framework
https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#openblas
w64devkithttps://github.com/skeeto/w64devkit/releases
OpenBLAS for Windowshttps://github.com/xianyi/OpenBLAS/releases
https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#blis
BLIS.mdhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/docs/BLIS.md
https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#sycl
llama.cpp for SYCLhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/README-sycl.md
https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#intel-onemkl
llama.cpp for SYCLhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/README-sycl.md
oneAPI-basekithttps://hub.docker.com/r/intel/oneapi-basekit
Optimizing and Running LLaMA2 on Intel® CPUhttps://www.intel.com/content/www/us/en/content-details/791610/optimizing-and-running-llama2-on-intel-cpu.html
https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#cuda
CUDA Toolkithttps://developer.nvidia.com/cuda-downloads
Offical Supporthttps://www.jetson-ai-lab.com/tutorial_text-generation.html
CUDA_VISIBLE_DEVICEShttps://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html#env-vars
https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#hipblas
ROCm Quick Start (Linux)https://rocm.docs.amd.com/en/latest/deploy/linux/quick_start.html
herehttps://llvm.org/docs/AMDGPUUsage.html#processors
HIP_VISIBLE_DEVICEShttps://rocm.docs.amd.com/en/latest/understand/gpu_isolation.html#hip-visible-devices
https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#clblast
CLBlasthttps://github.com/CNugteren/CLBlast
OpenCL SDKhttps://github.com/KhronosGroup/OpenCL-SDK
OpenCL Releaseshttps://github.com/KhronosGroup/OpenCL-SDK/releases
https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#installing-clblast
CLBlast Releaseshttps://github.com/CNugteren/CLBlast/releases
https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#building-llama-with-clblast
https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#running-llama-with-clblast
https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#vulkan
Vulkan SDKhttps://vulkan.lunarg.com/doc/view/latest/linux/getting_started_ubuntu.html
https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#prepare-and-quantize
Obtaining and using the Facebook LLaMA 2 modelhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp#obtaining-and-using-the-facebook-llama-2-model
https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#run-the-quantized-model
https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#running-on-windows-with-prebuilt-binaries
https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#memorydisk-requirements
https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#quantization
k-quantshttps://github.com/ggerganov/llama.cpp/pull/1684
#2707https://github.com/ggerganov/llama.cpp/pull/2707
#2807https://github.com/ggerganov/llama.cpp/pull/2807
#4773 - 2-bit i-quants (inference)https://github.com/ggerganov/llama.cpp/pull/4773
#4856 - 2-bit i-quants (inference)https://github.com/ggerganov/llama.cpp/pull/4856
#4861 - importance matrixhttps://github.com/ggerganov/llama.cpp/pull/4861
#4872 - MoE modelshttps://github.com/ggerganov/llama.cpp/pull/4872
#4897 - 2-bit quantizationhttps://github.com/ggerganov/llama.cpp/pull/4897
#4930 - imatrix for all k-quantshttps://github.com/ggerganov/llama.cpp/pull/4930
#4951 - imatrix on the GPUhttps://github.com/ggerganov/llama.cpp/pull/4957
#4969 - imatrix for legacy quantshttps://github.com/ggerganov/llama.cpp/pull/4969
#4996 - k-qunats tuninghttps://github.com/ggerganov/llama.cpp/pull/4996
#5060 - Q3_K_XShttps://github.com/ggerganov/llama.cpp/pull/5060
#5196 - 3-bit i-quantshttps://github.com/ggerganov/llama.cpp/pull/5196
quantization tuninghttps://github.com/ggerganov/llama.cpp/pull/5320
another onehttps://github.com/ggerganov/llama.cpp/pull/5334
another onehttps://github.com/ggerganov/llama.cpp/pull/5361
https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#perplexity-measuring-model-quality
https://huggingface.co/docs/transformers/perplexityhttps://huggingface.co/docs/transformers/perplexity
https://paperswithcode.com/dataset/wikitext-2https://paperswithcode.com/dataset/wikitext-2
https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#how-to-run
https://huggingface.co/datasets/ggml-org/ci/resolve/main/wikitext-2-raw-v1.ziphttps://huggingface.co/datasets/ggml-org/ci/resolve/main/wikitext-2-raw-v1.zip
https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#interactive-mode
READMEhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/examples/main/README.md
https://user-images.githubusercontent.com/1991296/224575029-2af3c7dc-5a65-4f64-a6bb-517a532aea38.png
https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#persistent-interaction
https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#constrained-output-with-grammars
GBNF Guidehttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/grammars/README.md
https://grammar.intrinsiclabs.ai/https://grammar.intrinsiclabs.ai/
its repohttp://github.com/intrinsiclabsai/gbnfgen
https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#instruct-mode
https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#obtaining-and-using-the-facebook-llama-2-model
Facebook's LLaMA download pagehttps://ai.meta.com/resources/models-and-libraries/llama-downloads/
TheBlokehttps://huggingface.co/TheBloke
LLaMA 2 7B basehttps://huggingface.co/TheBloke/Llama-2-7B-GGUF
LLaMA 2 13B basehttps://huggingface.co/TheBloke/Llama-2-13B-GGUF
LLaMA 2 70B basehttps://huggingface.co/TheBloke/Llama-2-70B-GGUF
LLaMA 2 7B chathttps://huggingface.co/TheBloke/Llama-2-7B-chat-GGUF
LLaMA 2 13B chathttps://huggingface.co/TheBloke/Llama-2-13B-chat-GGUF
LLaMA 2 70B chathttps://huggingface.co/TheBloke/Llama-2-70B-chat-GGUF
https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#seminal-papers-and-background-on-the-models
Introducing LLaMA: A foundational, 65-billion-parameter large language modelhttps://ai.facebook.com/blog/large-language-model-llama-meta-ai/
LLaMA: Open and Efficient Foundation Language Modelshttps://arxiv.org/abs/2302.13971
Language Models are Few-Shot Learnershttps://arxiv.org/abs/2005.14165
Aligning language models to follow instructionshttps://openai.com/research/instruction-following
Training language models to follow instructions with human feedbackhttps://arxiv.org/abs/2203.02155
https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#android
https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#building-the-project-using-android-ndk
termuxhttps://termux.dev/
Android NDKhttps://developer.android.com/ndk
termuxhttps://termux.dev/
llama-2-7b-chat.Q4_K_M.ggufhttps://huggingface.co/TheBloke/Llama-2-7B-Chat-GGUF/blob/main/llama-2-7b-chat.Q4_K_M.gguf
https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#building-the-project-using-termux-f-droid
https://github.com/CNugteren/CLBlasthttps://github.com/CNugteren/CLBlast
https://www.reddit.com/r/termux/comments/kc3ynp/opencl_working_in_termux_more_in_comments/https://www.reddit.com/r/termux/comments/kc3ynp/opencl_working_in_termux_more_in_comments/
https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#docker
https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#prerequisites
https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#images
.devops/https://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/.devops
.github/workflows/docker.ymlhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/.github/workflows/docker.yml
https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#usage-1
https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#docker-with-cuda
nvidia-container-toolkithttps://github.com/NVIDIA/nvidia-container-toolkit
https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#building-locally
https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#usage-2
https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#contributing
Inference at the edgehttps://github.com/ggerganov/llama.cpp/discussions/205
Changelog podcasthttps://changelog.com/podcast/532
https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#coding-guidelines
good first issueshttps://github.com/ggerganov/llama.cpp/issues?q=is%3Aissue+is%3Aopen+label%3A%22good+first+issue%22
z = ggml_mul_mat(ctx, x, y)https://github.com/ggerganov/llama.cpp/blob/880e352277fc017df4d5794f0c21c44e1eae2b84/ggml.h#L1058-L1064
https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#docs
mainhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/examples/main/README.md
serverhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/examples/server/README.md
jeopardyhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/examples/jeopardy/README.md
BLIShttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/docs/BLIS.md
Performance troubleshootinghttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/docs/token_generation_performance_tips.md
GGML tips & trickshttps://github.com/ggerganov/llama.cpp/wiki/GGML-Tips-&-Tricks
GBNF grammarshttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/blob/master/grammars/README.md
Readme https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#readme-ov-file
MIT license https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#MIT-1-ov-file
Security policy https://patch-diff.githubusercontent.com/guyvdb/llama.cpp#security-ov-file
Please reload this pagehttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp
Activityhttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/activity
0 starshttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/stargazers
0 watchinghttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/watchers
0 forkshttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/forks
Report repository https://patch-diff.githubusercontent.com/contact/report-content?content_url=https%3A%2F%2Fgithub.com%2Fguyvdb%2Fllama.cpp&report=guyvdb+%28user%29
Releaseshttps://patch-diff.githubusercontent.com/guyvdb/llama.cpp/releases
Packages 0https://patch-diff.githubusercontent.com/users/guyvdb/packages?repo_name=llama.cpp
https://github.com
Termshttps://docs.github.com/site-policy/github-terms/github-terms-of-service
Privacyhttps://docs.github.com/site-policy/privacy-policies/github-privacy-statement
Securityhttps://github.com/security
Statushttps://www.githubstatus.com/
Communityhttps://github.community/
Docshttps://docs.github.com/
Contacthttps://support.github.com?tags=dotcom-footer

Viewport: width=device-width


URLs of crawlers that visited me.