René's URL Explorer Experiment


Title: GitHub - tmc/llama.cpp at server-parallel

Open Graph Title: GitHub - tmc/llama.cpp at server-parallel

X Title: GitHub - tmc/llama.cpp at server-parallel

Description: Port of Facebook's LLaMA model in C/C++. Contribute to tmc/llama.cpp development by creating an account on GitHub.

Open Graph Description: Port of Facebook's LLaMA model in C/C++. Contribute to tmc/llama.cpp development by creating an account on GitHub.

X Description: Port of Facebook's LLaMA model in C/C++. Contribute to tmc/llama.cpp development by creating an account on GitHub.

Opengraph URL: https://github.com/tmc/llama.cpp

X: @github

direct link

Domain: patch-diff.githubusercontent.com

route-pattern/:user_id/:repository/tree/*name(/*path)
route-controllerfiles
route-actiondisambiguate
fetch-noncev2:0fc364bf-a7a6-0fbf-4428-52f4275098a4
current-catalog-service-hashf3abb0cc802f3d7b95fc8762b94bdcb13bf39634c40c357301c4aa1d67a256fb
request-idA98C:3172B9:F16F5:153883:6978654C
html-safe-nonceba16bb7f211e51aef1f28dd8c1a2c91aa3454ac89a078e9f00fcaf9730f0125d
visitor-payloadeyJyZWZlcnJlciI6IiIsInJlcXVlc3RfaWQiOiJBOThDOjMxNzJCOTpGMTZGNToxNTM4ODM6Njk3ODY1NEMiLCJ2aXNpdG9yX2lkIjoiMjUxOTk0NTg5MzczODIxMDYzNiIsInJlZ2lvbl9lZGdlIjoiaWFkIiwicmVnaW9uX3JlbmRlciI6ImlhZCJ9
visitor-hmacf6b0a3fa8b0935bfdfacdd6063775baf0008b24535b68def908e9b28f68b18cf
hovercard-subject-tagrepository:730934953
github-keyboard-shortcutsrepository,source-code,file-tree,copilot
google-site-verificationApib7-x98H0j5cPqHWwSMm6dNU4GmODRoqxLiDzdx9I
octolytics-urlhttps://collector.github.com/github/collect
analytics-location///files/disambiguate
fb:app_id1401488693436528
apple-itunes-appapp-id=1477376905, app-argument=https://github.com/tmc/llama.cpp/tree/server-parallel
twitter:imagehttps://opengraph.githubassets.com/1d4f9be40f823244cb0e340878d33f7fd109c4cdfeb150f26d5baa56de844ab8/tmc/llama.cpp
twitter:cardsummary_large_image
og:imagehttps://opengraph.githubassets.com/1d4f9be40f823244cb0e340878d33f7fd109c4cdfeb150f26d5baa56de844ab8/tmc/llama.cpp
og:image:altPort of Facebook's LLaMA model in C/C++. Contribute to tmc/llama.cpp development by creating an account on GitHub.
og:image:width1200
og:image:height600
og:site_nameGitHub
og:typeobject
hostnamegithub.com
expected-hostnamegithub.com
None2981c597c945c1d90ac6fa355ce7929b2f413dfe7872ca5c435ee53a24a1de50
turbo-cache-controlno-preview
go-importgithub.com/tmc/llama.cpp git https://github.com/tmc/llama.cpp.git
octolytics-dimension-user_id3977
octolytics-dimension-user_logintmc
octolytics-dimension-repository_id730934953
octolytics-dimension-repository_nwotmc/llama.cpp
octolytics-dimension-repository_publictrue
octolytics-dimension-repository_is_forktrue
octolytics-dimension-repository_parent_id612354784
octolytics-dimension-repository_parent_nwoggml-org/llama.cpp
octolytics-dimension-repository_network_root_id612354784
octolytics-dimension-repository_network_root_nwoggml-org/llama.cpp
turbo-body-classeslogged-out env-production page-responsive
disable-turbofalse
browser-stats-urlhttps://api.github.com/_private/browser/stats
browser-errors-urlhttps://api.github.com/_private/browser/errors
release520b65a872113b919c1bbdb03834a50af15859fd
ui-targetfull
theme-color#1e2327
color-schemelight dark

Links:

Skip to contenthttps://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel#start-of-content
https://patch-diff.githubusercontent.com/
Sign in https://patch-diff.githubusercontent.com/login?return_to=https%3A%2F%2Fgithub.com%2Ftmc%2Fllama.cpp%2Ftree%2Fserver-parallel
GitHub CopilotWrite better code with AIhttps://github.com/features/copilot
GitHub SparkBuild and deploy intelligent appshttps://github.com/features/spark
GitHub ModelsManage and compare promptshttps://github.com/features/models
MCP RegistryNewIntegrate external toolshttps://github.com/mcp
ActionsAutomate any workflowhttps://github.com/features/actions
CodespacesInstant dev environmentshttps://github.com/features/codespaces
IssuesPlan and track workhttps://github.com/features/issues
Code ReviewManage code changeshttps://github.com/features/code-review
GitHub Advanced SecurityFind and fix vulnerabilitieshttps://github.com/security/advanced-security
Code securitySecure your code as you buildhttps://github.com/security/advanced-security/code-security
Secret protectionStop leaks before they starthttps://github.com/security/advanced-security/secret-protection
Why GitHubhttps://github.com/why-github
Documentationhttps://docs.github.com
Bloghttps://github.blog
Changeloghttps://github.blog/changelog
Marketplacehttps://github.com/marketplace
View all featureshttps://github.com/features
Enterpriseshttps://github.com/enterprise
Small and medium teamshttps://github.com/team
Startupshttps://github.com/enterprise/startups
Nonprofitshttps://github.com/solutions/industry/nonprofits
App Modernizationhttps://github.com/solutions/use-case/app-modernization
DevSecOpshttps://github.com/solutions/use-case/devsecops
DevOpshttps://github.com/solutions/use-case/devops
CI/CDhttps://github.com/solutions/use-case/ci-cd
View all use caseshttps://github.com/solutions/use-case
Healthcarehttps://github.com/solutions/industry/healthcare
Financial serviceshttps://github.com/solutions/industry/financial-services
Manufacturinghttps://github.com/solutions/industry/manufacturing
Governmenthttps://github.com/solutions/industry/government
View all industrieshttps://github.com/solutions/industry
View all solutionshttps://github.com/solutions
AIhttps://github.com/resources/articles?topic=ai
Software Developmenthttps://github.com/resources/articles?topic=software-development
DevOpshttps://github.com/resources/articles?topic=devops
Securityhttps://github.com/resources/articles?topic=security
View all topicshttps://github.com/resources/articles
Customer storieshttps://github.com/customer-stories
Events & webinarshttps://github.com/resources/events
Ebooks & reportshttps://github.com/resources/whitepapers
Business insightshttps://github.com/solutions/executive-insights
GitHub Skillshttps://skills.github.com
Documentationhttps://docs.github.com
Customer supporthttps://support.github.com
Community forumhttps://github.com/orgs/community/discussions
Trust centerhttps://github.com/trust-center
Partnershttps://github.com/partners
GitHub SponsorsFund open source developershttps://github.com/sponsors
Security Labhttps://securitylab.github.com
Maintainer Communityhttps://maintainers.github.com
Acceleratorhttps://github.com/accelerator
Archive Programhttps://archiveprogram.github.com
Topicshttps://github.com/topics
Trendinghttps://github.com/trending
Collectionshttps://github.com/collections
Enterprise platformAI-powered developer platformhttps://github.com/enterprise
GitHub Advanced SecurityEnterprise-grade security featureshttps://github.com/security/advanced-security
Copilot for BusinessEnterprise-grade AI featureshttps://github.com/features/copilot/copilot-business
Premium SupportEnterprise-grade 24/7 supporthttps://github.com/premium-support
Pricinghttps://github.com/pricing
Search syntax tipshttps://docs.github.com/search-github/github-code-search/understanding-github-code-search-syntax
documentationhttps://docs.github.com/search-github/github-code-search/understanding-github-code-search-syntax
Sign in https://patch-diff.githubusercontent.com/login?return_to=https%3A%2F%2Fgithub.com%2Ftmc%2Fllama.cpp%2Ftree%2Fserver-parallel
Sign up https://patch-diff.githubusercontent.com/signup?ref_cta=Sign+up&ref_loc=header+logged+out&ref_page=%2F%3Cuser-name%3E%2F%3Crepo-name%3E%2Ffiles%2Fdisambiguate&source=header-repo&source_repo=tmc%2Fllama.cpp
Reloadhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel
Reloadhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel
Reloadhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel
tmc https://patch-diff.githubusercontent.com/tmc
llama.cpphttps://patch-diff.githubusercontent.com/tmc/llama.cpp
ggml-org/llama.cpphttps://patch-diff.githubusercontent.com/ggml-org/llama.cpp
Notifications https://patch-diff.githubusercontent.com/login?return_to=%2Ftmc%2Fllama.cpp
Fork 0 https://patch-diff.githubusercontent.com/login?return_to=%2Ftmc%2Fllama.cpp
Star 1 https://patch-diff.githubusercontent.com/login?return_to=%2Ftmc%2Fllama.cpp
MIT license https://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/master/LICENSE
1 star https://patch-diff.githubusercontent.com/tmc/llama.cpp/stargazers
14.6k forks https://patch-diff.githubusercontent.com/tmc/llama.cpp/forks
Branches https://patch-diff.githubusercontent.com/tmc/llama.cpp/branches
Tags https://patch-diff.githubusercontent.com/tmc/llama.cpp/tags
Activity https://patch-diff.githubusercontent.com/tmc/llama.cpp/activity
Star https://patch-diff.githubusercontent.com/login?return_to=%2Ftmc%2Fllama.cpp
Notifications https://patch-diff.githubusercontent.com/login?return_to=%2Ftmc%2Fllama.cpp
Code https://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel
Pull requests 0 https://patch-diff.githubusercontent.com/tmc/llama.cpp/pulls
Actions https://patch-diff.githubusercontent.com/tmc/llama.cpp/actions
Projects 0 https://patch-diff.githubusercontent.com/tmc/llama.cpp/projects
Security 0 https://patch-diff.githubusercontent.com/tmc/llama.cpp/security
Insights https://patch-diff.githubusercontent.com/tmc/llama.cpp/pulse
Code https://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel
Pull requests https://patch-diff.githubusercontent.com/tmc/llama.cpp/pulls
Actions https://patch-diff.githubusercontent.com/tmc/llama.cpp/actions
Projects https://patch-diff.githubusercontent.com/tmc/llama.cpp/projects
Security https://patch-diff.githubusercontent.com/tmc/llama.cpp/security
Insights https://patch-diff.githubusercontent.com/tmc/llama.cpp/pulse
Brancheshttps://patch-diff.githubusercontent.com/tmc/llama.cpp/branches
Tagshttps://patch-diff.githubusercontent.com/tmc/llama.cpp/tags
https://patch-diff.githubusercontent.com/tmc/llama.cpp/branches
https://patch-diff.githubusercontent.com/tmc/llama.cpp/tags
1,334 Commitshttps://patch-diff.githubusercontent.com/tmc/llama.cpp/commits/server-parallel/
https://patch-diff.githubusercontent.com/tmc/llama.cpp/commits/server-parallel/
.devopshttps://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel/.devops
.devopshttps://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel/.devops
.githubhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel/.github
.githubhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel/.github
cihttps://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel/ci
cihttps://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel/ci
commonhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel/common
commonhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel/common
docshttps://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel/docs
docshttps://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel/docs
exampleshttps://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel/examples
exampleshttps://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel/examples
gguf-pyhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel/gguf-py
gguf-pyhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel/gguf-py
grammarshttps://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel/grammars
grammarshttps://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel/grammars
mediahttps://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel/media
mediahttps://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel/media
modelshttps://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel/models
modelshttps://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel/models
pocshttps://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel/pocs
pocshttps://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel/pocs
promptshttps://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel/prompts
promptshttps://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel/prompts
scriptshttps://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel/scripts
scriptshttps://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel/scripts
spm-headershttps://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel/spm-headers
spm-headershttps://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel/spm-headers
testshttps://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel/tests
testshttps://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel/tests
.clang-tidyhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/.clang-tidy
.clang-tidyhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/.clang-tidy
.dockerignorehttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/.dockerignore
.dockerignorehttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/.dockerignore
.ecrchttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/.ecrc
.ecrchttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/.ecrc
.editorconfighttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/.editorconfig
.editorconfighttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/.editorconfig
.flake8https://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/.flake8
.flake8https://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/.flake8
.gitignorehttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/.gitignore
.gitignorehttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/.gitignore
.pre-commit-config.yamlhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/.pre-commit-config.yaml
.pre-commit-config.yamlhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/.pre-commit-config.yaml
CMakeLists.txthttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/CMakeLists.txt
CMakeLists.txthttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/CMakeLists.txt
LICENSEhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/LICENSE
LICENSEhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/LICENSE
Makefilehttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/Makefile
Makefilehttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/Makefile
Package.swifthttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/Package.swift
Package.swifthttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/Package.swift
README.mdhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/README.md
README.mdhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/README.md
SHA256SUMShttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/SHA256SUMS
SHA256SUMShttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/SHA256SUMS
build.zighttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/build.zig
build.zighttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/build.zig
codecov.ymlhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/codecov.yml
codecov.ymlhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/codecov.yml
convert-baichuan-hf-to-gguf.pyhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/convert-baichuan-hf-to-gguf.py
convert-baichuan-hf-to-gguf.pyhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/convert-baichuan-hf-to-gguf.py
convert-falcon-hf-to-gguf.pyhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/convert-falcon-hf-to-gguf.py
convert-falcon-hf-to-gguf.pyhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/convert-falcon-hf-to-gguf.py
convert-gptneox-hf-to-gguf.pyhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/convert-gptneox-hf-to-gguf.py
convert-gptneox-hf-to-gguf.pyhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/convert-gptneox-hf-to-gguf.py
convert-llama-ggml-to-gguf.pyhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/convert-llama-ggml-to-gguf.py
convert-llama-ggml-to-gguf.pyhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/convert-llama-ggml-to-gguf.py
convert-lora-to-ggml.pyhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/convert-lora-to-ggml.py
convert-lora-to-ggml.pyhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/convert-lora-to-ggml.py
convert-refact-hf-to-gguf.pyhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/convert-refact-hf-to-gguf.py
convert-refact-hf-to-gguf.pyhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/convert-refact-hf-to-gguf.py
convert-starcoder-hf-to-gguf.pyhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/convert-starcoder-hf-to-gguf.py
convert-starcoder-hf-to-gguf.pyhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/convert-starcoder-hf-to-gguf.py
convert.pyhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/convert.py
convert.pyhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/convert.py
flake.lockhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/flake.lock
flake.lockhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/flake.lock
flake.nixhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/flake.nix
flake.nixhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/flake.nix
ggml-alloc.chttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/ggml-alloc.c
ggml-alloc.chttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/ggml-alloc.c
ggml-alloc.hhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/ggml-alloc.h
ggml-alloc.hhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/ggml-alloc.h
ggml-cuda.cuhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/ggml-cuda.cu
ggml-cuda.cuhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/ggml-cuda.cu
ggml-cuda.hhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/ggml-cuda.h
ggml-cuda.hhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/ggml-cuda.h
ggml-metal.hhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/ggml-metal.h
ggml-metal.hhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/ggml-metal.h
ggml-metal.mhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/ggml-metal.m
ggml-metal.mhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/ggml-metal.m
ggml-metal.metalhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/ggml-metal.metal
ggml-metal.metalhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/ggml-metal.metal
ggml-mpi.chttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/ggml-mpi.c
ggml-mpi.chttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/ggml-mpi.c
ggml-mpi.hhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/ggml-mpi.h
ggml-mpi.hhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/ggml-mpi.h
ggml-opencl.cpphttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/ggml-opencl.cpp
ggml-opencl.cpphttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/ggml-opencl.cpp
ggml-opencl.hhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/ggml-opencl.h
ggml-opencl.hhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/ggml-opencl.h
ggml.chttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/ggml.c
ggml.chttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/ggml.c
ggml.hhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/ggml.h
ggml.hhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/ggml.h
k_quants.chttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/k_quants.c
k_quants.chttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/k_quants.c
k_quants.hhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/k_quants.h
k_quants.hhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/k_quants.h
llama.cpphttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/llama.cpp
llama.cpphttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/llama.cpp
llama.hhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/llama.h
llama.hhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/llama.h
mypy.inihttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/mypy.ini
mypy.inihttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/mypy.ini
requirements.txthttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/requirements.txt
requirements.txthttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/requirements.txt
run_with_preset.pyhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/run_with_preset.py
run_with_preset.pyhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/run_with_preset.py
unicode.hhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/unicode.h
unicode.hhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/unicode.h
READMEhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel
Licensehttps://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel
https://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel#llamacpp
https://user-images.githubusercontent.com/1991296/230134379-7181e485-c521-4d23-a0d6-f7b3b61ba524.png
https://github.com/ggerganov/llama.cpp/actions
https://opensource.org/licenses/MIT
Roadmaphttps://github.com/users/ggerganov/projects/7
Project statushttps://github.com/ggerganov/llama.cpp/discussions/3471
Manifestohttps://github.com/ggerganov/llama.cpp/discussions/205
ggmlhttps://github.com/ggerganov/ggml
LLaMAhttps://arxiv.org/abs/2302.13971
https://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel#hot-topics
#3401https://github.com/ggerganov/llama.cpp/pull/3401
#3228https://github.com/ggerganov/llama.cpp/pull/3228
Descriptionhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel#description
Usagehttps://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel#usage
Get the Codehttps://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel#get-the-code
Buildhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel#build
BLAS Buildhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel#blas-build
Prepare Data & Runhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel#prepare-data--run
Memory/Disk Requirementshttps://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel#memorydisk-requirements
Quantizationhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel#quantization
Interactive modehttps://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel#interactive-mode
Constrained output with grammarshttps://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel#constrained-output-with-grammars
Instruction mode with Alpacahttps://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel#instruction-mode-with-alpaca
Using OpenLLaMAhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel#using-openllama
Using GPT4Allhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel#using-gpt4all
Using Pygmalion 7B & Metharme 7Bhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel#using-pygmalion-7b--metharme-7b
Obtaining the Facebook LLaMA original model and Stanford Alpaca model datahttps://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel#obtaining-the-facebook-llama-original-model-and-stanford-alpaca-model-data
Verifying the model fileshttps://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel#verifying-the-model-files
Seminal papers and background on the modelshttps://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel#seminal-papers-and-background-on-the-models
Perplexity (measuring model quality)https://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel#perplexity-measuring-model-quality
Androidhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel#android
Dockerhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel#docker
Contributinghttps://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel#contributing
Coding guidelineshttps://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel#coding-guidelines
Docshttps://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel#docs
https://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel#description
hacked in an eveninghttps://github.com/ggerganov/llama.cpp/issues/33#issuecomment-1465108022
ggmlhttps://github.com/ggerganov/ggml
Alpacahttps://github.com/ggerganov/llama.cpp#instruction-mode-with-alpaca
GPT4Allhttps://github.com/ggerganov/llama.cpp#using-gpt4all
Chinese LLaMA / Alpacahttps://github.com/ymcui/Chinese-LLaMA-Alpaca
Chinese LLaMA-2 / Alpaca-2https://github.com/ymcui/Chinese-LLaMA-Alpaca-2
Vigogne (French)https://github.com/bofenghuang/vigogne
Vicunahttps://github.com/ggerganov/llama.cpp/discussions/643#discussioncomment-5533894
Koalahttps://bair.berkeley.edu/blog/2023/04/03/koala/
OpenBuddy 🐶 (Multilingual)https://github.com/OpenBuddy/OpenBuddy
Pygmalion 7B / Metharme 7Bhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel#using-pygmalion-7b--metharme-7b
WizardLMhttps://github.com/nlpxucan/WizardLM
Baichuan-7Bhttps://huggingface.co/baichuan-inc/baichuan-7B
baichuan-7b-sfthttps://huggingface.co/hiyouga/baichuan-7b-sft
Aquila-7Bhttps://huggingface.co/BAAI/Aquila-7B
AquilaChat-7Bhttps://huggingface.co/BAAI/AquilaChat-7B
Starcoder modelshttps://github.com/ggerganov/llama.cpp/pull/3187
Mistral AI v0.1https://huggingface.co/mistralai/Mistral-7B-v0.1
abetlen/llama-cpp-pythonhttps://github.com/abetlen/llama-cpp-python
go-skynet/go-llama.cpphttps://github.com/go-skynet/go-llama.cpp
withcatai/node-llama-cpphttps://github.com/withcatai/node-llama-cpp
hlhr202/llama-nodehttps://github.com/hlhr202/llama-node
yoshoku/llama_cpp.rbhttps://github.com/yoshoku/llama_cpp.rb
mdrokz/rust-llama.cpphttps://github.com/mdrokz/rust-llama.cpp
SciSharp/LLamaSharphttps://github.com/SciSharp/LLamaSharp
donderom/llm4shttps://github.com/donderom/llm4s
phronmophobic/llama.cljhttps://github.com/phronmophobic/llama.clj
mybigday/llama.rnhttps://github.com/mybigday/llama.rn
kherud/java-llama.cpphttps://github.com/kherud/java-llama.cpp
nat/openplaygroundhttps://github.com/nat/openplayground
oobabooga/text-generation-webuihttps://github.com/oobabooga/text-generation-webui
withcatai/cataihttps://github.com/withcatai/catai
whisper.cpphttps://github.com/ggerganov/whisper.cpp
https://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel#usage
https://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel#get-the-code
https://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel#build
w64devkithttps://github.com/skeeto/w64devkit/releases
DRM in FreeBSDhttps://wiki.freebsd.org/Graphics
https://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel#metal-build
https://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel#mpi-build
MPICHhttps://www.mpich.org
OpenMPIhttps://www.open-mpi.org
https://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel#blas-build
https://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel#accelerate-framework
https://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel#openblas
w64devkithttps://github.com/skeeto/w64devkit/releases
OpenBLAS for Windowshttps://github.com/xianyi/OpenBLAS/releases
https://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel#blis
BLIS.mdhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/docs/BLIS.md
https://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel#intel-mkl
https://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel#cublas
CUDA Toolkithttps://developer.nvidia.com/cuda-downloads
CUDA_VISIBLE_DEVICEShttps://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html#env-vars
https://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel#hipblas
ROCm Quick Start (Linux)https://rocm.docs.amd.com/en/latest/deploy/linux/quick_start.html
HIP_VISIBLE_DEVICEShttps://rocm.docs.amd.com/en/latest/understand/gpu_isolation.html#hip-visible-devices
https://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel#clblast
CLBlasthttps://github.com/CNugteren/CLBlast
OpenCL SDKhttps://github.com/KhronosGroup/OpenCL-SDK
OpenCL Releaseshttps://github.com/KhronosGroup/OpenCL-SDK/releases
https://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel#installing-clblast
CLBlast Releaseshttps://github.com/CNugteren/CLBlast/releases
https://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel#building-llama-with-clblast
https://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel#running-llama-with-clblast
https://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel#prepare-data--run
https://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel#memorydisk-requirements
https://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel#quantization
k-quantshttps://github.com/ggerganov/llama.cpp/pull/1684
#2707https://github.com/ggerganov/llama.cpp/pull/2707
#2807https://github.com/ggerganov/llama.cpp/pull/2807
https://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel#perplexity-measuring-model-quality
https://huggingface.co/docs/transformers/perplexityhttps://huggingface.co/docs/transformers/perplexity
https://paperswithcode.com/dataset/wikitext-2https://paperswithcode.com/dataset/wikitext-2
https://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel#interactive-mode
READMEhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/examples/main/README.md
https://user-images.githubusercontent.com/1991296/224575029-2af3c7dc-5a65-4f64-a6bb-517a532aea38.png
https://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel#persistent-interaction
https://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel#constrained-output-with-grammars
GBNF Guidehttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/grammars/README.md
https://grammar.intrinsiclabs.ai/https://grammar.intrinsiclabs.ai/
its repohttp://github.com/intrinsiclabsai/gbnfgen
https://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel#instruction-mode-with-alpaca
OpenLLaMAhttps://github.com/openlm-research/open_llama
https://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel#using-openllama
3Bhttps://huggingface.co/openlm-research/open_llama_3b
7Bhttps://huggingface.co/openlm-research/open_llama_7b
13Bhttps://huggingface.co/openlm-research/open_llama_13b
GPT4Allhttps://github.com/nomic-ai/gpt4all
https://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel#using-gpt4all
https://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel#using-pygmalion-7b--metharme-7b
LLaMA weightshttps://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel#obtaining-the-facebook-llama-original-model-and-stanford-alpaca-model-data
Pygmalion 7Bhttps://huggingface.co/PygmalionAI/pygmalion-7b/
Metharme 7Bhttps://huggingface.co/PygmalionAI/metharme-7b
the latest HF convert scripthttps://github.com/huggingface/transformers/blob/main/src/transformers/models/llama/convert_llama_weights_to_hf.py
xor_codechttps://huggingface.co/PygmalionAI/pygmalion-7b/blob/main/xor_codec.py
bfloat16https://en.wikipedia.org/wiki/Bfloat16_floating-point_format
https://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel#obtaining-the-facebook-llama-original-model-and-stanford-alpaca-model-data
Facebook's LLaMA repositoryhttps://github.com/facebookresearch/llama/pull/73/files
https://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel#obtaining-and-using-the-facebook-llama-2-model
Facebook's LLaMA download pagehttps://ai.meta.com/resources/models-and-libraries/llama-downloads/
TheBlokehttps://huggingface.co/TheBloke
LLaMA 2 7B basehttps://huggingface.co/TheBloke/Llama-2-7B-GGUF
LLaMA 2 13B basehttps://huggingface.co/TheBloke/Llama-2-13B-GGUF
LLaMA 2 70B basehttps://huggingface.co/TheBloke/Llama-2-70B-GGUF
LLaMA 2 7B chathttps://huggingface.co/TheBloke/Llama-2-7B-chat-GGUF
LLaMA 2 13B chathttps://huggingface.co/TheBloke/Llama-2-13B-chat-GGUF
LLaMA 2 70B chathttps://huggingface.co/TheBloke/Llama-2-70B-chat-GGUF
https://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel#verifying-the-model-files
sha256 checksumshttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/SHA256SUMS
https://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel#seminal-papers-and-background-on-the-models
Introducing LLaMA: A foundational, 65-billion-parameter large language modelhttps://ai.facebook.com/blog/large-language-model-llama-meta-ai/
LLaMA: Open and Efficient Foundation Language Modelshttps://arxiv.org/abs/2302.13971
Language Models are Few-Shot Learnershttps://arxiv.org/abs/2005.14165
Aligning language models to follow instructionshttps://openai.com/research/instruction-following
Training language models to follow instructions with human feedbackhttps://arxiv.org/abs/2203.02155
https://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel#how-to-run
https://s3.amazonaws.com/research.metamind.io/wikitext/wikitext-2-raw-v1.zip?ref=salesforce-researchhttps://s3.amazonaws.com/research.metamind.io/wikitext/wikitext-2-raw-v1.zip?ref=salesforce-research
https://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel#android
https://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel#building-the-project-using-android-ndk
termuxhttps://termux.dev/
Android NDKhttps://developer.android.com/ndk
termuxhttps://termux.dev/
https://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel#building-the-project-using-termux-f-droid
https://github.com/CNugteren/CLBlasthttps://github.com/CNugteren/CLBlast
https://www.reddit.com/r/termux/comments/kc3ynp/opencl_working_in_termux_more_in_comments/https://www.reddit.com/r/termux/comments/kc3ynp/opencl_working_in_termux_more_in_comments/
https://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel#docker
https://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel#prerequisites
https://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel#images
.devops/https://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/.devops
.github/workflows/docker.ymlhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/.github/workflows/docker.yml
https://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel#usage-1
https://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel#docker-with-cuda
nvidia-container-toolkithttps://github.com/NVIDIA/nvidia-container-toolkit
https://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel#building-locally
https://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel#usage-2
https://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel#contributing
Inference at the edgehttps://github.com/ggerganov/llama.cpp/discussions/205
Changelog podcasthttps://changelog.com/podcast/532
https://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel#coding-guidelines
good first issueshttps://github.com/ggerganov/llama.cpp/issues?q=is%3Aissue+is%3Aopen+label%3A%22good+first+issue%22
https://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel#docs
mainhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/examples/main/README.md
serverhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/examples/server/README.md
embd-inputhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/examples/embd-input/README.md
jeopardyhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/examples/jeopardy/README.md
BLIShttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/docs/BLIS.md
Performance troubleshootinghttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/docs/token_generation_performance_tips.md
GGML tips & trickshttps://github.com/ggerganov/llama.cpp/wiki/GGML-Tips-&-Tricks
GBNF grammarshttps://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/server-parallel/grammars/README.md
Readme https://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel#readme-ov-file
MIT license https://patch-diff.githubusercontent.com/tmc/llama.cpp/blob/master/LICENSE
Please reload this pagehttps://patch-diff.githubusercontent.com/tmc/llama.cpp/tree/server-parallel
Activityhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/activity
1 starhttps://patch-diff.githubusercontent.com/tmc/llama.cpp/stargazers
0 watchinghttps://patch-diff.githubusercontent.com/tmc/llama.cpp/watchers
0 forkshttps://patch-diff.githubusercontent.com/tmc/llama.cpp/forks
Report repository https://patch-diff.githubusercontent.com/contact/report-content?content_url=https%3A%2F%2Fgithub.com%2Ftmc%2Fllama.cpp&report=tmc+%28user%29
Releaseshttps://patch-diff.githubusercontent.com/tmc/llama.cpp/releases
1,071 tags https://patch-diff.githubusercontent.com/tmc/llama.cpp/tags
Packages 0https://patch-diff.githubusercontent.com/users/tmc/packages?repo_name=llama.cpp
https://github.com
Termshttps://docs.github.com/site-policy/github-terms/github-terms-of-service
Privacyhttps://docs.github.com/site-policy/privacy-policies/github-privacy-statement
Securityhttps://github.com/security
Statushttps://www.githubstatus.com/
Communityhttps://github.community/
Docshttps://docs.github.com/
Contacthttps://support.github.com?tags=dotcom-footer

Viewport: width=device-width


URLs of crawlers that visited me.