René's URL Explorer Experiment


Title: GitHub - VectorInstitute/vector-inference: Efficient LLM inference on Slurm clusters.

Description: Efficient LLM inference on Slurm clusters. Contribute to VectorInstitute/vector-inference development by creating an account on GitHub.

Opengraph URL: https://github.com/VectorInstitute/vector-inference

X: @github

Domain: patch-diff.githubusercontent.com

apple-itunes-app: app-id=1477376905, app-argument=https://github.com/VectorInstitute/vector-inference
twitter:image: https://opengraph.githubassets.com/35166491b4ead7eb709cfcbb5c9a61db3961b068a5efb56e6bca348199e8472d/VectorInstitute/vector-inference
twitter:card: summary_large_image
og:image: https://opengraph.githubassets.com/35166491b4ead7eb709cfcbb5c9a61db3961b068a5efb56e6bca348199e8472d/VectorInstitute/vector-inference
og:image:alt: Efficient LLM inference on Slurm clusters. Contribute to VectorInstitute/vector-inference development by creating an account on GitHub.
og:image:width: 1200
og:image:height: 600
og:site_name: GitHub
og:type: object
hostname: github.com
expected-hostname: github.com
go-import: github.com/VectorInstitute/vector-inference git https://github.com/VectorInstitute/vector-inference.git

Links:

VectorInstitute https://patch-diff.githubusercontent.com/VectorInstitute
vector-inference https://patch-diff.githubusercontent.com/VectorInstitute/vector-inference
VectorInstitute/aieng-template-poetry https://patch-diff.githubusercontent.com/VectorInstitute/aieng-template-poetry
vectorinstitute.github.io/vector-inference/ https://vectorinstitute.github.io/vector-inference/
MIT license https://patch-diff.githubusercontent.com/VectorInstitute/vector-inference/blob/main/LICENSE
91 stars https://patch-diff.githubusercontent.com/VectorInstitute/vector-inference/stargazers
12 forks https://patch-diff.githubusercontent.com/VectorInstitute/vector-inference/forks
Branches https://patch-diff.githubusercontent.com/VectorInstitute/vector-inference/branches
Tags https://patch-diff.githubusercontent.com/VectorInstitute/vector-inference/tags
Activity https://patch-diff.githubusercontent.com/VectorInstitute/vector-inference/activity
Code https://patch-diff.githubusercontent.com/VectorInstitute/vector-inference
Issues 1 https://patch-diff.githubusercontent.com/VectorInstitute/vector-inference/issues
Pull requests 6 https://patch-diff.githubusercontent.com/VectorInstitute/vector-inference/pulls
Actions https://patch-diff.githubusercontent.com/VectorInstitute/vector-inference/actions
Projects 0 https://patch-diff.githubusercontent.com/VectorInstitute/vector-inference/projects
Security 0 https://patch-diff.githubusercontent.com/VectorInstitute/vector-inference/security
Insights https://patch-diff.githubusercontent.com/VectorInstitute/vector-inference/pulse
1,153 Commits https://patch-diff.githubusercontent.com/VectorInstitute/vector-inference/commits/main/
.github https://patch-diff.githubusercontent.com/VectorInstitute/vector-inference/tree/main/.github
docs https://patch-diff.githubusercontent.com/VectorInstitute/vector-inference/tree/main/docs
examples https://patch-diff.githubusercontent.com/VectorInstitute/vector-inference/tree/main/examples
profile https://patch-diff.githubusercontent.com/VectorInstitute/vector-inference/tree/main/profile
tests https://patch-diff.githubusercontent.com/VectorInstitute/vector-inference/tree/main/tests
vec_inf https://patch-diff.githubusercontent.com/VectorInstitute/vector-inference/tree/main/vec_inf
.gitignore https://patch-diff.githubusercontent.com/VectorInstitute/vector-inference/blob/main/.gitignore
.pre-commit-config.yaml https://patch-diff.githubusercontent.com/VectorInstitute/vector-inference/blob/main/.pre-commit-config.yaml
.python-version https://patch-diff.githubusercontent.com/VectorInstitute/vector-inference/blob/main/.python-version
LICENSE https://patch-diff.githubusercontent.com/VectorInstitute/vector-inference/blob/main/LICENSE
MODEL_TRACKING.md https://patch-diff.githubusercontent.com/VectorInstitute/vector-inference/blob/main/MODEL_TRACKING.md
README.md https://patch-diff.githubusercontent.com/VectorInstitute/vector-inference/blob/main/README.md
codecov.yml https://patch-diff.githubusercontent.com/VectorInstitute/vector-inference/blob/main/codecov.yml
mkdocs.yml https://patch-diff.githubusercontent.com/VectorInstitute/vector-inference/blob/main/mkdocs.yml
pyproject.toml https://patch-diff.githubusercontent.com/VectorInstitute/vector-inference/blob/main/pyproject.toml
sglang.Dockerfile https://patch-diff.githubusercontent.com/VectorInstitute/vector-inference/blob/main/sglang.Dockerfile
uv.lock https://patch-diff.githubusercontent.com/VectorInstitute/vector-inference/blob/main/uv.lock
venv.sh https://patch-diff.githubusercontent.com/VectorInstitute/vector-inference/blob/main/venv.sh
vllm.Dockerfile https://patch-diff.githubusercontent.com/VectorInstitute/vector-inference/blob/main/vllm.Dockerfile
https://patch-diff.githubusercontent.com/VectorInstitute/vector-inference#vector-inference-easy-inference-on-slurm-clusters
https://pypi.org/project/vec-inf
https://pypistats.org/packages/vec-inf
https://github.com/VectorInstitute/vector-inference/actions/workflows/code_checks.yml
https://github.com/VectorInstitute/vector-inference/actions/workflows/docs.yml
https://app.codecov.io/github/VectorInstitute/vector-inference/tree/main
https://docs.vllm.ai/en/v0.15.0/
https://docs.sglang.io/index.html
https://camo.githubusercontent.com/e05f584bbedb17846270641bcc5bf7eeac2cabf83429da051ce5b98d31ba8d6f/68747470733a2f2f696d672e736869656c64732e696f2f6769746875622f6c6963656e73652f566563746f72496e737469747574652f766563746f722d696e666572656e6365
Slurm https://slurm.schedmd.com/overview.html
vLLM https://docs.vllm.ai/en/v0.15.0/
SGLang https://docs.sglang.io/index.html
Installation https://patch-diff.githubusercontent.com/VectorInstitute/vector-inference#installation
here https://patch-diff.githubusercontent.com/VectorInstitute/vector-inference/blob/main/MODEL_TRACKING.md
https://patch-diff.githubusercontent.com/VectorInstitute/vector-inference#installation
vllm.Dockerfile https://patch-diff.githubusercontent.com/VectorInstitute/vector-inference/blob/main/vllm.Dockerfile
sglang.Dockerfile https://patch-diff.githubusercontent.com/VectorInstitute/vector-inference/blob/main/sglang.Dockerfile
Docker Hub https://hub.docker.com/orgs/vectorinstitute/repositories
vec_inf/config https://patch-diff.githubusercontent.com/VectorInstitute/vector-inference/blob/main/vec_inf/config
https://patch-diff.githubusercontent.com/VectorInstitute/vector-inference#usage
https://patch-diff.githubusercontent.com/VectorInstitute/vector-inference#cli
https://patch-diff.githubusercontent.com/VectorInstitute/vector-inference/blob/main/docs/assets/launch.png
slurm_vars.py https://patch-diff.githubusercontent.com/VectorInstitute/vector-inference/blob/main/vec_inf/client/slurm_vars.py
default configuration https://patch-diff.githubusercontent.com/VectorInstitute/vector-inference/blob/main/vec_inf/config/models.yaml
launch command section in User Guide https://vectorinstitute.github.io/vector-inference/latest/user_guide/#launch-command
https://patch-diff.githubusercontent.com/VectorInstitute/vector-inference#other-commands
User Guide https://vectorinstitute.github.io/vector-inference/user_guide/
https://patch-diff.githubusercontent.com/VectorInstitute/vector-inference#api
API Reference https://vectorinstitute.github.io/vector-inference/api/
https://patch-diff.githubusercontent.com/VectorInstitute/vector-inference#check-job-configuration
https://patch-diff.githubusercontent.com/VectorInstitute/vector-inference#send-inference-requests
examples https://patch-diff.githubusercontent.com/VectorInstitute/vector-inference/blob/main/examples
https://patch-diff.githubusercontent.com/VectorInstitute/vector-inference#ssh-tunnel-from-your-local-device
https://patch-diff.githubusercontent.com/VectorInstitute/vector-inference#reference
vectorinstitute.github.io/vector-inference/ https://vectorinstitute.github.io/vector-inference/
inference https://patch-diff.githubusercontent.com/topics/inference
speech-to-text https://patch-diff.githubusercontent.com/topics/speech-to-text
vlm https://patch-diff.githubusercontent.com/topics/vlm
text-embedding https://patch-diff.githubusercontent.com/topics/text-embedding
multimodal https://patch-diff.githubusercontent.com/topics/multimodal
audio-transcription https://patch-diff.githubusercontent.com/topics/audio-transcription
llm https://patch-diff.githubusercontent.com/topics/llm
vllm https://patch-diff.githubusercontent.com/topics/vllm
reward-model https://patch-diff.githubusercontent.com/topics/reward-model
llm-infernece https://patch-diff.githubusercontent.com/topics/llm-infernece
sglang https://patch-diff.githubusercontent.com/topics/sglang
llm-infrastructure https://patch-diff.githubusercontent.com/topics/llm-infrastructure
Readme https://patch-diff.githubusercontent.com/VectorInstitute/vector-inference#readme-ov-file
MIT license https://patch-diff.githubusercontent.com/VectorInstitute/vector-inference#MIT-1-ov-file
Contributing https://patch-diff.githubusercontent.com/VectorInstitute/vector-inference#contributing-ov-file
Activity https://patch-diff.githubusercontent.com/VectorInstitute/vector-inference/activity
Custom properties https://patch-diff.githubusercontent.com/VectorInstitute/vector-inference/custom-properties
91 stars https://patch-diff.githubusercontent.com/VectorInstitute/vector-inference/stargazers
7 watching https://patch-diff.githubusercontent.com/VectorInstitute/vector-inference/watchers
12 forks https://patch-diff.githubusercontent.com/VectorInstitute/vector-inference/forks
Releases 20 https://patch-diff.githubusercontent.com/VectorInstitute/vector-inference/releases
v0.8.1 Latest Feb 4, 2026 https://patch-diff.githubusercontent.com/VectorInstitute/vector-inference/releases/tag/v0.8.1
+ 19 releases https://patch-diff.githubusercontent.com/VectorInstitute/vector-inference/releases
Packages 0 https://patch-diff.githubusercontent.com/orgs/VectorInstitute/packages?repo_name=vector-inference
Contributors 15 https://patch-diff.githubusercontent.com/VectorInstitute/vector-inference/graphs/contributors
https://github.com/XkunW
https://github.com/amrit110
https://github.com/apps/pre-commit-ci
https://github.com/apps/github-actions
https://github.com/apps/dependabot
https://github.com/kohankhaki
https://github.com/fcogidi
https://github.com/xeon27
https://github.com/jwilles
https://github.com/jacobthebanana
https://github.com/apps/copilot-pull-request-reviewer
https://github.com/raeidsaqur
https://github.com/actions-user
https://github.com/jewelltaylor
https://github.com/rohan-uiuc
Python 97.9% https://patch-diff.githubusercontent.com/VectorInstitute/vector-inference/search?l=python
Dockerfile 1.4% https://patch-diff.githubusercontent.com/VectorInstitute/vector-inference/search?l=dockerfile
Shell 0.7% https://patch-diff.githubusercontent.com/VectorInstitute/vector-inference/search?l=shell
