René's URL Explorer Experiment


Title: GitHub - masterFoad/optillm: Optimizing inference proxy for LLMs

Open Graph Title: GitHub - masterFoad/optillm: Optimizing inference proxy for LLMs

X Title: GitHub - masterFoad/optillm: Optimizing inference proxy for LLMs

Description: Optimizing inference proxy for LLMs. Contribute to masterFoad/optillm development by creating an account on GitHub.

Open Graph Description: Optimizing inference proxy for LLMs. Contribute to masterFoad/optillm development by creating an account on GitHub.

X Description: Optimizing inference proxy for LLMs. Contribute to masterFoad/optillm development by creating an account on GitHub.

Opengraph URL: https://github.com/masterFoad/optillm

X: @github

direct link

Domain: patch-diff.githubusercontent.com

route-pattern/:user_id/:repository
route-controllerfiles
route-actiondisambiguate
fetch-noncev2:d5ae55c3-4fde-74f6-8543-259ec4b079c2
current-catalog-service-hashf3abb0cc802f3d7b95fc8762b94bdcb13bf39634c40c357301c4aa1d67a256fb
request-id8ADA:F8C89:1488FE:198591:69910A3A
html-safe-nonced39daea701adf0eb738055416242847847236a3e738e50900964d178e64253f0
visitor-payloadeyJyZWZlcnJlciI6IiIsInJlcXVlc3RfaWQiOiI4QURBOkY4Qzg5OjE0ODhGRToxOTg1OTE6Njk5MTBBM0EiLCJ2aXNpdG9yX2lkIjoiNDcxNTY5ODUwNTkyNjExNzk0NyIsInJlZ2lvbl9lZGdlIjoiaWFkIiwicmVnaW9uX3JlbmRlciI6ImlhZCJ9
visitor-hmac0300a1a34a60018041ced8315e97a16bf4ab30eac63d03713a30bdc6d835fae9
hovercard-subject-tagrepository:914744822
github-keyboard-shortcutsrepository,copilot
google-site-verificationApib7-x98H0j5cPqHWwSMm6dNU4GmODRoqxLiDzdx9I
octolytics-urlhttps://collector.github.com/github/collect
analytics-location//
fb:app_id1401488693436528
apple-itunes-appapp-id=1477376905, app-argument=https://github.com/masterFoad/optillm
twitter:imagehttps://opengraph.githubassets.com/aeb2587d66fc5ceb7465e04836f056bea4e0c0ff882f94ad722a24f7f9b0a539/masterFoad/optillm
twitter:cardsummary_large_image
og:imagehttps://opengraph.githubassets.com/aeb2587d66fc5ceb7465e04836f056bea4e0c0ff882f94ad722a24f7f9b0a539/masterFoad/optillm
og:image:altOptimizing inference proxy for LLMs. Contribute to masterFoad/optillm development by creating an account on GitHub.
og:image:width1200
og:image:height600
og:site_nameGitHub
og:typeobject
hostnamegithub.com
expected-hostnamegithub.com
None42c603b9d642c4a9065a51770f75e5e27132fef0e858607f5c9cb7e422831a7b
turbo-cache-controlno-preview
go-importgithub.com/masterFoad/optillm git https://github.com/masterFoad/optillm.git
octolytics-dimension-user_id32059146
octolytics-dimension-user_loginmasterFoad
octolytics-dimension-repository_id914744822
octolytics-dimension-repository_nwomasterFoad/optillm
octolytics-dimension-repository_publictrue
octolytics-dimension-repository_is_forktrue
octolytics-dimension-repository_parent_id846237240
octolytics-dimension-repository_parent_nwoalgorithmicsuperintelligence/optillm
octolytics-dimension-repository_network_root_id846237240
octolytics-dimension-repository_network_root_nwoalgorithmicsuperintelligence/optillm
turbo-body-classeslogged-out env-production page-responsive
disable-turbofalse
browser-stats-urlhttps://api.github.com/_private/browser/stats
browser-errors-urlhttps://api.github.com/_private/browser/errors
release848bc6032dcc93a9a7301dcc3f379a72ba13b96e
ui-targetfull
theme-color#1e2327
color-schemelight dark

Links:

Skip to contenthttps://patch-diff.githubusercontent.com/masterFoad/optillm#start-of-content
https://patch-diff.githubusercontent.com/
Sign in https://patch-diff.githubusercontent.com/login?return_to=https%3A%2F%2Fgithub.com%2FmasterFoad%2Foptillm
GitHub CopilotWrite better code with AIhttps://github.com/features/copilot
GitHub SparkBuild and deploy intelligent appshttps://github.com/features/spark
GitHub ModelsManage and compare promptshttps://github.com/features/models
MCP RegistryNewIntegrate external toolshttps://github.com/mcp
ActionsAutomate any workflowhttps://github.com/features/actions
CodespacesInstant dev environmentshttps://github.com/features/codespaces
IssuesPlan and track workhttps://github.com/features/issues
Code ReviewManage code changeshttps://github.com/features/code-review
GitHub Advanced SecurityFind and fix vulnerabilitieshttps://github.com/security/advanced-security
Code securitySecure your code as you buildhttps://github.com/security/advanced-security/code-security
Secret protectionStop leaks before they starthttps://github.com/security/advanced-security/secret-protection
Why GitHubhttps://github.com/why-github
Documentationhttps://docs.github.com
Bloghttps://github.blog
Changeloghttps://github.blog/changelog
Marketplacehttps://github.com/marketplace
View all featureshttps://github.com/features
Enterpriseshttps://github.com/enterprise
Small and medium teamshttps://github.com/team
Startupshttps://github.com/enterprise/startups
Nonprofitshttps://github.com/solutions/industry/nonprofits
App Modernizationhttps://github.com/solutions/use-case/app-modernization
DevSecOpshttps://github.com/solutions/use-case/devsecops
DevOpshttps://github.com/solutions/use-case/devops
CI/CDhttps://github.com/solutions/use-case/ci-cd
View all use caseshttps://github.com/solutions/use-case
Healthcarehttps://github.com/solutions/industry/healthcare
Financial serviceshttps://github.com/solutions/industry/financial-services
Manufacturinghttps://github.com/solutions/industry/manufacturing
Governmenthttps://github.com/solutions/industry/government
View all industrieshttps://github.com/solutions/industry
View all solutionshttps://github.com/solutions
AIhttps://github.com/resources/articles?topic=ai
Software Developmenthttps://github.com/resources/articles?topic=software-development
DevOpshttps://github.com/resources/articles?topic=devops
Securityhttps://github.com/resources/articles?topic=security
View all topicshttps://github.com/resources/articles
Customer storieshttps://github.com/customer-stories
Events & webinarshttps://github.com/resources/events
Ebooks & reportshttps://github.com/resources/whitepapers
Business insightshttps://github.com/solutions/executive-insights
GitHub Skillshttps://skills.github.com
Documentationhttps://docs.github.com
Customer supporthttps://support.github.com
Community forumhttps://github.com/orgs/community/discussions
Trust centerhttps://github.com/trust-center
Partnershttps://github.com/partners
GitHub SponsorsFund open source developershttps://github.com/sponsors
Security Labhttps://securitylab.github.com
Maintainer Communityhttps://maintainers.github.com
Acceleratorhttps://github.com/accelerator
Archive Programhttps://archiveprogram.github.com
Topicshttps://github.com/topics
Trendinghttps://github.com/trending
Collectionshttps://github.com/collections
Enterprise platformAI-powered developer platformhttps://github.com/enterprise
GitHub Advanced SecurityEnterprise-grade security featureshttps://github.com/security/advanced-security
Copilot for BusinessEnterprise-grade AI featureshttps://github.com/features/copilot/copilot-business
Premium SupportEnterprise-grade 24/7 supporthttps://github.com/premium-support
Pricinghttps://github.com/pricing
Search syntax tipshttps://docs.github.com/search-github/github-code-search/understanding-github-code-search-syntax
documentationhttps://docs.github.com/search-github/github-code-search/understanding-github-code-search-syntax
Sign in https://patch-diff.githubusercontent.com/login?return_to=https%3A%2F%2Fgithub.com%2FmasterFoad%2Foptillm
Sign up https://patch-diff.githubusercontent.com/signup?ref_cta=Sign+up&ref_loc=header+logged+out&ref_page=%2F%3Cuser-name%3E%2F%3Crepo-name%3E&source=header-repo&source_repo=masterFoad%2Foptillm
Reloadhttps://patch-diff.githubusercontent.com/masterFoad/optillm
Reloadhttps://patch-diff.githubusercontent.com/masterFoad/optillm
Reloadhttps://patch-diff.githubusercontent.com/masterFoad/optillm
masterFoad https://patch-diff.githubusercontent.com/masterFoad
optillmhttps://patch-diff.githubusercontent.com/masterFoad/optillm
algorithmicsuperintelligence/optillmhttps://patch-diff.githubusercontent.com/algorithmicsuperintelligence/optillm
Notifications https://patch-diff.githubusercontent.com/login?return_to=%2FmasterFoad%2Foptillm
Fork 0 https://patch-diff.githubusercontent.com/login?return_to=%2FmasterFoad%2Foptillm
Star 1 https://patch-diff.githubusercontent.com/login?return_to=%2FmasterFoad%2Foptillm
Apache-2.0 license https://patch-diff.githubusercontent.com/masterFoad/optillm/blob/main/LICENSE
1 star https://patch-diff.githubusercontent.com/masterFoad/optillm/stargazers
260 forks https://patch-diff.githubusercontent.com/masterFoad/optillm/forks
Branches https://patch-diff.githubusercontent.com/masterFoad/optillm/branches
Tags https://patch-diff.githubusercontent.com/masterFoad/optillm/tags
Activity https://patch-diff.githubusercontent.com/masterFoad/optillm/activity
Star https://patch-diff.githubusercontent.com/login?return_to=%2FmasterFoad%2Foptillm
Notifications https://patch-diff.githubusercontent.com/login?return_to=%2FmasterFoad%2Foptillm
Code https://patch-diff.githubusercontent.com/masterFoad/optillm
Pull requests 0 https://patch-diff.githubusercontent.com/masterFoad/optillm/pulls
Actions https://patch-diff.githubusercontent.com/masterFoad/optillm/actions
Projects 0 https://patch-diff.githubusercontent.com/masterFoad/optillm/projects
Security 0 https://patch-diff.githubusercontent.com/masterFoad/optillm/security
Insights https://patch-diff.githubusercontent.com/masterFoad/optillm/pulse
Code https://patch-diff.githubusercontent.com/masterFoad/optillm
Pull requests https://patch-diff.githubusercontent.com/masterFoad/optillm/pulls
Actions https://patch-diff.githubusercontent.com/masterFoad/optillm/actions
Projects https://patch-diff.githubusercontent.com/masterFoad/optillm/projects
Security https://patch-diff.githubusercontent.com/masterFoad/optillm/security
Insights https://patch-diff.githubusercontent.com/masterFoad/optillm/pulse
Brancheshttps://patch-diff.githubusercontent.com/masterFoad/optillm/branches
Tagshttps://patch-diff.githubusercontent.com/masterFoad/optillm/tags
https://patch-diff.githubusercontent.com/masterFoad/optillm/branches
https://patch-diff.githubusercontent.com/masterFoad/optillm/tags
298 Commitshttps://patch-diff.githubusercontent.com/masterFoad/optillm/commits/main/
https://patch-diff.githubusercontent.com/masterFoad/optillm/commits/main/
.github/workflowshttps://patch-diff.githubusercontent.com/masterFoad/optillm/tree/main/.github/workflows
.github/workflowshttps://patch-diff.githubusercontent.com/masterFoad/optillm/tree/main/.github/workflows
optillmhttps://patch-diff.githubusercontent.com/masterFoad/optillm/tree/main/optillm
optillmhttps://patch-diff.githubusercontent.com/masterFoad/optillm/tree/main/optillm
scriptshttps://patch-diff.githubusercontent.com/masterFoad/optillm/tree/main/scripts
scriptshttps://patch-diff.githubusercontent.com/masterFoad/optillm/tree/main/scripts
.dockerignorehttps://patch-diff.githubusercontent.com/masterFoad/optillm/blob/main/.dockerignore
.dockerignorehttps://patch-diff.githubusercontent.com/masterFoad/optillm/blob/main/.dockerignore
.gitignorehttps://patch-diff.githubusercontent.com/masterFoad/optillm/blob/main/.gitignore
.gitignorehttps://patch-diff.githubusercontent.com/masterFoad/optillm/blob/main/.gitignore
Dockerfilehttps://patch-diff.githubusercontent.com/masterFoad/optillm/blob/main/Dockerfile
Dockerfilehttps://patch-diff.githubusercontent.com/masterFoad/optillm/blob/main/Dockerfile
LICENSEhttps://patch-diff.githubusercontent.com/masterFoad/optillm/blob/main/LICENSE
LICENSEhttps://patch-diff.githubusercontent.com/masterFoad/optillm/blob/main/LICENSE
MANIFEST.inhttps://patch-diff.githubusercontent.com/masterFoad/optillm/blob/main/MANIFEST.in
MANIFEST.inhttps://patch-diff.githubusercontent.com/masterFoad/optillm/blob/main/MANIFEST.in
README.mdhttps://patch-diff.githubusercontent.com/masterFoad/optillm/blob/main/README.md
README.mdhttps://patch-diff.githubusercontent.com/masterFoad/optillm/blob/main/README.md
docker-compose.yamlhttps://patch-diff.githubusercontent.com/masterFoad/optillm/blob/main/docker-compose.yaml
docker-compose.yamlhttps://patch-diff.githubusercontent.com/masterFoad/optillm/blob/main/docker-compose.yaml
moa-patchwork-results.pnghttps://patch-diff.githubusercontent.com/masterFoad/optillm/blob/main/moa-patchwork-results.png
moa-patchwork-results.pnghttps://patch-diff.githubusercontent.com/masterFoad/optillm/blob/main/moa-patchwork-results.png
moa-results.pnghttps://patch-diff.githubusercontent.com/masterFoad/optillm/blob/main/moa-results.png
moa-results.pnghttps://patch-diff.githubusercontent.com/masterFoad/optillm/blob/main/moa-results.png
optillm-sequence-diagram.pnghttps://patch-diff.githubusercontent.com/masterFoad/optillm/blob/main/optillm-sequence-diagram.png
optillm-sequence-diagram.pnghttps://patch-diff.githubusercontent.com/masterFoad/optillm/blob/main/optillm-sequence-diagram.png
optillm.pyhttps://patch-diff.githubusercontent.com/masterFoad/optillm/blob/main/optillm.py
optillm.pyhttps://patch-diff.githubusercontent.com/masterFoad/optillm/blob/main/optillm.py
requirements.txthttps://patch-diff.githubusercontent.com/masterFoad/optillm/blob/main/requirements.txt
requirements.txthttps://patch-diff.githubusercontent.com/masterFoad/optillm/blob/main/requirements.txt
setup.pyhttps://patch-diff.githubusercontent.com/masterFoad/optillm/blob/main/setup.py
setup.pyhttps://patch-diff.githubusercontent.com/masterFoad/optillm/blob/main/setup.py
test.pyhttps://patch-diff.githubusercontent.com/masterFoad/optillm/blob/main/test.py
test.pyhttps://patch-diff.githubusercontent.com/masterFoad/optillm/blob/main/test.py
test_cases.jsonhttps://patch-diff.githubusercontent.com/masterFoad/optillm/blob/main/test_cases.json
test_cases.jsonhttps://patch-diff.githubusercontent.com/masterFoad/optillm/blob/main/test_cases.json
test_results.jsonhttps://patch-diff.githubusercontent.com/masterFoad/optillm/blob/main/test_results.json
test_results.jsonhttps://patch-diff.githubusercontent.com/masterFoad/optillm/blob/main/test_results.json
test_results.pnghttps://patch-diff.githubusercontent.com/masterFoad/optillm/blob/main/test_results.png
test_results.pnghttps://patch-diff.githubusercontent.com/masterFoad/optillm/blob/main/test_results.png
READMEhttps://patch-diff.githubusercontent.com/masterFoad/optillm
Licensehttps://patch-diff.githubusercontent.com/masterFoad/optillm
https://patch-diff.githubusercontent.com/masterFoad/optillm#optillm
https://huggingface.co/spaces/codelion/optillm
https://colab.research.google.com/drive/1SpuUb8d9xAoTh32M-9wJsB50AOH54EaH?usp=sharing
https://github.com/codelion/optillm/discussions
https://patch-diff.githubusercontent.com/masterFoad/optillm#installation
https://patch-diff.githubusercontent.com/masterFoad/optillm#using-pip
https://patch-diff.githubusercontent.com/masterFoad/optillm#using-docker
https://patch-diff.githubusercontent.com/masterFoad/optillm#install-from-source
herehttps://learn.microsoft.com/en-us/azure/ai-services/openai/how-to/managed-identity
https://patch-diff.githubusercontent.com/masterFoad/optillm#usage
LiteLLM sdkhttps://docs.litellm.ai/docs/#litellm-python-sdk
LiteLLM proxy serverhttps://docs.litellm.ai/docs/proxy/quick_start
https://raw.githubusercontent.com/codelion/optillm/main/optillm-sequence-diagram.png
oobaboogahttps://github.com/oobabooga/text-generation-webui/
patchworkhttps://github.com/patched-codes/patchwork
https://patch-diff.githubusercontent.com/masterFoad/optillm#local-inference-server
https://patch-diff.githubusercontent.com/masterFoad/optillm#starting-the-optillm-proxy-with-an-external-server-eg-llamacpp-or-ollama
https://patch-diff.githubusercontent.com/masterFoad/optillm#implemented-techniques
https://patch-diff.githubusercontent.com/masterFoad/optillm#implemented-plugins
optillm-bert-uncasedhttps://huggingface.co/codelion/optillm-bert-uncased
https://patch-diff.githubusercontent.com/masterFoad/optillm#available-parameters
https://patch-diff.githubusercontent.com/masterFoad/optillm#running-with-docker
Dockerfilehttps://github.com/codelion/optillm/blob/main/Dockerfile
https://patch-diff.githubusercontent.com/masterFoad/optillm#using-docker-compose
https://patch-diff.githubusercontent.com/masterFoad/optillm#sota-results-on-benchmarks-with-optillm
https://patch-diff.githubusercontent.com/masterFoad/optillm#coc-claude-3-5-sonnet-20241022-on-aime-2024-pass1-nov-2024
https://patch-diff.githubusercontent.com/masterFoad/optillm#readurlsmemory-gpt-4o-mini-on-google-frames-benchmark-oct-2024
https://patch-diff.githubusercontent.com/masterFoad/optillm#plansearch-gpt-4o-mini-on-livecodebench-sep-2024
https://patch-diff.githubusercontent.com/masterFoad/optillm#moa-gpt-4o-mini-on-arena-hard-auto-aug-2024
https://raw.githubusercontent.com/codelion/optillm/main/moa-results.png
https://patch-diff.githubusercontent.com/masterFoad/optillm#optillm-with-patchwork-july-2024
patchworkhttps://github.com/patched-codes/patchwork
https://raw.githubusercontent.com/codelion/optillm/main/moa-patchwork-results.png
https://patch-diff.githubusercontent.com/masterFoad/optillm#references
Chain of Code: Reasoning with a Language Model-Augmented Code Emulatorhttps://arxiv.org/abs/2312.04474
Implementationhttps://github.com/codelion/optillm/blob/main/optillm/plugins/coc_plugin.py
Entropy Based Sampling and Parallel CoT Decodinghttps://github.com/xjdr-alt/entropix
Implementationhttps://github.com/codelion/optillm/blob/main/optillm/entropy_decoding.py
Fact, Fetch, and Reason: A Unified Evaluation of Retrieval-Augmented Generationhttps://arxiv.org/abs/2409.12941
Evaluation scripthttps://github.com/codelion/optillm/blob/main/scripts/eval_frames_benchmark.py
Writing in the Margins: Better Inference Pattern for Long Context Retrievalhttps://www.arxiv.org/abs/2408.14906
Inspired the implementation of the memory pluginhttps://github.com/codelion/optillm/blob/main/optillm/plugins/memory_plugin.py
Chain-of-Thought Reasoning Without Promptinghttps://arxiv.org/abs/2402.10200
Implementationhttps://github.com/codelion/optillm/blob/main/optillm/cot_decoding.py
Re-Reading Improves Reasoning in Large Language Modelshttps://arxiv.org/abs/2309.06275
Implementationhttps://github.com/codelion/optillm/blob/main/optillm/reread.py
In-Context Principle Learning from Mistakeshttps://arxiv.org/abs/2402.05403
Implementationhttps://github.com/codelion/optillm/blob/main/optillm/leap.py
Planning In Natural Language Improves LLM Search For Code Generationhttps://arxiv.org/abs/2409.03733
Implementationhttps://github.com/codelion/optillm/blob/main/optillm/plansearch.py
Self-Consistency Improves Chain of Thought Reasoning in Language Modelshttps://arxiv.org/abs/2203.11171
Implementationhttps://github.com/codelion/optillm/blob/main/optillm/self_consistency.py
Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solvershttps://arxiv.org/abs/2408.06195
Implementationhttps://github.com/codelion/optillm/blob/main/optillm/rstar.py
Mixture-of-Agents Enhances Large Language Model Capabilitieshttps://arxiv.org/abs/2406.04692
Inspired the implementation of moahttps://github.com/codelion/optillm/blob/main/optillm/moa.py
Prover-Verifier Games improve legibility of LLM outputshttps://arxiv.org/abs/2407.13692
Implementationhttps://github.com/codelion/optillm/blob/main/optillm/pvg.py
Monte Carlo Tree Search Boosts Reasoning via Iterative Preference Learninghttps://arxiv.org/abs/2405.00451
Inspired the implementation of mctshttps://github.com/codelion/optillm/blob/main/optillm/mcts.py
Unsupervised Evaluation of Code LLMs with Round-Trip Correctnesshttps://arxiv.org/abs/2402.08699
Inspired the implementation of rtohttps://github.com/codelion/optillm/blob/main/optillm/rto.py
Patched MOA: optimizing inference for diverse software development taskshttps://arxiv.org/abs/2407.18521
Implementationhttps://github.com/codelion/optillm/blob/main/optillm/moa.py
Patched RTC: evaluating LLMs for diverse software development taskshttps://arxiv.org/abs/2407.16557
Implementationhttps://github.com/codelion/optillm/blob/main/optillm/rto.py
Readme https://patch-diff.githubusercontent.com/masterFoad/optillm#readme-ov-file
Apache-2.0 license https://patch-diff.githubusercontent.com/masterFoad/optillm#Apache-2.0-1-ov-file
Please reload this pagehttps://patch-diff.githubusercontent.com/masterFoad/optillm
Activityhttps://patch-diff.githubusercontent.com/masterFoad/optillm/activity
1 starhttps://patch-diff.githubusercontent.com/masterFoad/optillm/stargazers
0 watchinghttps://patch-diff.githubusercontent.com/masterFoad/optillm/watchers
0 forkshttps://patch-diff.githubusercontent.com/masterFoad/optillm/forks
Report repository https://patch-diff.githubusercontent.com/contact/report-content?content_url=https%3A%2F%2Fgithub.com%2FmasterFoad%2Foptillm&report=masterFoad+%28user%29
Releaseshttps://patch-diff.githubusercontent.com/masterFoad/optillm/releases
Packages 0https://patch-diff.githubusercontent.com/users/masterFoad/packages?repo_name=optillm
https://github.com
Termshttps://docs.github.com/site-policy/github-terms/github-terms-of-service
Privacyhttps://docs.github.com/site-policy/privacy-policies/github-privacy-statement
Securityhttps://github.com/security
Statushttps://www.githubstatus.com/
Communityhttps://github.community/
Docshttps://docs.github.com/
Contacthttps://support.github.com?tags=dotcom-footer

Viewport: width=device-width


URLs of crawlers that visited me.