René's URL Explorer Experiment


Title: GitHub - deepspeedai/DeepSpeed: DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Open Graph Title: GitHub - deepspeedai/DeepSpeed: DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

X Title: GitHub - deepspeedai/DeepSpeed: DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Description: DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective. - deepspeedai/DeepSpeed

Open Graph Description: DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective. - deepspeedai/DeepSpeed

X Description: DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective. - deepspeedai/DeepSpeed

Mail addresses
opencode@microsoft.com

Opengraph URL: https://github.com/deepspeedai/DeepSpeed

X: @github

direct link

Domain: patch-diff.githubusercontent.com

route-pattern/:user_id/:repository
route-controllerfiles
route-actiondisambiguate
fetch-noncev2:19351af8-ecb3-1729-c01c-88c3235e8307
current-catalog-service-hashf3abb0cc802f3d7b95fc8762b94bdcb13bf39634c40c357301c4aa1d67a256fb
request-idD5B4:319547:2B7AC38:38D79CE:696B72F7
html-safe-nonce73af5efe0744e0224edc8bdca36c557afe3498b069a727556c70ed706d081e9a
visitor-payloadeyJyZWZlcnJlciI6IiIsInJlcXVlc3RfaWQiOiJENUI0OjMxOTU0NzoyQjdBQzM4OjM4RDc5Q0U6Njk2QjcyRjciLCJ2aXNpdG9yX2lkIjoiNTQ4MDYwNTU4NjkwMzc1NzU1OSIsInJlZ2lvbl9lZGdlIjoiaWFkIiwicmVnaW9uX3JlbmRlciI6ImlhZCJ9
visitor-hmac1325081f5d42b084adb6c1be91e36bf168ded0ba655d9ce03b8e49e1d1fac3bf
hovercard-subject-tagrepository:235860204
github-keyboard-shortcutsrepository,copilot
google-site-verificationApib7-x98H0j5cPqHWwSMm6dNU4GmODRoqxLiDzdx9I
octolytics-urlhttps://collector.github.com/github/collect
analytics-location//
fb:app_id1401488693436528
apple-itunes-appapp-id=1477376905, app-argument=https://github.com/deepspeedai/DeepSpeed
twitter:imagehttps://opengraph.githubassets.com/85bdbfcccc2c42937b48011e6b626af114f3784af34890f7f3e8a26ef5bd8028/deepspeedai/DeepSpeed
twitter:cardsummary_large_image
og:imagehttps://opengraph.githubassets.com/85bdbfcccc2c42937b48011e6b626af114f3784af34890f7f3e8a26ef5bd8028/deepspeedai/DeepSpeed
og:image:altDeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective. - deepspeedai/DeepSpeed
og:image:width1200
og:image:height600
og:site_nameGitHub
og:typeobject
hostnamegithub.com
expected-hostnamegithub.com
None5f99f7c1d70f01da5b93e5ca90303359738944d8ab470e396496262c66e60b8d
turbo-cache-controlno-preview
go-importgithub.com/deepspeedai/DeepSpeed git https://github.com/deepspeedai/DeepSpeed.git
octolytics-dimension-user_id74068820
octolytics-dimension-user_logindeepspeedai
octolytics-dimension-repository_id235860204
octolytics-dimension-repository_nwodeepspeedai/DeepSpeed
octolytics-dimension-repository_publictrue
octolytics-dimension-repository_is_forkfalse
octolytics-dimension-repository_network_root_id235860204
octolytics-dimension-repository_network_root_nwodeepspeedai/DeepSpeed
turbo-body-classeslogged-out env-production page-responsive
disable-turbofalse
browser-stats-urlhttps://api.github.com/_private/browser/stats
browser-errors-urlhttps://api.github.com/_private/browser/errors
release82560a55c6b2054555076f46e683151ee28a19bc
ui-targetcanary-2
theme-color#1e2327
color-schemelight dark

Links:

Skip to contenthttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed#start-of-content
https://patch-diff.githubusercontent.com/
Sign in https://patch-diff.githubusercontent.com/login?return_to=https%3A%2F%2Fgithub.com%2Fdeepspeedai%2FDeepSpeed
GitHub CopilotWrite better code with AIhttps://github.com/features/copilot
GitHub SparkBuild and deploy intelligent appshttps://github.com/features/spark
GitHub ModelsManage and compare promptshttps://github.com/features/models
MCP RegistryNewIntegrate external toolshttps://github.com/mcp
ActionsAutomate any workflowhttps://github.com/features/actions
CodespacesInstant dev environmentshttps://github.com/features/codespaces
IssuesPlan and track workhttps://github.com/features/issues
Code ReviewManage code changeshttps://github.com/features/code-review
GitHub Advanced SecurityFind and fix vulnerabilitieshttps://github.com/security/advanced-security
Code securitySecure your code as you buildhttps://github.com/security/advanced-security/code-security
Secret protectionStop leaks before they starthttps://github.com/security/advanced-security/secret-protection
Why GitHubhttps://github.com/why-github
Documentationhttps://docs.github.com
Bloghttps://github.blog
Changeloghttps://github.blog/changelog
Marketplacehttps://github.com/marketplace
View all featureshttps://github.com/features
Enterpriseshttps://github.com/enterprise
Small and medium teamshttps://github.com/team
Startupshttps://github.com/enterprise/startups
Nonprofitshttps://github.com/solutions/industry/nonprofits
App Modernizationhttps://github.com/solutions/use-case/app-modernization
DevSecOpshttps://github.com/solutions/use-case/devsecops
DevOpshttps://github.com/solutions/use-case/devops
CI/CDhttps://github.com/solutions/use-case/ci-cd
View all use caseshttps://github.com/solutions/use-case
Healthcarehttps://github.com/solutions/industry/healthcare
Financial serviceshttps://github.com/solutions/industry/financial-services
Manufacturinghttps://github.com/solutions/industry/manufacturing
Governmenthttps://github.com/solutions/industry/government
View all industrieshttps://github.com/solutions/industry
View all solutionshttps://github.com/solutions
AIhttps://github.com/resources/articles?topic=ai
Software Developmenthttps://github.com/resources/articles?topic=software-development
DevOpshttps://github.com/resources/articles?topic=devops
Securityhttps://github.com/resources/articles?topic=security
View all topicshttps://github.com/resources/articles
Customer storieshttps://github.com/customer-stories
Events & webinarshttps://github.com/resources/events
Ebooks & reportshttps://github.com/resources/whitepapers
Business insightshttps://github.com/solutions/executive-insights
GitHub Skillshttps://skills.github.com
Documentationhttps://docs.github.com
Customer supporthttps://support.github.com
Community forumhttps://github.com/orgs/community/discussions
Trust centerhttps://github.com/trust-center
Partnershttps://github.com/partners
GitHub SponsorsFund open source developershttps://github.com/sponsors
Security Labhttps://securitylab.github.com
Maintainer Communityhttps://maintainers.github.com
Acceleratorhttps://github.com/accelerator
Archive Programhttps://archiveprogram.github.com
Topicshttps://github.com/topics
Trendinghttps://github.com/trending
Collectionshttps://github.com/collections
Enterprise platformAI-powered developer platformhttps://github.com/enterprise
GitHub Advanced SecurityEnterprise-grade security featureshttps://github.com/security/advanced-security
Copilot for BusinessEnterprise-grade AI featureshttps://github.com/features/copilot/copilot-business
Premium SupportEnterprise-grade 24/7 supporthttps://github.com/premium-support
Pricinghttps://github.com/pricing
Search syntax tipshttps://docs.github.com/search-github/github-code-search/understanding-github-code-search-syntax
documentationhttps://docs.github.com/search-github/github-code-search/understanding-github-code-search-syntax
Sign in https://patch-diff.githubusercontent.com/login?return_to=https%3A%2F%2Fgithub.com%2Fdeepspeedai%2FDeepSpeed
Sign up https://patch-diff.githubusercontent.com/signup?ref_cta=Sign+up&ref_loc=header+logged+out&ref_page=%2F%3Cuser-name%3E%2F%3Crepo-name%3E&source=header-repo&source_repo=deepspeedai%2FDeepSpeed
Reloadhttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed
Reloadhttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed
Reloadhttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed
deepspeedai https://patch-diff.githubusercontent.com/deepspeedai
DeepSpeedhttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed
Notifications https://patch-diff.githubusercontent.com/login?return_to=%2Fdeepspeedai%2FDeepSpeed
Fork 4.7k https://patch-diff.githubusercontent.com/login?return_to=%2Fdeepspeedai%2FDeepSpeed
Star 41.3k https://patch-diff.githubusercontent.com/login?return_to=%2Fdeepspeedai%2FDeepSpeed
www.deepspeed.ai/https://www.deepspeed.ai/
Apache-2.0 license https://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/blob/master/LICENSE
41.3k stars https://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/stargazers
4.7k forks https://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/forks
Branches https://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/branches
Tags https://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/tags
Activity https://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/activity
Star https://patch-diff.githubusercontent.com/login?return_to=%2Fdeepspeedai%2FDeepSpeed
Notifications https://patch-diff.githubusercontent.com/login?return_to=%2Fdeepspeedai%2FDeepSpeed
Code https://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed
Issues 1.1k https://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/issues
Pull requests 108 https://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/pulls
Discussions https://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/discussions
Actions https://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/actions
Projects 0 https://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/projects
Security Uh oh! There was an error while loading. Please reload this page. https://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/security
Please reload this pagehttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed
Insights https://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/pulse
Code https://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed
Issues https://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/issues
Pull requests https://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/pulls
Discussions https://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/discussions
Actions https://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/actions
Projects https://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/projects
Security https://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/security
Insights https://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/pulse
Brancheshttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/branches
Tagshttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/tags
https://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/branches
https://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/tags
3,034 Commitshttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/commits/master/
https://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/commits/master/
.githubhttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/tree/master/.github
.githubhttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/tree/master/.github
acceleratorhttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/tree/master/accelerator
acceleratorhttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/tree/master/accelerator
azurehttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/tree/master/azure
azurehttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/tree/master/azure
benchmarkshttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/tree/master/benchmarks
benchmarkshttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/tree/master/benchmarks
binhttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/tree/master/bin
binhttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/tree/master/bin
blogshttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/tree/master/blogs
blogshttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/tree/master/blogs
cihttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/tree/master/ci
cihttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/tree/master/ci
csrchttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/tree/master/csrc
csrchttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/tree/master/csrc
deepspeedhttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/tree/master/deepspeed
deepspeedhttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/tree/master/deepspeed
dockerhttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/tree/master/docker
dockerhttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/tree/master/docker
docshttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/tree/master/docs
docshttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/tree/master/docs
exampleshttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/tree/master/examples
exampleshttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/tree/master/examples
op_builderhttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/tree/master/op_builder
op_builderhttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/tree/master/op_builder
releasehttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/tree/master/release
releasehttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/tree/master/release
requirementshttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/tree/master/requirements
requirementshttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/tree/master/requirements
scriptshttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/tree/master/scripts
scriptshttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/tree/master/scripts
testshttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/tree/master/tests
testshttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/tree/master/tests
.clang-formathttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/blob/master/.clang-format
.clang-formathttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/blob/master/.clang-format
.flake8https://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/blob/master/.flake8
.flake8https://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/blob/master/.flake8
.gitignorehttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/blob/master/.gitignore
.gitignorehttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/blob/master/.gitignore
.gitmoduleshttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/blob/master/.gitmodules
.gitmoduleshttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/blob/master/.gitmodules
.pre-commit-config.yamlhttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/blob/master/.pre-commit-config.yaml
.pre-commit-config.yamlhttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/blob/master/.pre-commit-config.yaml
.pylintrchttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/blob/master/.pylintrc
.pylintrchttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/blob/master/.pylintrc
.readthedocs.ymlhttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/blob/master/.readthedocs.yml
.readthedocs.ymlhttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/blob/master/.readthedocs.yml
.style.yapfhttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/blob/master/.style.yapf
.style.yapfhttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/blob/master/.style.yapf
CODEOWNERShttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/blob/master/CODEOWNERS
CODEOWNERShttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/blob/master/CODEOWNERS
CODE_OF_CONDUCT.mdhttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/blob/master/CODE_OF_CONDUCT.md
CODE_OF_CONDUCT.mdhttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/blob/master/CODE_OF_CONDUCT.md
COMMITTERS.mdhttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/blob/master/COMMITTERS.md
COMMITTERS.mdhttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/blob/master/COMMITTERS.md
CONTRIBUTING.mdhttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/blob/master/CONTRIBUTING.md
CONTRIBUTING.mdhttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/blob/master/CONTRIBUTING.md
GOVERNANCE.mdhttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/blob/master/GOVERNANCE.md
GOVERNANCE.mdhttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/blob/master/GOVERNANCE.md
LICENSEhttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/blob/master/LICENSE
LICENSEhttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/blob/master/LICENSE
MANIFEST.inhttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/blob/master/MANIFEST.in
MANIFEST.inhttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/blob/master/MANIFEST.in
MANIFEST_win.inhttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/blob/master/MANIFEST_win.in
MANIFEST_win.inhttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/blob/master/MANIFEST_win.in
Makefilehttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/blob/master/Makefile
Makefilehttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/blob/master/Makefile
README.mdhttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/blob/master/README.md
README.mdhttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/blob/master/README.md
SECURITY.mdhttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/blob/master/SECURITY.md
SECURITY.mdhttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/blob/master/SECURITY.md
build_win.bathttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/blob/master/build_win.bat
build_win.bathttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/blob/master/build_win.bat
environment.ymlhttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/blob/master/environment.yml
environment.ymlhttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/blob/master/environment.yml
install.shhttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/blob/master/install.sh
install.shhttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/blob/master/install.sh
setup.cfghttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/blob/master/setup.cfg
setup.cfghttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/blob/master/setup.cfg
setup.pyhttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/blob/master/setup.py
setup.pyhttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/blob/master/setup.py
version.txthttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/blob/master/version.txt
version.txthttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/blob/master/version.txt
READMEhttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed
Code of conducthttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed
Contributinghttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed
Apache-2.0 licensehttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed
Securityhttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed
https://github.com/deepspeedai/DeepSpeed/blob/master/LICENSE
https://pypi.org/project/deepspeed/
https://pepy.tech/project/deepspeed
https://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed#build-pipeline-status
https://www.bestpractices.dev/projects/9530
https://twitter.com/intent/follow?screen_name=DeepSpeedAI
https://twitter.com/DeepSpeedAI_JP
https://www.zhihu.com/people/deepspeed
https://join.slack.com/t/deepspeedworkspace/shared_invite/zt-3a8pjd8dd-PCj2hMvR4Y2syPwVnjEoww
https://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/blob/master/docs/assets/images/DeepSpeed_light.svg#gh-light-mode-only
https://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/blob/master/docs/assets/images/DeepSpeed_dark_transparent.svg#gh-dark-mode-only
https://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed#latest-news
DeepSpeed Core API updates: PyTorch-style backward and low-precision master stateshttps://github.com/deepspeedai/DeepSpeed/blob/master/blogs/core_api_update/README.md
Ray x DeepSpeed Meetuphttps://luma.com/3wctqteh
herehttps://docs.google.com/presentation/d/1eM3mY6oW9GYkRy1Xz0iOnbbEr5T1t0JJXOM5BKtR-Ks/edit?slide=id.g38615d6b4c2_0_87#slide=id.g38615d6b4c2_0_87
SuperOffload: Unleashing the Power of Large-Scale LLM Training on Superchipshttps://pytorch.org/blog/superoffload-unleashing-the-power-of-large-scale-llm-training-on-superchips/
Study of ZenFlow and ZeRO offload performance with DeepSpeed CPU core bindinghttps://github.com/deepspeedai/DeepSpeed/blob/master/blogs/zenflow-corebinding/README.md
ZenFlow: Stall-Free Offloading Engine for LLM Traininghttps://pytorch.org/blog/zenflow-stall-free-offloading-engine-for-llm-training/
Arctic Long Sequence Training (ALST) with DeepSpeed: Scalable And Efficient Training For Multi-Million Token Sequenceshttps://www.snowflake.com/en/engineering-blog/arctic-long-sequence-training-multi-million-token-ai/
DeepNVMe: Affordable I/O scaling for Deep Learning Applicationshttps://github.com/deepspeedai/DeepSpeed/blob/master/blogs/deepnvme/06-2025/README.md
DeepCompile: Unlocking Compiler Optimization for Distributed Traininghttps://github.com/deepspeedai/DeepSpeed/blob/master/blogs/deepcompile/README.md
DeepSpeed AutoTP: Automatic Tensor Parallel Training of Hugging Face modelshttps://github.com/deepspeedai/DeepSpeed/blob/master/blogs/huggingface-tp/README.md
Ulysses-Offload: Democratizing Long Context LLM Traininghttps://github.com/deepspeedai/DeepSpeed/blob/master/blogs/ulysses-offload/README.md
https://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed#extreme-speed-and-scale-for-dl-training
DeepSpeedhttps://www.deepspeed.ai/
MT-530Bhttps://www.microsoft.com/en-us/research/blog/using-deepspeed-and-megatron-to-train-megatron-turing-nlg-530b-the-worlds-largest-and-most-powerful-generative-language-model/
BLOOMhttps://huggingface.co/blog/bloom-megatron-deepspeed
system innovationshttps://www.deepspeed.ai/training/
https://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed#deepspeed-adoption
AI at Scalehttps://www.microsoft.com/en-us/research/project/ai-at-scale/
herehttps://innovation.microsoft.com/en-us/exploring-ai-at-scale
Megatron-Turing NLG (530B)https://www.microsoft.com/en-us/research/blog/using-deepspeed-and-megatron-to-train-megatron-turing-nlg-530b-the-worlds-largest-and-most-powerful-generative-language-model/
Jurassic-1 (178B)https://uploads-ssl.webflow.com/60fd4503684b466578c0d307/61138924626a6981ee09caf6_jurassic_tech_paper.pdf
BLOOM (176B)https://huggingface.co/blog/bloom-megatron-deepspeed
GLM (130B)https://github.com/THUDM/GLM-130B
xTrimoPGLM (100B)https://www.biorxiv.org/content/10.1101/2023.07.05.547496v2
YaLM (100B)https://github.com/yandex/YaLM-100B
GPT-NeoX (20B)https://github.com/EleutherAI/gpt-neox
AlexaTM (20B)https://www.amazon.science/blog/20b-parameter-alexa-model-sets-new-marks-in-few-shot-learning
Turing NLG (17B)https://www.microsoft.com/en-us/research/blog/turing-nlg-a-17-billion-parameter-language-model-by-microsoft/
METRO-LM (5.4B)https://arxiv.org/pdf/2204.06644.pdf
https://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/blob/master/docs/assets/images/transformers-light.png#gh-light-mode-only
https://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/blob/master/docs/assets/images/transformers-dark.png#gh-dark-mode-only
Transformers with DeepSpeedhttps://huggingface.co/docs/transformers/deepspeed
https://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/blob/master/docs/assets/images/accelerate-light.png#gh-light-mode-only
https://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/blob/master/docs/assets/images/accelerate-dark.png#gh-dark-mode-only
Accelerate with DeepSpeedhttps://huggingface.co/docs/accelerate/usage_guides/deepspeed
https://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/blob/master/docs/assets/images/lightning-light.svg#gh-light-mode-only
https://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/blob/master/docs/assets/images/lightning-dark.svg#gh-dark-mode-only
Lightning with DeepSpeedhttps://lightning.ai/docs/pytorch/stable/advanced/model_parallel.html#deepspeed
https://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/blob/master/docs/assets/images/mosaicml.svg
MosaicML with DeepSpeedhttps://docs.mosaicml.com/projects/composer/en/latest/trainer/using_the_trainer.html?highlight=deepspeed#deepspeed-integration
https://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/blob/master/docs/assets/images/determined.svg
Determined with DeepSpeedhttps://docs.determined.ai/latest/training/apis-howto/deepspeed/overview.html
https://user-images.githubusercontent.com/58739961/187154444-fce76639-ac8d-429b-9354-c6fac64b7ef8.jpg
MMEngine with DeepSpeedhttps://mmengine.readthedocs.io/en/latest/common_usage/large_model_training.html#deepspeed
https://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed#build-pipeline-status
https://github.com/deepspeedai/DeepSpeed/actions/workflows/nv-pre-compile-ops.yml
https://github.com/deepspeedai/DeepSpeed/actions/workflows/aws-torch-latest.yml
https://github.com/deepspeedai/DeepSpeed/actions/workflows/amd-mi200.yml
https://github.com/deepspeedai/DeepSpeed/actions/workflows/cpu-torch-latest.yml
https://github.com/deepspeedai/DeepSpeed/actions/workflows/hpu-gaudi2.yml
https://github.com/deepspeedai/DeepSpeed/actions/workflows/xpu-max1100.yml
https://github.com/deepspeedai/DeepSpeed/actions/workflows/aws-accelerate.yml
https://github.com/deepspeedai/DeepSpeed/actions/workflows/formatting.yml
https://github.com/deepspeedai/DeepSpeed/actions/workflows/pages/pages-build-deployment
https://deepspeed.readthedocs.io/en/latest/?badge=latest
https://github.com/deepspeedai/DeepSpeed/actions/workflows/python.yml
https://github.com/Ascend/Ascend-CI/actions/workflows/deepspeed.yaml
https://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed#installation
torch's JIT C++ extension loader that relies on ninjahttps://pytorch.org/docs/stable/cpp_extension.html
https://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed#requirements
PyTorchhttps://pytorch.org/
nvcchttps://docs.nvidia.com/cuda/cuda-compiler-driver-nvcc/#introduction
hipcchttps://github.com/ROCm-Developer-Tools/HIPCC
https://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed#contributed-hw-support
https://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed#pypi
PyPIhttps://pypi.org/project/deepspeed/
advanced installation instructionshttps://www.deepspeed.ai/tutorials/advanced-install/
https://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed#windows
herehttps://github.com/deepspeedai/DeepSpeed/tree/master/blogs/windows/08-2024/README.md
https://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed#further-reading
deepspeed.aihttps://www.deepspeed.ai/
Getting Startedhttps://www.deepspeed.ai/getting-started/
DeepSpeed JSON Configurationhttps://www.deepspeed.ai/docs/config-json/
API Documentationhttps://deepspeed.readthedocs.io/en/latest/
Tutorialshttps://www.deepspeed.ai/tutorials/
Blogshttps://www.deepspeed.ai/posts/
https://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed#ci-funding
https://modal.comhttps://modal.com
https://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed#contributing
contributinghttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/blob/master/CONTRIBUTING.md
https://github.com/deepspeedai/DeepSpeed/graphs/contributors
https://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed#developer-certificate-of-origin
DCOhttps://wiki.linuxfoundation.org/dco
https://developercertificate.orghttps://developercertificate.org
https://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed#code-of-conduct
Microsoft Open Source Code of Conducthttps://opensource.microsoft.com/codeofconduct/
Code of Conduct FAQhttps://opensource.microsoft.com/codeofconduct/faq/
https://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed#publications
arXiv:1910.02054https://arxiv.org/abs/1910.02054
In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC '20)https://dl.acm.org/doi/10.5555/3433701.3433727
In Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining (KDD '20, Tutorial)https://dl.acm.org/doi/10.1145/3394486.3406703
arXiv:2010.13369https://arxiv.org/abs/2010.13369
NeurIPS 2020https://proceedings.neurips.cc/paper/2020/hash/a1140a3d0df1c81e24ae954d935e8926-Abstract.html
arXiv:2101.06840https://arxiv.org/abs/2101.06840
USENIX ATC 2021https://www.usenix.org/conference/atc21/presentation/ren-jie
[paper]https://arxiv.org/abs/2101.06840
[slides]https://www.usenix.org/system/files/atc21_slides_ren-jie.pdf
[blog]https://www.microsoft.com/en-us/research/blog/deepspeed-extreme-scale-model-training-for-everyone/
arXiv:2102.02888https://arxiv.org/abs/2102.02888
ICML 2021http://proceedings.mlr.press/v139/tang21a.html
arXiv:2104.07857https://arxiv.org/abs/2104.07857
SC 2021https://dl.acm.org/doi/abs/10.1145/3458817.3476205
[paper]https://arxiv.org/abs/2104.07857
[slides]https://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/blob/master/docs/assets/files/SC21-ZeRO-Infinity.pdf
[blog]https://www.microsoft.com/en-us/research/blog/zero-infinity-and-deepspeed-unlocking-unprecedented-model-scale-for-deep-learning-training/
arXiv:2104.06069https://arxiv.org/abs/2104.06069
HiPC 2022https://hipc.org/advance-program/
arXiv:2108.06084https://arxiv.org/abs/2108.06084
NeurIPS 2022https://openreview.net/forum?id=JpZ5du_Kdh
arXiv:2202.06009https://arxiv.org/abs/2202.06009
arXiv:2201.05596https://arxiv.org/abs/2201.05596
ICML 2022https://proceedings.mlr.press/v162/rajbhandari22a.html
[pdf]https://arxiv.org/abs/2201.05596
[slides]https://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/blob/master/docs/assets/files/ICML-5mins.pdf
[blog]https://www.microsoft.com/en-us/research/blog/deepspeed-advancing-moe-inference-and-training-to-power-next-generation-ai-scale/
arXiv:2201.11990https://arxiv.org/abs/2201.11990
arXiv:2206.01859https://arxiv.org/abs/2206.01859
NeurIPS 2022https://openreview.net/forum?id=xNeAhc2CNAl
arXiv:2206.01861https://arxiv.org/abs/2206.01861
NeurIPS 2022https://openreview.net/forum?id=f-fVCElZ-G1
[slides]https://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/blob/master/docs/assets/files/zeroquant_series.pdf
[blog]https://www.microsoft.com/en-us/research/blog/deepspeed-compression-a-composable-library-for-extreme-compression-and-zero-cost-quantization/
arXiv:2207.00032https://arxiv.org/abs/2207.00032
SC 2022https://dl.acm.org/doi/abs/10.5555/3571885.3571946
[paper]https://arxiv.org/abs/2207.00032
[slides]https://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/blob/master/docs/assets/files/sc22-ds-inference.pdf
[blog]https://www.microsoft.com/en-us/research/blog/deepspeed-accelerating-large-scale-model-inference-and-training-via-system-optimizations-and-compression/
arXiv:2211.11586https://arxiv.org/abs/2211.11586
arXiv:2212.03597https://arxiv.org/abs/2212.03597
ENLSP2023 Workshop at NeurIPS2023https://neurips2023-enlsp.github.io/
arXiv:2301.12017https://arxiv.org/abs/2301.12017
ICML2023https://icml.cc/Conferences/2023
ICLR:2023https://openreview.net/forum?id=Pgtn4l6eKjv
arXiv:2303.07226https://arxiv.org/abs/2303.07226
Finding at EMNLP2023https://2023.emnlp.org/
arXiv:2303.08374https://arxiv.org/abs/2303.08374
arXiv:2303.06318https://arxiv.org/abs/2303.06318
ICS 2023https://dl.acm.org/doi/10.1145/3577193.3593704
arXiv:2306.10209https://arxiv.org/abs/2306.10209
ML for Sys Workshop at NeurIPS2023http://mlforsystems.org/
[blog]https://www.microsoft.com/en-us/research/blog/deepspeed-zero-a-leap-in-speed-for-llm-and-chat-model-training-with-4x-less-communication/
arXiv:2303.08302https://arxiv.org/abs/2303.08302
ENLSP2023 Workshop at NeurIPS2023https://neurips2023-enlsp.github.io/
[slides]https://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/blob/master/docs/assets/files/zeroquant_series.pdf
arXiv:2305.09847https://arxiv.org/abs/2305.09847
arXiv:2308.01320https://arxiv.org/abs/2308.01320
arXiv:2307.09782https://arxiv.org/abs/2307.09782
ENLSP2023 Workshop at NeurIPS2023https://neurips2023-enlsp.github.io/
[slides]https://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/blob/master/docs/assets/files/zeroquant_series.pdf
arXiv:2309.14327https://arxiv.org/pdf/2309.14327.pdf
arXiv:2310.04610https://arxiv.org/abs/2310.04610
[blog]https://www.microsoft.com/en-us/research/blog/announcing-the-deepspeed4science-initiative-enabling-large-scale-scientific-discovery-through-sophisticated-ai-system-technologies/
arXiv:2310.17723https://arxiv.org/abs/2310.17723
arXiv:2312.08583https://arxiv.org/abs/2312.08583
arXiv:2401.14112https://arxiv.org/abs/2401.14112
System Optimizations for Enabling Training of Extreme Long Sequence Transformer Modelshttps://dl.acm.org/doi/10.1145/3662158.3662806
arXiv:2406.18820https://arxiv.org/abs/2406.18820
arXiv:2506.13996https://arxiv.org/abs/2506.13996
arXiv:2505.12242https://arxiv.org/abs/2505.12242
arxivhttps://arxiv.org/abs/2509.21271
ASPLOS 2026https://www.asplos-conference.org/asplos2026
https://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed#videos
Overviewhttps://www.youtube.com/watch?v=CaseqC45DNc&list=PLa85ZdUjfWS21mgibJ2vCvLziprjpKoW0&index=29
ZeRO + large model traininghttps://www.youtube.com/watch?v=y4_bCiAsIAk&list=PLa85ZdUjfWS21mgibJ2vCvLziprjpKoW0&index=28
17B T-NLG demohttps://www.youtube.com/watch?v=9V-ZbP92drg&list=PLa85ZdUjfWS21mgibJ2vCvLziprjpKoW0&index=27
Fastest BERT training + RScan tuninghttps://www.youtube.com/watch?v=o1K-ZG9F6u0&list=PLa85ZdUjfWS21mgibJ2vCvLziprjpKoW0&index=26
part 1https://www.youtube.com/watch?v=_NOk-mBwDYg&list=PLa85ZdUjfWS21mgibJ2vCvLziprjpKoW0&index=92
part 2https://www.youtube.com/watch?v=sG6_c4VXLww&list=PLa85ZdUjfWS21mgibJ2vCvLziprjpKoW0&index=94
part 3https://www.youtube.com/watch?v=k9yPkBTayos&list=PLa85ZdUjfWS21mgibJ2vCvLziprjpKoW0&index=93
FAQhttps://www.youtube.com/watch?v=nsHu6vEgPew&list=PLa85ZdUjfWS21mgibJ2vCvLziprjpKoW0&index=24
ZeRO & Fastest BERT: Increasing the scale and speed of deep learning training in DeepSpeedhttps://note.microsoft.com/MSR-Webinar-DeepSpeed-Registration-On-Demand.html
DeepSpeed on AzureMLhttps://youtu.be/yBVXR8G8Bg8
Large Model Training and Inference with DeepSpeed // Samyam Rajbhandari // LLMs in Prod Conferencehttps://www.youtube.com/watch?v=cntxC3g22oU
[slides]https://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/blob/master/docs/assets/files/presentation-mlops.pdf
DeepSpeed: All the tricks to scale to gigantic models (Mark Saroufim)https://www.youtube.com/watch?v=pDGI668pNg0
Turing-NLG, DeepSpeed and the ZeRO optimizer (Yannic Kilcher)https://www.youtube.com/watch?v=tC01FRB0M7w
Ultimate Guide To Scaling ML Models (The AI Epiphany)https://www.youtube.com/watch?v=hc0u4avAkuM
www.deepspeed.ai/https://www.deepspeed.ai/
machine-learning https://patch-diff.githubusercontent.com/topics/machine-learning
compression https://patch-diff.githubusercontent.com/topics/compression
deep-learning https://patch-diff.githubusercontent.com/topics/deep-learning
gpu https://patch-diff.githubusercontent.com/topics/gpu
inference https://patch-diff.githubusercontent.com/topics/inference
pytorch https://patch-diff.githubusercontent.com/topics/pytorch
zero https://patch-diff.githubusercontent.com/topics/zero
data-parallelism https://patch-diff.githubusercontent.com/topics/data-parallelism
model-parallelism https://patch-diff.githubusercontent.com/topics/model-parallelism
mixture-of-experts https://patch-diff.githubusercontent.com/topics/mixture-of-experts
pipeline-parallelism https://patch-diff.githubusercontent.com/topics/pipeline-parallelism
billion-parameters https://patch-diff.githubusercontent.com/topics/billion-parameters
trillion-parameters https://patch-diff.githubusercontent.com/topics/trillion-parameters
Readme https://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed#readme-ov-file
Apache-2.0 license https://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed#Apache-2.0-1-ov-file
Code of conduct https://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed#coc-ov-file
Contributing https://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed#contributing-ov-file
Security policy https://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed#security-ov-file
Please reload this pagehttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed
Activityhttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/activity
Custom propertieshttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/custom-properties
41.3k starshttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/stargazers
352 watchinghttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/watchers
4.7k forkshttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/forks
Report repository https://patch-diff.githubusercontent.com/contact/report-content?content_url=https%3A%2F%2Fgithub.com%2Fdeepspeedai%2FDeepSpeed&report=deepspeedai+%28user%29
Releases 104https://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/releases
v0.18.4 Patch Release Latest Jan 7, 2026 https://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/releases/tag/v0.18.4
+ 103 releaseshttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/releases
Packages 0https://patch-diff.githubusercontent.com/orgs/deepspeedai/packages?repo_name=DeepSpeed
Used by 14.8khttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/network/dependents
+ 14,796 https://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/network/dependents
Contributors 466https://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/graphs/contributors
Please reload this pagehttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed
+ 452 contributorshttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/graphs/contributors
Python 72.7% https://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/search?l=python
C++ 18.0% https://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/search?l=c%2B%2B
Cuda 8.5% https://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/search?l=cuda
C 0.4% https://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/search?l=c
Shell 0.3% https://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/search?l=shell
Dockerfile 0.1% https://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed/search?l=dockerfile
https://github.com
Termshttps://docs.github.com/site-policy/github-terms/github-terms-of-service
Privacyhttps://docs.github.com/site-policy/privacy-policies/github-privacy-statement
Securityhttps://github.com/security
Statushttps://www.githubstatus.com/
Communityhttps://github.community/
Docshttps://docs.github.com/
Contacthttps://support.github.com?tags=dotcom-footer

Viewport: width=device-width


URLs of crawlers that visited me.