René's URL Explorer Experiment


Title: GitHub - deepspeedai/DeepSpeed: DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Open Graph Title: GitHub - deepspeedai/DeepSpeed: DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

X Title: GitHub - deepspeedai/DeepSpeed: DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Description: DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective. - deepspeedai/DeepSpeed

Open Graph Description: DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective. - deepspeedai/DeepSpeed

X Description: DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective. - deepspeedai/DeepSpeed

Mail addresses
opencode@microsoft.com

Opengraph URL: https://github.com/deepspeedai/DeepSpeed

X: @github

direct link

Domain: github.com

route-pattern/:user_id/:repository
route-controllerfiles
route-actiondisambiguate
fetch-noncev2:4943f305-c18b-323b-97a6-9f4d30030401
current-catalog-service-hashf3abb0cc802f3d7b95fc8762b94bdcb13bf39634c40c357301c4aa1d67a256fb
request-idA242:32B525:7128A3:968AB5:696486B0
html-safe-nonce55f770692323e4cd6d95e1f9c152ad9df57db2387aaaba184272370d628cd8c6
visitor-payloadeyJyZWZlcnJlciI6IiIsInJlcXVlc3RfaWQiOiJBMjQyOjMyQjUyNTo3MTI4QTM6OTY4QUI1OjY5NjQ4NkIwIiwidmlzaXRvcl9pZCI6IjU2NjY1MjM3MDAyNjEwNjIzMjAiLCJyZWdpb25fZWRnZSI6ImlhZCIsInJlZ2lvbl9yZW5kZXIiOiJpYWQifQ==
visitor-hmacc8cda6f7028eace7a04771bbc1ebfb4c43e82c23599d193578a2ce52f49493da
hovercard-subject-tagrepository:235860204
github-keyboard-shortcutsrepository,copilot
google-site-verificationApib7-x98H0j5cPqHWwSMm6dNU4GmODRoqxLiDzdx9I
octolytics-urlhttps://collector.github.com/github/collect
analytics-location//
fb:app_id1401488693436528
apple-itunes-appapp-id=1477376905, app-argument=https://github.com/deepspeedai/DeepSpeed
twitter:imagehttps://opengraph.githubassets.com/80b31f3276bbf244a7dfff952ce6e1d37f58f2b942b96d8b8f1227aec9e6a3ca/deepspeedai/DeepSpeed
twitter:cardsummary_large_image
og:imagehttps://opengraph.githubassets.com/80b31f3276bbf244a7dfff952ce6e1d37f58f2b942b96d8b8f1227aec9e6a3ca/deepspeedai/DeepSpeed
og:image:altDeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective. - deepspeedai/DeepSpeed
og:image:width1200
og:image:height600
og:site_nameGitHub
og:typeobject
hostnamegithub.com
expected-hostnamegithub.com
Nonebaa7d9900fdf7b27d604f36887af878d569cfbdcf97126832a5f4f0caf0c6ba5
turbo-cache-controlno-preview
go-importgithub.com/deepspeedai/DeepSpeed git https://github.com/deepspeedai/DeepSpeed.git
octolytics-dimension-user_id74068820
octolytics-dimension-user_logindeepspeedai
octolytics-dimension-repository_id235860204
octolytics-dimension-repository_nwodeepspeedai/DeepSpeed
octolytics-dimension-repository_publictrue
octolytics-dimension-repository_is_forkfalse
octolytics-dimension-repository_network_root_id235860204
octolytics-dimension-repository_network_root_nwodeepspeedai/DeepSpeed
turbo-body-classeslogged-out env-production page-responsive
disable-turbofalse
browser-stats-urlhttps://api.github.com/_private/browser/stats
browser-errors-urlhttps://api.github.com/_private/browser/errors
release842eff1d11f899d02b6b3b98fa3ea4860e64b34e
ui-targetfull
theme-color#1e2327
color-schemelight dark

Links:

Skip to contenthttps://github.com/deepspeedai/DeepSpeed#start-of-content
https://github.com/
Sign in https://github.com/login?return_to=https%3A%2F%2Fgithub.com%2Fdeepspeedai%2FDeepSpeed
GitHub CopilotWrite better code with AIhttps://github.com/features/copilot
GitHub SparkBuild and deploy intelligent appshttps://github.com/features/spark
GitHub ModelsManage and compare promptshttps://github.com/features/models
MCP RegistryNewIntegrate external toolshttps://github.com/mcp
ActionsAutomate any workflowhttps://github.com/features/actions
CodespacesInstant dev environmentshttps://github.com/features/codespaces
IssuesPlan and track workhttps://github.com/features/issues
Code ReviewManage code changeshttps://github.com/features/code-review
GitHub Advanced SecurityFind and fix vulnerabilitieshttps://github.com/security/advanced-security
Code securitySecure your code as you buildhttps://github.com/security/advanced-security/code-security
Secret protectionStop leaks before they starthttps://github.com/security/advanced-security/secret-protection
Why GitHubhttps://github.com/why-github
Documentationhttps://docs.github.com
Bloghttps://github.blog
Changeloghttps://github.blog/changelog
Marketplacehttps://github.com/marketplace
View all featureshttps://github.com/features
Enterpriseshttps://github.com/enterprise
Small and medium teamshttps://github.com/team
Startupshttps://github.com/enterprise/startups
Nonprofitshttps://github.com/solutions/industry/nonprofits
App Modernizationhttps://github.com/solutions/use-case/app-modernization
DevSecOpshttps://github.com/solutions/use-case/devsecops
DevOpshttps://github.com/solutions/use-case/devops
CI/CDhttps://github.com/solutions/use-case/ci-cd
View all use caseshttps://github.com/solutions/use-case
Healthcarehttps://github.com/solutions/industry/healthcare
Financial serviceshttps://github.com/solutions/industry/financial-services
Manufacturinghttps://github.com/solutions/industry/manufacturing
Governmenthttps://github.com/solutions/industry/government
View all industrieshttps://github.com/solutions/industry
View all solutionshttps://github.com/solutions
AIhttps://github.com/resources/articles?topic=ai
Software Developmenthttps://github.com/resources/articles?topic=software-development
DevOpshttps://github.com/resources/articles?topic=devops
Securityhttps://github.com/resources/articles?topic=security
View all topicshttps://github.com/resources/articles
Customer storieshttps://github.com/customer-stories
Events & webinarshttps://github.com/resources/events
Ebooks & reportshttps://github.com/resources/whitepapers
Business insightshttps://github.com/solutions/executive-insights
GitHub Skillshttps://skills.github.com
Documentationhttps://docs.github.com
Customer supporthttps://support.github.com
Community forumhttps://github.com/orgs/community/discussions
Trust centerhttps://github.com/trust-center
Partnershttps://github.com/partners
GitHub SponsorsFund open source developershttps://github.com/sponsors
Security Labhttps://securitylab.github.com
Maintainer Communityhttps://maintainers.github.com
Acceleratorhttps://github.com/accelerator
Archive Programhttps://archiveprogram.github.com
Topicshttps://github.com/topics
Trendinghttps://github.com/trending
Collectionshttps://github.com/collections
Enterprise platformAI-powered developer platformhttps://github.com/enterprise
GitHub Advanced SecurityEnterprise-grade security featureshttps://github.com/security/advanced-security
Copilot for BusinessEnterprise-grade AI featureshttps://github.com/features/copilot/copilot-business
Premium SupportEnterprise-grade 24/7 supporthttps://github.com/premium-support
Pricinghttps://github.com/pricing
Search syntax tipshttps://docs.github.com/search-github/github-code-search/understanding-github-code-search-syntax
documentationhttps://docs.github.com/search-github/github-code-search/understanding-github-code-search-syntax
Sign in https://github.com/login?return_to=https%3A%2F%2Fgithub.com%2Fdeepspeedai%2FDeepSpeed
Sign up https://github.com/signup?ref_cta=Sign+up&ref_loc=header+logged+out&ref_page=%2F%3Cuser-name%3E%2F%3Crepo-name%3E&source=header-repo&source_repo=deepspeedai%2FDeepSpeed
Reloadhttps://github.com/deepspeedai/DeepSpeed
Reloadhttps://github.com/deepspeedai/DeepSpeed
Reloadhttps://github.com/deepspeedai/DeepSpeed
deepspeedai https://github.com/deepspeedai
DeepSpeedhttps://github.com/deepspeedai/DeepSpeed
Notifications https://github.com/login?return_to=%2Fdeepspeedai%2FDeepSpeed
Fork 4.7k https://github.com/login?return_to=%2Fdeepspeedai%2FDeepSpeed
Star 41.2k https://github.com/login?return_to=%2Fdeepspeedai%2FDeepSpeed
www.deepspeed.ai/https://www.deepspeed.ai/
Apache-2.0 license https://github.com/deepspeedai/DeepSpeed/blob/master/LICENSE
41.2k stars https://github.com/deepspeedai/DeepSpeed/stargazers
4.7k forks https://github.com/deepspeedai/DeepSpeed/forks
Branches https://github.com/deepspeedai/DeepSpeed/branches
Tags https://github.com/deepspeedai/DeepSpeed/tags
Activity https://github.com/deepspeedai/DeepSpeed/activity
Star https://github.com/login?return_to=%2Fdeepspeedai%2FDeepSpeed
Notifications https://github.com/login?return_to=%2Fdeepspeedai%2FDeepSpeed
Code https://github.com/deepspeedai/DeepSpeed
Issues 1.1k https://github.com/deepspeedai/DeepSpeed/issues
Pull requests 107 https://github.com/deepspeedai/DeepSpeed/pulls
Discussions https://github.com/deepspeedai/DeepSpeed/discussions
Actions https://github.com/deepspeedai/DeepSpeed/actions
Projects 0 https://github.com/deepspeedai/DeepSpeed/projects
Security Uh oh! There was an error while loading. Please reload this page. https://github.com/deepspeedai/DeepSpeed/security
Please reload this pagehttps://github.com/deepspeedai/DeepSpeed
Insights https://github.com/deepspeedai/DeepSpeed/pulse
Code https://github.com/deepspeedai/DeepSpeed
Issues https://github.com/deepspeedai/DeepSpeed/issues
Pull requests https://github.com/deepspeedai/DeepSpeed/pulls
Discussions https://github.com/deepspeedai/DeepSpeed/discussions
Actions https://github.com/deepspeedai/DeepSpeed/actions
Projects https://github.com/deepspeedai/DeepSpeed/projects
Security https://github.com/deepspeedai/DeepSpeed/security
Insights https://github.com/deepspeedai/DeepSpeed/pulse
Brancheshttps://github.com/deepspeedai/DeepSpeed/branches
Tagshttps://github.com/deepspeedai/DeepSpeed/tags
https://github.com/deepspeedai/DeepSpeed/branches
https://github.com/deepspeedai/DeepSpeed/tags
3,025 Commitshttps://github.com/deepspeedai/DeepSpeed/commits/master/
https://github.com/deepspeedai/DeepSpeed/commits/master/
.githubhttps://github.com/deepspeedai/DeepSpeed/tree/master/.github
.githubhttps://github.com/deepspeedai/DeepSpeed/tree/master/.github
acceleratorhttps://github.com/deepspeedai/DeepSpeed/tree/master/accelerator
acceleratorhttps://github.com/deepspeedai/DeepSpeed/tree/master/accelerator
azurehttps://github.com/deepspeedai/DeepSpeed/tree/master/azure
azurehttps://github.com/deepspeedai/DeepSpeed/tree/master/azure
benchmarkshttps://github.com/deepspeedai/DeepSpeed/tree/master/benchmarks
benchmarkshttps://github.com/deepspeedai/DeepSpeed/tree/master/benchmarks
binhttps://github.com/deepspeedai/DeepSpeed/tree/master/bin
binhttps://github.com/deepspeedai/DeepSpeed/tree/master/bin
blogshttps://github.com/deepspeedai/DeepSpeed/tree/master/blogs
blogshttps://github.com/deepspeedai/DeepSpeed/tree/master/blogs
cihttps://github.com/deepspeedai/DeepSpeed/tree/master/ci
cihttps://github.com/deepspeedai/DeepSpeed/tree/master/ci
csrchttps://github.com/deepspeedai/DeepSpeed/tree/master/csrc
csrchttps://github.com/deepspeedai/DeepSpeed/tree/master/csrc
deepspeedhttps://github.com/deepspeedai/DeepSpeed/tree/master/deepspeed
deepspeedhttps://github.com/deepspeedai/DeepSpeed/tree/master/deepspeed
dockerhttps://github.com/deepspeedai/DeepSpeed/tree/master/docker
dockerhttps://github.com/deepspeedai/DeepSpeed/tree/master/docker
docshttps://github.com/deepspeedai/DeepSpeed/tree/master/docs
docshttps://github.com/deepspeedai/DeepSpeed/tree/master/docs
exampleshttps://github.com/deepspeedai/DeepSpeed/tree/master/examples
exampleshttps://github.com/deepspeedai/DeepSpeed/tree/master/examples
op_builderhttps://github.com/deepspeedai/DeepSpeed/tree/master/op_builder
op_builderhttps://github.com/deepspeedai/DeepSpeed/tree/master/op_builder
releasehttps://github.com/deepspeedai/DeepSpeed/tree/master/release
releasehttps://github.com/deepspeedai/DeepSpeed/tree/master/release
requirementshttps://github.com/deepspeedai/DeepSpeed/tree/master/requirements
requirementshttps://github.com/deepspeedai/DeepSpeed/tree/master/requirements
scriptshttps://github.com/deepspeedai/DeepSpeed/tree/master/scripts
scriptshttps://github.com/deepspeedai/DeepSpeed/tree/master/scripts
testshttps://github.com/deepspeedai/DeepSpeed/tree/master/tests
testshttps://github.com/deepspeedai/DeepSpeed/tree/master/tests
.clang-formathttps://github.com/deepspeedai/DeepSpeed/blob/master/.clang-format
.clang-formathttps://github.com/deepspeedai/DeepSpeed/blob/master/.clang-format
.flake8https://github.com/deepspeedai/DeepSpeed/blob/master/.flake8
.flake8https://github.com/deepspeedai/DeepSpeed/blob/master/.flake8
.gitignorehttps://github.com/deepspeedai/DeepSpeed/blob/master/.gitignore
.gitignorehttps://github.com/deepspeedai/DeepSpeed/blob/master/.gitignore
.gitmoduleshttps://github.com/deepspeedai/DeepSpeed/blob/master/.gitmodules
.gitmoduleshttps://github.com/deepspeedai/DeepSpeed/blob/master/.gitmodules
.pre-commit-config.yamlhttps://github.com/deepspeedai/DeepSpeed/blob/master/.pre-commit-config.yaml
.pre-commit-config.yamlhttps://github.com/deepspeedai/DeepSpeed/blob/master/.pre-commit-config.yaml
.pylintrchttps://github.com/deepspeedai/DeepSpeed/blob/master/.pylintrc
.pylintrchttps://github.com/deepspeedai/DeepSpeed/blob/master/.pylintrc
.readthedocs.ymlhttps://github.com/deepspeedai/DeepSpeed/blob/master/.readthedocs.yml
.readthedocs.ymlhttps://github.com/deepspeedai/DeepSpeed/blob/master/.readthedocs.yml
.style.yapfhttps://github.com/deepspeedai/DeepSpeed/blob/master/.style.yapf
.style.yapfhttps://github.com/deepspeedai/DeepSpeed/blob/master/.style.yapf
CODEOWNERShttps://github.com/deepspeedai/DeepSpeed/blob/master/CODEOWNERS
CODEOWNERShttps://github.com/deepspeedai/DeepSpeed/blob/master/CODEOWNERS
CODE_OF_CONDUCT.mdhttps://github.com/deepspeedai/DeepSpeed/blob/master/CODE_OF_CONDUCT.md
CODE_OF_CONDUCT.mdhttps://github.com/deepspeedai/DeepSpeed/blob/master/CODE_OF_CONDUCT.md
COMMITTERS.mdhttps://github.com/deepspeedai/DeepSpeed/blob/master/COMMITTERS.md
COMMITTERS.mdhttps://github.com/deepspeedai/DeepSpeed/blob/master/COMMITTERS.md
CONTRIBUTING.mdhttps://github.com/deepspeedai/DeepSpeed/blob/master/CONTRIBUTING.md
CONTRIBUTING.mdhttps://github.com/deepspeedai/DeepSpeed/blob/master/CONTRIBUTING.md
GOVERNANCE.mdhttps://github.com/deepspeedai/DeepSpeed/blob/master/GOVERNANCE.md
GOVERNANCE.mdhttps://github.com/deepspeedai/DeepSpeed/blob/master/GOVERNANCE.md
LICENSEhttps://github.com/deepspeedai/DeepSpeed/blob/master/LICENSE
LICENSEhttps://github.com/deepspeedai/DeepSpeed/blob/master/LICENSE
MANIFEST.inhttps://github.com/deepspeedai/DeepSpeed/blob/master/MANIFEST.in
MANIFEST.inhttps://github.com/deepspeedai/DeepSpeed/blob/master/MANIFEST.in
MANIFEST_win.inhttps://github.com/deepspeedai/DeepSpeed/blob/master/MANIFEST_win.in
MANIFEST_win.inhttps://github.com/deepspeedai/DeepSpeed/blob/master/MANIFEST_win.in
Makefilehttps://github.com/deepspeedai/DeepSpeed/blob/master/Makefile
Makefilehttps://github.com/deepspeedai/DeepSpeed/blob/master/Makefile
README.mdhttps://github.com/deepspeedai/DeepSpeed/blob/master/README.md
README.mdhttps://github.com/deepspeedai/DeepSpeed/blob/master/README.md
SECURITY.mdhttps://github.com/deepspeedai/DeepSpeed/blob/master/SECURITY.md
SECURITY.mdhttps://github.com/deepspeedai/DeepSpeed/blob/master/SECURITY.md
build_win.bathttps://github.com/deepspeedai/DeepSpeed/blob/master/build_win.bat
build_win.bathttps://github.com/deepspeedai/DeepSpeed/blob/master/build_win.bat
environment.ymlhttps://github.com/deepspeedai/DeepSpeed/blob/master/environment.yml
environment.ymlhttps://github.com/deepspeedai/DeepSpeed/blob/master/environment.yml
install.shhttps://github.com/deepspeedai/DeepSpeed/blob/master/install.sh
install.shhttps://github.com/deepspeedai/DeepSpeed/blob/master/install.sh
setup.cfghttps://github.com/deepspeedai/DeepSpeed/blob/master/setup.cfg
setup.cfghttps://github.com/deepspeedai/DeepSpeed/blob/master/setup.cfg
setup.pyhttps://github.com/deepspeedai/DeepSpeed/blob/master/setup.py
setup.pyhttps://github.com/deepspeedai/DeepSpeed/blob/master/setup.py
version.txthttps://github.com/deepspeedai/DeepSpeed/blob/master/version.txt
version.txthttps://github.com/deepspeedai/DeepSpeed/blob/master/version.txt
READMEhttps://github.com/deepspeedai/DeepSpeed
Code of conducthttps://github.com/deepspeedai/DeepSpeed
Contributinghttps://github.com/deepspeedai/DeepSpeed
Apache-2.0 licensehttps://github.com/deepspeedai/DeepSpeed
Securityhttps://github.com/deepspeedai/DeepSpeed
https://github.com/deepspeedai/DeepSpeed/blob/master/LICENSE
https://pypi.org/project/deepspeed/
https://pepy.tech/project/deepspeed
https://github.com/deepspeedai/DeepSpeed#build-pipeline-status
https://www.bestpractices.dev/projects/9530
https://twitter.com/intent/follow?screen_name=DeepSpeedAI
https://twitter.com/DeepSpeedAI_JP
https://www.zhihu.com/people/deepspeed
https://join.slack.com/t/deepspeedworkspace/shared_invite/zt-3a8pjd8dd-PCj2hMvR4Y2syPwVnjEoww
https://github.com/deepspeedai/DeepSpeed/blob/master/docs/assets/images/DeepSpeed_light.svg#gh-light-mode-only
https://github.com/deepspeedai/DeepSpeed/blob/master/docs/assets/images/DeepSpeed_dark_transparent.svg#gh-dark-mode-only
https://github.com/deepspeedai/DeepSpeed#latest-news
DeepSpeed Core API updates: PyTorch-style backward and low-precision master stateshttps://github.com/deepspeedai/DeepSpeed/blob/master/blogs/core_api_update/README.md
Ray x DeepSpeed Meetuphttps://luma.com/3wctqteh
herehttps://docs.google.com/presentation/d/1eM3mY6oW9GYkRy1Xz0iOnbbEr5T1t0JJXOM5BKtR-Ks/edit?slide=id.g38615d6b4c2_0_87#slide=id.g38615d6b4c2_0_87
SuperOffload: Unleashing the Power of Large-Scale LLM Training on Superchipshttps://pytorch.org/blog/superoffload-unleashing-the-power-of-large-scale-llm-training-on-superchips/
Study of ZenFlow and ZeRO offload performance with DeepSpeed CPU core bindinghttps://github.com/deepspeedai/DeepSpeed/blob/master/blogs/zenflow-corebinding/README.md
ZenFlow: Stall-Free Offloading Engine for LLM Traininghttps://pytorch.org/blog/zenflow-stall-free-offloading-engine-for-llm-training/
Arctic Long Sequence Training (ALST) with DeepSpeed: Scalable And Efficient Training For Multi-Million Token Sequenceshttps://www.snowflake.com/en/engineering-blog/arctic-long-sequence-training-multi-million-token-ai/
DeepNVMe: Affordable I/O scaling for Deep Learning Applicationshttps://github.com/deepspeedai/DeepSpeed/blob/master/blogs/deepnvme/06-2025/README.md
DeepCompile: Unlocking Compiler Optimization for Distributed Traininghttps://github.com/deepspeedai/DeepSpeed/blob/master/blogs/deepcompile/README.md
DeepSpeed AutoTP: Automatic Tensor Parallel Training of Hugging Face modelshttps://github.com/deepspeedai/DeepSpeed/blob/master/blogs/huggingface-tp/README.md
Ulysses-Offload: Democratizing Long Context LLM Traininghttps://github.com/deepspeedai/DeepSpeed/blob/master/blogs/ulysses-offload/README.md
https://github.com/deepspeedai/DeepSpeed#extreme-speed-and-scale-for-dl-training
DeepSpeedhttps://www.deepspeed.ai/
MT-530Bhttps://www.microsoft.com/en-us/research/blog/using-deepspeed-and-megatron-to-train-megatron-turing-nlg-530b-the-worlds-largest-and-most-powerful-generative-language-model/
BLOOMhttps://huggingface.co/blog/bloom-megatron-deepspeed
system innovationshttps://www.deepspeed.ai/training/
https://github.com/deepspeedai/DeepSpeed#deepspeed-adoption
AI at Scalehttps://www.microsoft.com/en-us/research/project/ai-at-scale/
herehttps://innovation.microsoft.com/en-us/exploring-ai-at-scale
Megatron-Turing NLG (530B)https://www.microsoft.com/en-us/research/blog/using-deepspeed-and-megatron-to-train-megatron-turing-nlg-530b-the-worlds-largest-and-most-powerful-generative-language-model/
Jurassic-1 (178B)https://uploads-ssl.webflow.com/60fd4503684b466578c0d307/61138924626a6981ee09caf6_jurassic_tech_paper.pdf
BLOOM (176B)https://huggingface.co/blog/bloom-megatron-deepspeed
GLM (130B)https://github.com/THUDM/GLM-130B
xTrimoPGLM (100B)https://www.biorxiv.org/content/10.1101/2023.07.05.547496v2
YaLM (100B)https://github.com/yandex/YaLM-100B
GPT-NeoX (20B)https://github.com/EleutherAI/gpt-neox
AlexaTM (20B)https://www.amazon.science/blog/20b-parameter-alexa-model-sets-new-marks-in-few-shot-learning
Turing NLG (17B)https://www.microsoft.com/en-us/research/blog/turing-nlg-a-17-billion-parameter-language-model-by-microsoft/
METRO-LM (5.4B)https://arxiv.org/pdf/2204.06644.pdf
https://github.com/deepspeedai/DeepSpeed/blob/master/docs/assets/images/transformers-light.png#gh-light-mode-only
https://github.com/deepspeedai/DeepSpeed/blob/master/docs/assets/images/transformers-dark.png#gh-dark-mode-only
Transformers with DeepSpeedhttps://huggingface.co/docs/transformers/deepspeed
https://github.com/deepspeedai/DeepSpeed/blob/master/docs/assets/images/accelerate-light.png#gh-light-mode-only
https://github.com/deepspeedai/DeepSpeed/blob/master/docs/assets/images/accelerate-dark.png#gh-dark-mode-only
Accelerate with DeepSpeedhttps://huggingface.co/docs/accelerate/usage_guides/deepspeed
https://github.com/deepspeedai/DeepSpeed/blob/master/docs/assets/images/lightning-light.svg#gh-light-mode-only
https://github.com/deepspeedai/DeepSpeed/blob/master/docs/assets/images/lightning-dark.svg#gh-dark-mode-only
Lightning with DeepSpeedhttps://lightning.ai/docs/pytorch/stable/advanced/model_parallel.html#deepspeed
https://github.com/deepspeedai/DeepSpeed/blob/master/docs/assets/images/mosaicml.svg
MosaicML with DeepSpeedhttps://docs.mosaicml.com/projects/composer/en/latest/trainer/using_the_trainer.html?highlight=deepspeed#deepspeed-integration
https://github.com/deepspeedai/DeepSpeed/blob/master/docs/assets/images/determined.svg
Determined with DeepSpeedhttps://docs.determined.ai/latest/training/apis-howto/deepspeed/overview.html
https://user-images.githubusercontent.com/58739961/187154444-fce76639-ac8d-429b-9354-c6fac64b7ef8.jpg
MMEngine with DeepSpeedhttps://mmengine.readthedocs.io/en/latest/common_usage/large_model_training.html#deepspeed
https://github.com/deepspeedai/DeepSpeed#build-pipeline-status
https://github.com/deepspeedai/DeepSpeed/actions/workflows/nv-torch-latest-v100.yml
https://github.com/deepspeedai/DeepSpeed/actions/workflows/nv-inference.yml
https://github.com/deepspeedai/DeepSpeed/actions/workflows/nv-nightly.yml
https://github.com/deepspeedai/DeepSpeed/actions/workflows/amd-mi200.yml
https://github.com/deepspeedai/DeepSpeed/actions/workflows/cpu-torch-latest.yml
https://github.com/deepspeedai/DeepSpeed/actions/workflows/hpu-gaudi2.yml
https://github.com/deepspeedai/DeepSpeed/actions/workflows/xpu-max1100.yml
https://github.com/deepspeedai/DeepSpeed/actions/workflows/nv-torch-nightly-v100.yml
https://github.com/deepspeedai/DeepSpeed/actions/workflows/nv-transformers-v100.yml
https://github.com/deepspeedai/DeepSpeed/actions/workflows/nv-lightning-v100.yml
https://github.com/deepspeedai/DeepSpeed/actions/workflows/nv-accelerate-v100.yml
https://github.com/deepspeedai/DeepSpeed/actions/workflows/nv-mii.yml
https://github.com/deepspeedai/DeepSpeed/actions/workflows/nv-ds-chat.yml
https://github.com/deepspeedai/DeepSpeed/actions/workflows/nv-sd.yml
https://github.com/deepspeedai/DeepSpeed/actions/workflows/formatting.yml
https://github.com/deepspeedai/DeepSpeed/actions/workflows/pages/pages-build-deployment
https://deepspeed.readthedocs.io/en/latest/?badge=latest
https://github.com/deepspeedai/DeepSpeed/actions/workflows/python.yml
https://github.com/Ascend/Ascend-CI/actions/workflows/deepspeed.yaml
https://github.com/deepspeedai/DeepSpeed#installation
torch's JIT C++ extension loader that relies on ninjahttps://pytorch.org/docs/stable/cpp_extension.html
https://github.com/deepspeedai/DeepSpeed#requirements
PyTorchhttps://pytorch.org/
nvcchttps://docs.nvidia.com/cuda/cuda-compiler-driver-nvcc/#introduction
hipcchttps://github.com/ROCm-Developer-Tools/HIPCC
https://github.com/deepspeedai/DeepSpeed#contributed-hw-support
https://github.com/deepspeedai/DeepSpeed#pypi
PyPIhttps://pypi.org/project/deepspeed/
advanced installation instructionshttps://www.deepspeed.ai/tutorials/advanced-install/
https://github.com/deepspeedai/DeepSpeed#windows
herehttps://github.com/deepspeedai/DeepSpeed/tree/master/blogs/windows/08-2024/README.md
https://github.com/deepspeedai/DeepSpeed#further-reading
deepspeed.aihttps://www.deepspeed.ai/
Getting Startedhttps://www.deepspeed.ai/getting-started/
DeepSpeed JSON Configurationhttps://www.deepspeed.ai/docs/config-json/
API Documentationhttps://deepspeed.readthedocs.io/en/latest/
Tutorialshttps://www.deepspeed.ai/tutorials/
Blogshttps://www.deepspeed.ai/posts/
https://github.com/deepspeedai/DeepSpeed#ci-funding
https://modal.comhttps://modal.com
https://github.com/deepspeedai/DeepSpeed#contributing
contributinghttps://github.com/deepspeedai/DeepSpeed/blob/master/CONTRIBUTING.md
https://github.com/deepspeedai/DeepSpeed/graphs/contributors
https://github.com/deepspeedai/DeepSpeed#contributor-license-agreement
https://cla.opensource.microsoft.comhttps://cla.opensource.microsoft.com
https://github.com/deepspeedai/DeepSpeed#code-of-conduct
Microsoft Open Source Code of Conducthttps://opensource.microsoft.com/codeofconduct/
Code of Conduct FAQhttps://opensource.microsoft.com/codeofconduct/faq/
https://github.com/deepspeedai/DeepSpeed#publications
arXiv:1910.02054https://arxiv.org/abs/1910.02054
In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC '20)https://dl.acm.org/doi/10.5555/3433701.3433727
In Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining (KDD '20, Tutorial)https://dl.acm.org/doi/10.1145/3394486.3406703
arXiv:2010.13369https://arxiv.org/abs/2010.13369
NeurIPS 2020https://proceedings.neurips.cc/paper/2020/hash/a1140a3d0df1c81e24ae954d935e8926-Abstract.html
arXiv:2101.06840https://arxiv.org/abs/2101.06840
USENIX ATC 2021https://www.usenix.org/conference/atc21/presentation/ren-jie
[paper]https://arxiv.org/abs/2101.06840
[slides]https://www.usenix.org/system/files/atc21_slides_ren-jie.pdf
[blog]https://www.microsoft.com/en-us/research/blog/deepspeed-extreme-scale-model-training-for-everyone/
arXiv:2102.02888https://arxiv.org/abs/2102.02888
ICML 2021http://proceedings.mlr.press/v139/tang21a.html
arXiv:2104.07857https://arxiv.org/abs/2104.07857
SC 2021https://dl.acm.org/doi/abs/10.1145/3458817.3476205
[paper]https://arxiv.org/abs/2104.07857
[slides]https://github.com/deepspeedai/DeepSpeed/blob/master/docs/assets/files/SC21-ZeRO-Infinity.pdf
[blog]https://www.microsoft.com/en-us/research/blog/zero-infinity-and-deepspeed-unlocking-unprecedented-model-scale-for-deep-learning-training/
arXiv:2104.06069https://arxiv.org/abs/2104.06069
HiPC 2022https://hipc.org/advance-program/
arXiv:2108.06084https://arxiv.org/abs/2108.06084
NeurIPS 2022https://openreview.net/forum?id=JpZ5du_Kdh
arXiv:2202.06009https://arxiv.org/abs/2202.06009
arXiv:2201.05596https://arxiv.org/abs/2201.05596
ICML 2022https://proceedings.mlr.press/v162/rajbhandari22a.html
[pdf]https://arxiv.org/abs/2201.05596
[slides]https://github.com/deepspeedai/DeepSpeed/blob/master/docs/assets/files/ICML-5mins.pdf
[blog]https://www.microsoft.com/en-us/research/blog/deepspeed-advancing-moe-inference-and-training-to-power-next-generation-ai-scale/
arXiv:2201.11990https://arxiv.org/abs/2201.11990
arXiv:2206.01859https://arxiv.org/abs/2206.01859
NeurIPS 2022https://openreview.net/forum?id=xNeAhc2CNAl
arXiv:2206.01861https://arxiv.org/abs/2206.01861
NeurIPS 2022https://openreview.net/forum?id=f-fVCElZ-G1
[slides]https://github.com/deepspeedai/DeepSpeed/blob/master/docs/assets/files/zeroquant_series.pdf
[blog]https://www.microsoft.com/en-us/research/blog/deepspeed-compression-a-composable-library-for-extreme-compression-and-zero-cost-quantization/
arXiv:2207.00032https://arxiv.org/abs/2207.00032
SC 2022https://dl.acm.org/doi/abs/10.5555/3571885.3571946
[paper]https://arxiv.org/abs/2207.00032
[slides]https://github.com/deepspeedai/DeepSpeed/blob/master/docs/assets/files/sc22-ds-inference.pdf
[blog]https://www.microsoft.com/en-us/research/blog/deepspeed-accelerating-large-scale-model-inference-and-training-via-system-optimizations-and-compression/
arXiv:2211.11586https://arxiv.org/abs/2211.11586
arXiv:2212.03597https://arxiv.org/abs/2212.03597
ENLSP2023 Workshop at NeurIPS2023https://neurips2023-enlsp.github.io/
arXiv:2301.12017https://arxiv.org/abs/2301.12017
ICML2023https://icml.cc/Conferences/2023
ICLR:2023https://openreview.net/forum?id=Pgtn4l6eKjv
arXiv:2303.07226https://arxiv.org/abs/2303.07226
Finding at EMNLP2023https://2023.emnlp.org/
arXiv:2303.08374https://arxiv.org/abs/2303.08374
arXiv:2303.06318https://arxiv.org/abs/2303.06318
ICS 2023https://dl.acm.org/doi/10.1145/3577193.3593704
arXiv:2306.10209https://arxiv.org/abs/2306.10209
ML for Sys Workshop at NeurIPS2023http://mlforsystems.org/
[blog]https://www.microsoft.com/en-us/research/blog/deepspeed-zero-a-leap-in-speed-for-llm-and-chat-model-training-with-4x-less-communication/
arXiv:2303.08302https://arxiv.org/abs/2303.08302
ENLSP2023 Workshop at NeurIPS2023https://neurips2023-enlsp.github.io/
[slides]https://github.com/deepspeedai/DeepSpeed/blob/master/docs/assets/files/zeroquant_series.pdf
arXiv:2305.09847https://arxiv.org/abs/2305.09847
arXiv:2308.01320https://arxiv.org/abs/2308.01320
arXiv:2307.09782https://arxiv.org/abs/2307.09782
ENLSP2023 Workshop at NeurIPS2023https://neurips2023-enlsp.github.io/
[slides]https://github.com/deepspeedai/DeepSpeed/blob/master/docs/assets/files/zeroquant_series.pdf
arXiv:2309.14327https://arxiv.org/pdf/2309.14327.pdf
arXiv:2310.04610https://arxiv.org/abs/2310.04610
[blog]https://www.microsoft.com/en-us/research/blog/announcing-the-deepspeed4science-initiative-enabling-large-scale-scientific-discovery-through-sophisticated-ai-system-technologies/
arXiv:2310.17723https://arxiv.org/abs/2310.17723
arXiv:2312.08583https://arxiv.org/abs/2312.08583
arXiv:2401.14112https://arxiv.org/abs/2401.14112
System Optimizations for Enabling Training of Extreme Long Sequence Transformer Modelshttps://dl.acm.org/doi/10.1145/3662158.3662806
arXiv:2406.18820https://arxiv.org/abs/2406.18820
arXiv:2506.13996https://arxiv.org/abs/2506.13996
arXiv:2505.12242https://arxiv.org/abs/2505.12242
arxivhttps://arxiv.org/abs/2509.21271
ASPLOS 2026https://www.asplos-conference.org/asplos2026
https://github.com/deepspeedai/DeepSpeed#videos
Overviewhttps://www.youtube.com/watch?v=CaseqC45DNc&list=PLa85ZdUjfWS21mgibJ2vCvLziprjpKoW0&index=29
ZeRO + large model traininghttps://www.youtube.com/watch?v=y4_bCiAsIAk&list=PLa85ZdUjfWS21mgibJ2vCvLziprjpKoW0&index=28
17B T-NLG demohttps://www.youtube.com/watch?v=9V-ZbP92drg&list=PLa85ZdUjfWS21mgibJ2vCvLziprjpKoW0&index=27
Fastest BERT training + RScan tuninghttps://www.youtube.com/watch?v=o1K-ZG9F6u0&list=PLa85ZdUjfWS21mgibJ2vCvLziprjpKoW0&index=26
part 1https://www.youtube.com/watch?v=_NOk-mBwDYg&list=PLa85ZdUjfWS21mgibJ2vCvLziprjpKoW0&index=92
part 2https://www.youtube.com/watch?v=sG6_c4VXLww&list=PLa85ZdUjfWS21mgibJ2vCvLziprjpKoW0&index=94
part 3https://www.youtube.com/watch?v=k9yPkBTayos&list=PLa85ZdUjfWS21mgibJ2vCvLziprjpKoW0&index=93
FAQhttps://www.youtube.com/watch?v=nsHu6vEgPew&list=PLa85ZdUjfWS21mgibJ2vCvLziprjpKoW0&index=24
ZeRO & Fastest BERT: Increasing the scale and speed of deep learning training in DeepSpeedhttps://note.microsoft.com/MSR-Webinar-DeepSpeed-Registration-On-Demand.html
DeepSpeed on AzureMLhttps://youtu.be/yBVXR8G8Bg8
Large Model Training and Inference with DeepSpeed // Samyam Rajbhandari // LLMs in Prod Conferencehttps://www.youtube.com/watch?v=cntxC3g22oU
[slides]https://github.com/deepspeedai/DeepSpeed/blob/master/docs/assets/files/presentation-mlops.pdf
DeepSpeed: All the tricks to scale to gigantic models (Mark Saroufim)https://www.youtube.com/watch?v=pDGI668pNg0
Turing-NLG, DeepSpeed and the ZeRO optimizer (Yannic Kilcher)https://www.youtube.com/watch?v=tC01FRB0M7w
Ultimate Guide To Scaling ML Models (The AI Epiphany)https://www.youtube.com/watch?v=hc0u4avAkuM
www.deepspeed.ai/https://www.deepspeed.ai/
machine-learning https://github.com/topics/machine-learning
compression https://github.com/topics/compression
deep-learning https://github.com/topics/deep-learning
gpu https://github.com/topics/gpu
inference https://github.com/topics/inference
pytorch https://github.com/topics/pytorch
zero https://github.com/topics/zero
data-parallelism https://github.com/topics/data-parallelism
model-parallelism https://github.com/topics/model-parallelism
mixture-of-experts https://github.com/topics/mixture-of-experts
pipeline-parallelism https://github.com/topics/pipeline-parallelism
billion-parameters https://github.com/topics/billion-parameters
trillion-parameters https://github.com/topics/trillion-parameters
Readme https://github.com/deepspeedai/DeepSpeed#readme-ov-file
Apache-2.0 license https://github.com/deepspeedai/DeepSpeed#Apache-2.0-1-ov-file
Code of conduct https://github.com/deepspeedai/DeepSpeed#coc-ov-file
Contributing https://github.com/deepspeedai/DeepSpeed#contributing-ov-file
Security policy https://github.com/deepspeedai/DeepSpeed#security-ov-file
Please reload this pagehttps://github.com/deepspeedai/DeepSpeed
Activityhttps://github.com/deepspeedai/DeepSpeed/activity
Custom propertieshttps://github.com/deepspeedai/DeepSpeed/custom-properties
41.2k starshttps://github.com/deepspeedai/DeepSpeed/stargazers
350 watchinghttps://github.com/deepspeedai/DeepSpeed/watchers
4.7k forkshttps://github.com/deepspeedai/DeepSpeed/forks
Report repository https://github.com/contact/report-content?content_url=https%3A%2F%2Fgithub.com%2Fdeepspeedai%2FDeepSpeed&report=deepspeedai+%28user%29
Releases 104https://github.com/deepspeedai/DeepSpeed/releases
v0.18.4 Patch Release Latest Jan 7, 2026 https://github.com/deepspeedai/DeepSpeed/releases/tag/v0.18.4
+ 103 releaseshttps://github.com/deepspeedai/DeepSpeed/releases
Packages 0https://github.com/orgs/deepspeedai/packages?repo_name=DeepSpeed
Used by 14.6khttps://github.com/deepspeedai/DeepSpeed/network/dependents
+ 14,639 https://github.com/deepspeedai/DeepSpeed/network/dependents
Contributors 463https://github.com/deepspeedai/DeepSpeed/graphs/contributors
https://github.com/jeffra
https://github.com/loadams
https://github.com/mrwyattii
https://github.com/stas00
https://github.com/ShadenSmith
https://github.com/tohtana
https://github.com/RezaYazdaniAminabadi
https://github.com/awan-10
https://github.com/sfc-gh-truwase
https://github.com/conglongli
https://github.com/lekurile
https://github.com/delock
https://github.com/cmikeh2
https://github.com/samyam
+ 449 contributorshttps://github.com/deepspeedai/DeepSpeed/graphs/contributors
Python 72.7% https://github.com/deepspeedai/DeepSpeed/search?l=python
C++ 18.0% https://github.com/deepspeedai/DeepSpeed/search?l=c%2B%2B
Cuda 8.5% https://github.com/deepspeedai/DeepSpeed/search?l=cuda
C 0.4% https://github.com/deepspeedai/DeepSpeed/search?l=c
Shell 0.3% https://github.com/deepspeedai/DeepSpeed/search?l=shell
Dockerfile 0.1% https://github.com/deepspeedai/DeepSpeed/search?l=dockerfile
https://github.com
Termshttps://docs.github.com/site-policy/github-terms/github-terms-of-service
Privacyhttps://docs.github.com/site-policy/privacy-policies/github-privacy-statement
Securityhttps://github.com/security
Statushttps://www.githubstatus.com/
Communityhttps://github.community/
Docshttps://docs.github.com/
Contacthttps://support.github.com?tags=dotcom-footer

Viewport: width=device-width


URLs of crawlers that visited me.