René's URL Explorer Experiment


Title: GitHub - tgwrite/DeepSpeed: DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Open Graph Title: GitHub - tgwrite/DeepSpeed: DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

X Title: GitHub - tgwrite/DeepSpeed: DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Description: DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective. - tgwrite/DeepSpeed

Open Graph Description: DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective. - tgwrite/DeepSpeed

X Description: DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective. - tgwrite/DeepSpeed

Mail addresses
opencode@microsoft.com

Opengraph URL: https://github.com/tgwrite/DeepSpeed

X: @github

direct link

Domain: patch-diff.githubusercontent.com

route-pattern/:user_id/:repository
route-controllerfiles
route-actiondisambiguate
fetch-noncev2:4bc7f64b-0158-8a0a-b8b7-553a60c34e6c
current-catalog-service-hashf3abb0cc802f3d7b95fc8762b94bdcb13bf39634c40c357301c4aa1d67a256fb
request-idE824:275035:724D28:9E158C:696F3A09
html-safe-nonce17d7b1c40af9d84e333603592282b052f0ea85ed1498c13574d2779685de9c3d
visitor-payloadeyJyZWZlcnJlciI6IiIsInJlcXVlc3RfaWQiOiJFODI0OjI3NTAzNTo3MjREMjg6OUUxNThDOjY5NkYzQTA5IiwidmlzaXRvcl9pZCI6IjI3MzU5NjA1MjY1NjU2MjAyMzMiLCJyZWdpb25fZWRnZSI6ImlhZCIsInJlZ2lvbl9yZW5kZXIiOiJpYWQifQ==
visitor-hmacc061de373af5153c609677a317956cd476e073d4e5892a32bf213860fbae6f2b
hovercard-subject-tagrepository:628614301
github-keyboard-shortcutsrepository,copilot
google-site-verificationApib7-x98H0j5cPqHWwSMm6dNU4GmODRoqxLiDzdx9I
octolytics-urlhttps://collector.github.com/github/collect
analytics-location//
fb:app_id1401488693436528
apple-itunes-appapp-id=1477376905, app-argument=https://github.com/tgwrite/DeepSpeed
twitter:imagehttps://opengraph.githubassets.com/c6be35df64bf3681bca89cb86626cde265cadeb063f4dd639c864e32140258f5/tgwrite/DeepSpeed
twitter:cardsummary_large_image
og:imagehttps://opengraph.githubassets.com/c6be35df64bf3681bca89cb86626cde265cadeb063f4dd639c864e32140258f5/tgwrite/DeepSpeed
og:image:altDeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective. - tgwrite/DeepSpeed
og:image:width1200
og:image:height600
og:site_nameGitHub
og:typeobject
hostnamegithub.com
expected-hostnamegithub.com
Noneb278ad162d35332b6de714dfb005de04386c4d92df6475522bef910f491a35ee
turbo-cache-controlno-preview
go-importgithub.com/tgwrite/DeepSpeed git https://github.com/tgwrite/DeepSpeed.git
octolytics-dimension-user_id32328281
octolytics-dimension-user_logintgwrite
octolytics-dimension-repository_id628614301
octolytics-dimension-repository_nwotgwrite/DeepSpeed
octolytics-dimension-repository_publictrue
octolytics-dimension-repository_is_forktrue
octolytics-dimension-repository_parent_id235860204
octolytics-dimension-repository_parent_nwodeepspeedai/DeepSpeed
octolytics-dimension-repository_network_root_id235860204
octolytics-dimension-repository_network_root_nwodeepspeedai/DeepSpeed
turbo-body-classeslogged-out env-production page-responsive
disable-turbofalse
browser-stats-urlhttps://api.github.com/_private/browser/stats
browser-errors-urlhttps://api.github.com/_private/browser/errors
release39aed5006635ab6f45e6b77d23e73b08a00272a3
ui-targetcanary-1
theme-color#1e2327
color-schemelight dark

Links:

Skip to contenthttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed#start-of-content
https://patch-diff.githubusercontent.com/
Sign in https://patch-diff.githubusercontent.com/login?return_to=https%3A%2F%2Fgithub.com%2Ftgwrite%2FDeepSpeed
GitHub CopilotWrite better code with AIhttps://github.com/features/copilot
GitHub SparkBuild and deploy intelligent appshttps://github.com/features/spark
GitHub ModelsManage and compare promptshttps://github.com/features/models
MCP RegistryNewIntegrate external toolshttps://github.com/mcp
ActionsAutomate any workflowhttps://github.com/features/actions
CodespacesInstant dev environmentshttps://github.com/features/codespaces
IssuesPlan and track workhttps://github.com/features/issues
Code ReviewManage code changeshttps://github.com/features/code-review
GitHub Advanced SecurityFind and fix vulnerabilitieshttps://github.com/security/advanced-security
Code securitySecure your code as you buildhttps://github.com/security/advanced-security/code-security
Secret protectionStop leaks before they starthttps://github.com/security/advanced-security/secret-protection
Why GitHubhttps://github.com/why-github
Documentationhttps://docs.github.com
Bloghttps://github.blog
Changeloghttps://github.blog/changelog
Marketplacehttps://github.com/marketplace
View all featureshttps://github.com/features
Enterpriseshttps://github.com/enterprise
Small and medium teamshttps://github.com/team
Startupshttps://github.com/enterprise/startups
Nonprofitshttps://github.com/solutions/industry/nonprofits
App Modernizationhttps://github.com/solutions/use-case/app-modernization
DevSecOpshttps://github.com/solutions/use-case/devsecops
DevOpshttps://github.com/solutions/use-case/devops
CI/CDhttps://github.com/solutions/use-case/ci-cd
View all use caseshttps://github.com/solutions/use-case
Healthcarehttps://github.com/solutions/industry/healthcare
Financial serviceshttps://github.com/solutions/industry/financial-services
Manufacturinghttps://github.com/solutions/industry/manufacturing
Governmenthttps://github.com/solutions/industry/government
View all industrieshttps://github.com/solutions/industry
View all solutionshttps://github.com/solutions
AIhttps://github.com/resources/articles?topic=ai
Software Developmenthttps://github.com/resources/articles?topic=software-development
DevOpshttps://github.com/resources/articles?topic=devops
Securityhttps://github.com/resources/articles?topic=security
View all topicshttps://github.com/resources/articles
Customer storieshttps://github.com/customer-stories
Events & webinarshttps://github.com/resources/events
Ebooks & reportshttps://github.com/resources/whitepapers
Business insightshttps://github.com/solutions/executive-insights
GitHub Skillshttps://skills.github.com
Documentationhttps://docs.github.com
Customer supporthttps://support.github.com
Community forumhttps://github.com/orgs/community/discussions
Trust centerhttps://github.com/trust-center
Partnershttps://github.com/partners
GitHub SponsorsFund open source developershttps://github.com/sponsors
Security Labhttps://securitylab.github.com
Maintainer Communityhttps://maintainers.github.com
Acceleratorhttps://github.com/accelerator
Archive Programhttps://archiveprogram.github.com
Topicshttps://github.com/topics
Trendinghttps://github.com/trending
Collectionshttps://github.com/collections
Enterprise platformAI-powered developer platformhttps://github.com/enterprise
GitHub Advanced SecurityEnterprise-grade security featureshttps://github.com/security/advanced-security
Copilot for BusinessEnterprise-grade AI featureshttps://github.com/features/copilot/copilot-business
Premium SupportEnterprise-grade 24/7 supporthttps://github.com/premium-support
Pricinghttps://github.com/pricing
Search syntax tipshttps://docs.github.com/search-github/github-code-search/understanding-github-code-search-syntax
documentationhttps://docs.github.com/search-github/github-code-search/understanding-github-code-search-syntax
Sign in https://patch-diff.githubusercontent.com/login?return_to=https%3A%2F%2Fgithub.com%2Ftgwrite%2FDeepSpeed
Sign up https://patch-diff.githubusercontent.com/signup?ref_cta=Sign+up&ref_loc=header+logged+out&ref_page=%2F%3Cuser-name%3E%2F%3Crepo-name%3E&source=header-repo&source_repo=tgwrite%2FDeepSpeed
Reloadhttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed
Reloadhttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed
Reloadhttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed
tgwrite https://patch-diff.githubusercontent.com/tgwrite
DeepSpeedhttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed
deepspeedai/DeepSpeedhttps://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed
Notifications https://patch-diff.githubusercontent.com/login?return_to=%2Ftgwrite%2FDeepSpeed
Fork 0 https://patch-diff.githubusercontent.com/login?return_to=%2Ftgwrite%2FDeepSpeed
Star 0 https://patch-diff.githubusercontent.com/login?return_to=%2Ftgwrite%2FDeepSpeed
www.deepspeed.ai/https://www.deepspeed.ai/
Apache-2.0 license https://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/blob/master/LICENSE
0 stars https://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/stargazers
4.7k forks https://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/forks
Branches https://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/branches
Tags https://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/tags
Activity https://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/activity
Star https://patch-diff.githubusercontent.com/login?return_to=%2Ftgwrite%2FDeepSpeed
Notifications https://patch-diff.githubusercontent.com/login?return_to=%2Ftgwrite%2FDeepSpeed
Code https://patch-diff.githubusercontent.com/tgwrite/DeepSpeed
Pull requests 0 https://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/pulls
Actions https://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/actions
Projects 0 https://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/projects
Security Uh oh! There was an error while loading. Please reload this page. https://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/security
Please reload this pagehttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed
Insights https://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/pulse
Code https://patch-diff.githubusercontent.com/tgwrite/DeepSpeed
Pull requests https://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/pulls
Actions https://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/actions
Projects https://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/projects
Security https://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/security
Insights https://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/pulse
Brancheshttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/branches
Tagshttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/tags
https://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/branches
https://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/tags
1,494 Commitshttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/commits/master/
https://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/commits/master/
.githubhttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/tree/master/.github
.githubhttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/tree/master/.github
acceleratorhttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/tree/master/accelerator
acceleratorhttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/tree/master/accelerator
azurehttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/tree/master/azure
azurehttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/tree/master/azure
benchmarkshttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/tree/master/benchmarks
benchmarkshttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/tree/master/benchmarks
binhttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/tree/master/bin
binhttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/tree/master/bin
blogshttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/tree/master/blogs
blogshttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/tree/master/blogs
csrchttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/tree/master/csrc
csrchttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/tree/master/csrc
deepspeedhttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/tree/master/deepspeed
deepspeedhttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/tree/master/deepspeed
dockerhttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/tree/master/docker
dockerhttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/tree/master/docker
docshttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/tree/master/docs
docshttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/tree/master/docs
exampleshttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/tree/master/examples
exampleshttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/tree/master/examples
op_builderhttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/tree/master/op_builder
op_builderhttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/tree/master/op_builder
releasehttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/tree/master/release
releasehttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/tree/master/release
requirementshttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/tree/master/requirements
requirementshttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/tree/master/requirements
scriptshttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/tree/master/scripts
scriptshttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/tree/master/scripts
testshttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/tree/master/tests
testshttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/tree/master/tests
.clang-formathttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/blob/master/.clang-format
.clang-formathttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/blob/master/.clang-format
.gitignorehttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/blob/master/.gitignore
.gitignorehttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/blob/master/.gitignore
.pre-commit-config.yamlhttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/blob/master/.pre-commit-config.yaml
.pre-commit-config.yamlhttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/blob/master/.pre-commit-config.yaml
.pylintrchttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/blob/master/.pylintrc
.pylintrchttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/blob/master/.pylintrc
.readthedocs.ymlhttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/blob/master/.readthedocs.yml
.readthedocs.ymlhttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/blob/master/.readthedocs.yml
.style.yapfhttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/blob/master/.style.yapf
.style.yapfhttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/blob/master/.style.yapf
CODEOWNERShttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/blob/master/CODEOWNERS
CODEOWNERShttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/blob/master/CODEOWNERS
CODE_OF_CONDUCT.mdhttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/blob/master/CODE_OF_CONDUCT.md
CODE_OF_CONDUCT.mdhttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/blob/master/CODE_OF_CONDUCT.md
CONTRIBUTING.mdhttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/blob/master/CONTRIBUTING.md
CONTRIBUTING.mdhttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/blob/master/CONTRIBUTING.md
LICENSEhttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/blob/master/LICENSE
LICENSEhttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/blob/master/LICENSE
MANIFEST.inhttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/blob/master/MANIFEST.in
MANIFEST.inhttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/blob/master/MANIFEST.in
MANIFEST_win.inhttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/blob/master/MANIFEST_win.in
MANIFEST_win.inhttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/blob/master/MANIFEST_win.in
README.mdhttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/blob/master/README.md
README.mdhttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/blob/master/README.md
SECURITY.mdhttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/blob/master/SECURITY.md
SECURITY.mdhttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/blob/master/SECURITY.md
build_win.bathttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/blob/master/build_win.bat
build_win.bathttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/blob/master/build_win.bat
install.shhttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/blob/master/install.sh
install.shhttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/blob/master/install.sh
setup.cfghttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/blob/master/setup.cfg
setup.cfghttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/blob/master/setup.cfg
setup.pyhttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/blob/master/setup.py
setup.pyhttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/blob/master/setup.py
version.txthttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/blob/master/version.txt
version.txthttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/blob/master/version.txt
READMEhttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed
Code of conducthttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed
Contributinghttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed
Licensehttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed
Securityhttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed
https://github.com/Microsoft/DeepSpeed/blob/master/LICENSE
https://pypi.org/project/deepspeed/
https://pepy.tech/project/deepspeed
https://patch-diff.githubusercontent.com/tgwrite/DeepSpeed#build-pipeline-status
https://twitter.com/intent/follow?screen_name=MSFTDeepSpeed
https://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/blob/master/docs/assets/images/DeepSpeed_light.svg#gh-light-mode-only
https://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/blob/master/docs/assets/images/DeepSpeed_dark_transparent.svg#gh-dark-mode-only
https://patch-diff.githubusercontent.com/tgwrite/DeepSpeed#latest-news
learn howhttps://github.com/microsoft/DeepSpeed/tree/master/blogs/deepspeed-chat
DeepSpeed Chat: Easy, Fast and Affordable RLHF Training of ChatGPT-like Models at All Scaleshttps://github.com/microsoft/DeepSpeed/tree/master/blogs/deepspeed-chat
Englishhttps://github.com/microsoft/DeepSpeed/tree/master/blogs/deepspeed-chat/README.md
中文https://github.com/microsoft/DeepSpeed/tree/master/blogs/deepspeed-chat/chinese/README.md
日本語https://github.com/microsoft/DeepSpeed/tree/master/blogs/deepspeed-chat/japanese/README.md
Scaling Large-Scale Generative Mixture-of-Expert Multimodal Model With VL-MoEhttps://www.deepspeed.ai/2023/03/30/multi-modal.html
Automatic Tensor Parallelism: Enables tensor parallelism by default without an injection policyhttps://www.deepspeed.ai/tutorials/automatic-tensor-parallelism/
DeepSpeed Data Efficiency: A composable library that makes better use of data, increases training efficiency, and improves model qualityhttps://www.deepspeed.ai/2022/12/11/data-efficiency.html
Stable Diffusion Image Generation under 1 second w. DeepSpeed MIIhttps://github.com/microsoft/DeepSpeed-MII/tree/main/examples/benchmark/txt2img
DeepSpeed-MII: instant speedup on 24,000+ open-source DL models with up to 40x cheaper inferencehttps://www.deepspeed.ai/2022/10/10/mii.html
ZeRO-Inference: Democratizing massive model inferencehttps://www.deepspeed.ai/2022/09/09/zero-inference.html
Azure and DeepSpeed empower easy-to-use and high-performance model traininghttps://azure.microsoft.com/en-us/blog/azure-empowers-easytouse-highperformance-and-hyperscale-model-training-using-deepspeed/
https://patch-diff.githubusercontent.com/tgwrite/DeepSpeed#extreme-speed-and-scale-for-dl-training-and-inference
DeepSpeedhttps://www.deepspeed.ai/
MT-530Bhttps://www.microsoft.com/en-us/research/blog/using-deepspeed-and-megatron-to-train-megatron-turing-nlg-530b-the-worlds-largest-and-most-powerful-generative-language-model/
BLOOMhttps://huggingface.co/blog/bloom-megatron-deepspeed
https://patch-diff.githubusercontent.com/tgwrite/DeepSpeed#deepspeeds-three-innovation-pillars
https://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/blob/master/docs/assets/images/3pillars.png
https://patch-diff.githubusercontent.com/tgwrite/DeepSpeed#deepspeed-training
DeepSpeed-Traininghttps://www.deepspeed.ai/training/
https://patch-diff.githubusercontent.com/tgwrite/DeepSpeed#deepspeed-inference
DeepSpeed-Inferencehttps://www.deepspeed.ai/inference
https://patch-diff.githubusercontent.com/tgwrite/DeepSpeed#deepspeed-compression
DeepSpeed-Compressionhttps://www.deepspeed.ai/compression
https://patch-diff.githubusercontent.com/tgwrite/DeepSpeed#deepspeed-software-suite
https://patch-diff.githubusercontent.com/tgwrite/DeepSpeed#deepspeed-library
DeepSpeedhttps://github.com/microsoft/deepspeed
DeepSpeed Adoptionhttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed#deepspeed-adoption
https://patch-diff.githubusercontent.com/tgwrite/DeepSpeed#model-implementations-for-inference-mii
Model Implementations for Inference (MII)https://github.com/microsoft/deepspeed-mii
https://patch-diff.githubusercontent.com/tgwrite/DeepSpeed#deepspeed-on-azure
recipeshttps://github.com/Azure/azureml-examples/tree/main/v1/python-sdk/workflows/train/deepspeed
herehttps://github.com/microsoft/Megatron-DeepSpeed/tree/main/examples/azureml
Azure tutorialhttps://www.deepspeed.ai/tutorials/azure/
https://patch-diff.githubusercontent.com/tgwrite/DeepSpeed#deepspeed-adoption
AI at Scalehttps://www.microsoft.com/en-us/research/project/ai-at-scale/
herehttps://innovation.microsoft.com/en-us/exploring-ai-at-scale
Megatron-Turing NLG (530B)https://www.microsoft.com/en-us/research/blog/using-deepspeed-and-megatron-to-train-megatron-turing-nlg-530b-the-worlds-largest-and-most-powerful-generative-language-model/
Jurassic-1 (178B)https://uploads-ssl.webflow.com/60fd4503684b466578c0d307/61138924626a6981ee09caf6_jurassic_tech_paper.pdf
BLOOM (176B)https://huggingface.co/blog/bloom-megatron-deepspeed
GLM (130B)https://github.com/THUDM/GLM-130B
YaLM (100B)https://github.com/yandex/YaLM-100B
GPT-NeoX (20B)https://github.com/EleutherAI/gpt-neox
AlexaTM (20B)https://www.amazon.science/blog/20b-parameter-alexa-model-sets-new-marks-in-few-shot-learning
Turing NLG (17B)https://www.microsoft.com/en-us/research/blog/turing-nlg-a-17-billion-parameter-language-model-by-microsoft/
METRO-LM (5.4B)https://arxiv.org/pdf/2204.06644.pdf
https://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/blob/master/docs/assets/images/transformers-light.png#gh-light-mode-only
https://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/blob/master/docs/assets/images/transformers-dark.png#gh-dark-mode-only
Transformers with DeepSpeedhttps://huggingface.co/docs/transformers/main/main_classes/deepspeed
https://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/blob/master/docs/assets/images/accelerate-light.png#gh-light-mode-only
https://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/blob/master/docs/assets/images/accelerate-dark.png#gh-dark-mode-only
Accelerate with DeepSpeedhttps://huggingface.co/docs/accelerate/usage_guides/deepspeed
https://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/blob/master/docs/assets/images/lightning-light.svg#gh-light-mode-only
https://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/blob/master/docs/assets/images/lightning-dark.svg#gh-dark-mode-only
Lightning with DeepSpeedhttps://lightning.ai/docs/pytorch/stable/advanced/model_parallel.html#deepspeed
https://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/blob/master/docs/assets/images/mosaicml.svg
MosaicML with DeepSpeedhttps://docs.mosaicml.com/projects/composer/en/latest/trainer/using_the_trainer.html?highlight=deepspeed#deepspeed-integration
https://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/blob/master/docs/assets/images/determined.svg
Determined with DeepSpeedhttps://docs.determined.ai/latest/training/apis-howto/deepspeed/overview.html
https://patch-diff.githubusercontent.com/tgwrite/DeepSpeed#build-pipeline-status
https://github.com/microsoft/DeepSpeed/actions/workflows/nv-torch19-p40.yml
https://github.com/microsoft/DeepSpeed/actions/workflows/nv-torch19-v100.yml
https://github.com/microsoft/DeepSpeed/actions/workflows/nv-torch-latest-v100.yml
https://github.com/microsoft/DeepSpeed/actions/workflows/nv-inference.yml
https://github.com/microsoft/DeepSpeed/actions/workflows/nv-nightly.yml
https://github.com/microsoft/DeepSpeed/actions/workflows/amd-mi100.yml
https://github.com/microsoft/DeepSpeed/actions/workflows/amd-mi200.yml
https://github.com/microsoft/DeepSpeed/actions/workflows/nv-torch-latest-cpu.yml
https://github.com/microsoft/DeepSpeed/actions/workflows/nv-torch-nightly-v100.yml
https://github.com/microsoft/DeepSpeed/actions/workflows/nv-transformers-v100.yml
https://github.com/microsoft/DeepSpeed/actions/workflows/nv-lightning-v100.yml
https://github.com/microsoft/DeepSpeed/actions/workflows/nv-accelerate-v100.yml
https://github.com/microsoft/DeepSpeed/actions/workflows/nv-megatron.yml
https://github.com/microsoft/DeepSpeed/actions/workflows/nv-mii.yml
https://github.com/microsoft/DeepSpeed/actions/workflows/formatting.yml
https://github.com/microsoft/DeepSpeed/actions/workflows/pages/pages-build-deployment
https://deepspeed.readthedocs.io/en/latest/?badge=latest
https://github.com/microsoft/DeepSpeed/actions/workflows/python.yml
https://patch-diff.githubusercontent.com/tgwrite/DeepSpeed#installation
torch's JIT C++ extension loader that relies on ninjahttps://pytorch.org/docs/stable/cpp_extension.html
https://patch-diff.githubusercontent.com/tgwrite/DeepSpeed#requirements
PyTorchhttps://pytorch.org/
nvcchttps://docs.nvidia.com/cuda/cuda-compiler-driver-nvcc/#introduction
hipcchttps://github.com/ROCm-Developer-Tools/HIPCC
https://patch-diff.githubusercontent.com/tgwrite/DeepSpeed#pypi
PyPIhttps://pypi.org/project/deepspeed/
advanced installation instructionshttps://www.deepspeed.ai/tutorials/advanced-install/
https://patch-diff.githubusercontent.com/tgwrite/DeepSpeed#windows
https://patch-diff.githubusercontent.com/tgwrite/DeepSpeed#features
DeepSpeed-Traininghttps://www.deepspeed.ai/training
DeepSpeed-Inferencehttps://www.deepspeed.ai/inference
DeepSpeed-Compressionhttps://www.deepspeed.ai/compression
https://patch-diff.githubusercontent.com/tgwrite/DeepSpeed#further-reading
deepspeed.aihttps://www.deepspeed.ai/
Getting Startedhttps://www.deepspeed.ai/getting-started/
DeepSpeed JSON Configurationhttps://www.deepspeed.ai/docs/config-json/
API Documentationhttps://deepspeed.readthedocs.io/en/latest/
Tutorialshttps://www.deepspeed.ai/tutorials/
Blogshttps://www.deepspeed.ai/posts/
https://patch-diff.githubusercontent.com/tgwrite/DeepSpeed#contributing
contributinghttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/blob/master/CONTRIBUTING.md
https://github.com/microsoft/DeepSpeed/graphs/contributors
https://patch-diff.githubusercontent.com/tgwrite/DeepSpeed#contributor-license-agreement
https://cla.opensource.microsoft.comhttps://cla.opensource.microsoft.com
https://patch-diff.githubusercontent.com/tgwrite/DeepSpeed#code-of-conduct
Microsoft Open Source Code of Conducthttps://opensource.microsoft.com/codeofconduct/
Code of Conduct FAQhttps://opensource.microsoft.com/codeofconduct/faq/
https://patch-diff.githubusercontent.com/tgwrite/DeepSpeed#publications
arXiv:1910.02054https://arxiv.org/abs/1910.02054
In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC '20)https://dl.acm.org/doi/10.5555/3433701.3433727
In Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining (KDD '20, Tutorial)https://dl.acm.org/doi/10.1145/3394486.3406703
arXiv:2010.13369https://arxiv.org/abs/2010.13369
NeurIPS 2020https://proceedings.neurips.cc/paper/2020/hash/a1140a3d0df1c81e24ae954d935e8926-Abstract.html
arXiv:2101.06840https://arxiv.org/abs/2101.06840
USENIX ATC 2021https://www.usenix.org/conference/atc21/presentation/ren-jie
arXiv:2102.02888https://arxiv.org/abs/2102.02888
ICML 2021http://proceedings.mlr.press/v139/tang21a.html
arXiv:2104.07857https://arxiv.org/abs/2104.07857
SC 2021https://dl.acm.org/doi/abs/10.1145/3458817.3476205
arXiv:2104.06069https://arxiv.org/abs/2104.06069
HiPC 2022https://hipc.org/advance-program/
arXiv:2108.06084https://arxiv.org/abs/2108.06084
NeurIPS 2022https://openreview.net/forum?id=JpZ5du_Kdh
arXiv:2202.06009https://arxiv.org/abs/2202.06009
arXiv:2201.05596https://arxiv.org/abs/2201.05596
ICML 2022https://proceedings.mlr.press/v162/rajbhandari22a.html
arXiv:2201.11990https://arxiv.org/abs/2201.11990
arXiv:2206.01859https://arxiv.org/abs/2206.01859
NeurIPS 2022https://openreview.net/forum?id=xNeAhc2CNAl
arXiv:2206.01861https://arxiv.org/abs/2206.01861
NeurIPS 2022https://openreview.net/forum?id=f-fVCElZ-G1
arXiv:2207.00032https://arxiv.org/abs/2207.00032
SC 2022https://dl.acm.org/doi/abs/10.5555/3571885.3571946
arXiv:2211.11586https://arxiv.org/abs/2211.11586
arXiv:2212.03597https://arxiv.org/abs/2212.03597
arXiv:2301.12017https://arxiv.org/abs/2301.12017
ICLR:2023https://openreview.net/forum?id=Pgtn4l6eKjv
arXiv:2303.07226https://arxiv.org/abs/2303.07226
arXiv:2303.08374https://arxiv.org/abs/2303.08374
https://patch-diff.githubusercontent.com/tgwrite/DeepSpeed#videos
Overviewhttps://www.youtube.com/watch?v=CaseqC45DNc&list=PLa85ZdUjfWS21mgibJ2vCvLziprjpKoW0&index=29
ZeRO + large model traininghttps://www.youtube.com/watch?v=y4_bCiAsIAk&list=PLa85ZdUjfWS21mgibJ2vCvLziprjpKoW0&index=28
17B T-NLG demohttps://www.youtube.com/watch?v=9V-ZbP92drg&list=PLa85ZdUjfWS21mgibJ2vCvLziprjpKoW0&index=27
Fastest BERT training + RScan tuninghttps://www.youtube.com/watch?v=o1K-ZG9F6u0&list=PLa85ZdUjfWS21mgibJ2vCvLziprjpKoW0&index=26
part 1https://www.youtube.com/watch?v=_NOk-mBwDYg&list=PLa85ZdUjfWS21mgibJ2vCvLziprjpKoW0&index=92
part 2https://www.youtube.com/watch?v=sG6_c4VXLww&list=PLa85ZdUjfWS21mgibJ2vCvLziprjpKoW0&index=94
part 3https://www.youtube.com/watch?v=k9yPkBTayos&list=PLa85ZdUjfWS21mgibJ2vCvLziprjpKoW0&index=93
FAQhttps://www.youtube.com/watch?v=nsHu6vEgPew&list=PLa85ZdUjfWS21mgibJ2vCvLziprjpKoW0&index=24
ZeRO & Fastest BERT: Increasing the scale and speed of deep learning training in DeepSpeedhttps://note.microsoft.com/MSR-Webinar-DeepSpeed-Registration-On-Demand.html
DeepSpeed on AzureMLhttps://youtu.be/yBVXR8G8Bg8
DeepSpeed: All the tricks to scale to gigantic models (Mark Saroufim)https://www.youtube.com/watch?v=pDGI668pNg0
Turing-NLG, DeepSpeed and the ZeRO optimizer (Yannic Kilcher)https://www.youtube.com/watch?v=tC01FRB0M7w
Ultimate Guide To Scaling ML Models (The AI Epiphany)https://www.youtube.com/watch?v=hc0u4avAkuM
www.deepspeed.ai/https://www.deepspeed.ai/
Readme https://patch-diff.githubusercontent.com/tgwrite/DeepSpeed#readme-ov-file
Apache-2.0 license https://patch-diff.githubusercontent.com/tgwrite/DeepSpeed#Apache-2.0-1-ov-file
Code of conduct https://patch-diff.githubusercontent.com/tgwrite/DeepSpeed#coc-ov-file
Contributing https://patch-diff.githubusercontent.com/tgwrite/DeepSpeed#contributing-ov-file
Security policy https://patch-diff.githubusercontent.com/tgwrite/DeepSpeed#security-ov-file
Please reload this pagehttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed
Activityhttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/activity
0 starshttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/stargazers
0 watchinghttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/watchers
0 forkshttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/forks
Report repository https://patch-diff.githubusercontent.com/contact/report-content?content_url=https%3A%2F%2Fgithub.com%2Ftgwrite%2FDeepSpeed&report=tgwrite+%28user%29
Releaseshttps://patch-diff.githubusercontent.com/tgwrite/DeepSpeed/releases
Packages 0https://patch-diff.githubusercontent.com/users/tgwrite/packages?repo_name=DeepSpeed
https://github.com
Termshttps://docs.github.com/site-policy/github-terms/github-terms-of-service
Privacyhttps://docs.github.com/site-policy/privacy-policies/github-privacy-statement
Securityhttps://github.com/security
Statushttps://www.githubstatus.com/
Communityhttps://github.community/
Docshttps://docs.github.com/
Contacthttps://support.github.com?tags=dotcom-footer

Viewport: width=device-width


URLs of crawlers that visited me.