René's URL Explorer Experiment


Title: GitHub - schittli/DeepSpeed: DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective.

Description: DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective. - schittli/DeepSpeed

Mail addresses
opencode@microsoft.com

Opengraph URL: https://github.com/schittli/DeepSpeed

X: @github

direct link

Domain: patch-diff.githubusercontent.com

route-pattern: /:user_id/:repository
route-controller: files
route-action: disambiguate
hovercard-subject-tag: repository:378671319
github-keyboard-shortcuts: repository,copilot
google-site-verification: Apib7-x98H0j5cPqHWwSMm6dNU4GmODRoqxLiDzdx9I
octolytics-url: https://collector.github.com/github/collect
analytics-location: //
fb:app_id: 1401488693436528
apple-itunes-app: app-id=1477376905, app-argument=https://github.com/schittli/DeepSpeed
twitter:image: https://opengraph.githubassets.com/695a4a488926876f326e424aede49bfed8358b2d0737f4ef001edea87ca78937/schittli/DeepSpeed
twitter:card: summary_large_image
og:image: https://opengraph.githubassets.com/695a4a488926876f326e424aede49bfed8358b2d0737f4ef001edea87ca78937/schittli/DeepSpeed
og:image:alt: DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective. - schittli/DeepSpeed
og:image:width: 1200
og:image:height: 600
og:site_name: GitHub
og:type: object
hostname: github.com
expected-hostname: github.com
None: 60279d4097367e16897439d16d6bbe4180663db828c666eeed2656988ffe59f6
turbo-cache-control: no-preview
go-import: github.com/schittli/DeepSpeed git https://github.com/schittli/DeepSpeed.git
octolytics-dimension-user_id: 8282673
octolytics-dimension-user_login: schittli
octolytics-dimension-repository_id: 378671319
octolytics-dimension-repository_nwo: schittli/DeepSpeed
octolytics-dimension-repository_public: true
octolytics-dimension-repository_is_fork: true
octolytics-dimension-repository_parent_id: 235860204
octolytics-dimension-repository_parent_nwo: deepspeedai/DeepSpeed
octolytics-dimension-repository_network_root_id: 235860204
octolytics-dimension-repository_network_root_nwo: deepspeedai/DeepSpeed
turbo-body-classes: logged-out env-production page-responsive
disable-turbo: false
browser-stats-url: https://api.github.com/_private/browser/stats
browser-errors-url: https://api.github.com/_private/browser/errors
release: 7c85641c598ad130c74f7bcc27f58575cac69551
ui-target: full
theme-color: #1e2327
color-scheme: light dark

Links:

schittli https://patch-diff.githubusercontent.com/schittli
DeepSpeed https://patch-diff.githubusercontent.com/schittli/DeepSpeed
deepspeedai/DeepSpeed https://patch-diff.githubusercontent.com/deepspeedai/DeepSpeed
www.deepspeed.ai/ https://www.deepspeed.ai/
MIT license https://patch-diff.githubusercontent.com/schittli/DeepSpeed/blob/master/LICENSE
1 star https://patch-diff.githubusercontent.com/schittli/DeepSpeed/stargazers
4.7k forks https://patch-diff.githubusercontent.com/schittli/DeepSpeed/forks
Branches https://patch-diff.githubusercontent.com/schittli/DeepSpeed/branches
Tags https://patch-diff.githubusercontent.com/schittli/DeepSpeed/tags
Activity https://patch-diff.githubusercontent.com/schittli/DeepSpeed/activity
Code https://patch-diff.githubusercontent.com/schittli/DeepSpeed
Pull requests 0 https://patch-diff.githubusercontent.com/schittli/DeepSpeed/pulls
Actions https://patch-diff.githubusercontent.com/schittli/DeepSpeed/actions
Projects 0 https://patch-diff.githubusercontent.com/schittli/DeepSpeed/projects
Security 0 https://patch-diff.githubusercontent.com/schittli/DeepSpeed/security
Insights https://patch-diff.githubusercontent.com/schittli/DeepSpeed/pulse
603 Commits https://patch-diff.githubusercontent.com/schittli/DeepSpeed/commits/master/
.github/workflows https://patch-diff.githubusercontent.com/schittli/DeepSpeed/tree/master/.github/workflows
DeepSpeedExamples @ 25d73cf https://patch-diff.githubusercontent.com/microsoft/DeepSpeedExamples/tree/25d73cf73fb3dc66faefa141b7319526555be9fc
azure https://patch-diff.githubusercontent.com/schittli/DeepSpeed/tree/master/azure
bin https://patch-diff.githubusercontent.com/schittli/DeepSpeed/tree/master/bin
csrc https://patch-diff.githubusercontent.com/schittli/DeepSpeed/tree/master/csrc
deepspeed https://patch-diff.githubusercontent.com/schittli/DeepSpeed/tree/master/deepspeed
docker https://patch-diff.githubusercontent.com/schittli/DeepSpeed/tree/master/docker
docs https://patch-diff.githubusercontent.com/schittli/DeepSpeed/tree/master/docs
op_builder https://patch-diff.githubusercontent.com/schittli/DeepSpeed/tree/master/op_builder
requirements https://patch-diff.githubusercontent.com/schittli/DeepSpeed/tree/master/requirements
tests https://patch-diff.githubusercontent.com/schittli/DeepSpeed/tree/master/tests
.clang-format https://patch-diff.githubusercontent.com/schittli/DeepSpeed/blob/master/.clang-format
.gitignore https://patch-diff.githubusercontent.com/schittli/DeepSpeed/blob/master/.gitignore
.gitmodules https://patch-diff.githubusercontent.com/schittli/DeepSpeed/blob/master/.gitmodules
.pre-commit-config.yaml https://patch-diff.githubusercontent.com/schittli/DeepSpeed/blob/master/.pre-commit-config.yaml
.pylintrc https://patch-diff.githubusercontent.com/schittli/DeepSpeed/blob/master/.pylintrc
.readthedocs.yml https://patch-diff.githubusercontent.com/schittli/DeepSpeed/blob/master/.readthedocs.yml
.style.yapf https://patch-diff.githubusercontent.com/schittli/DeepSpeed/blob/master/.style.yapf
CODEOWNERS https://patch-diff.githubusercontent.com/schittli/DeepSpeed/blob/master/CODEOWNERS
CODE_OF_CONDUCT.md https://patch-diff.githubusercontent.com/schittli/DeepSpeed/blob/master/CODE_OF_CONDUCT.md
CONTRIBUTING.md https://patch-diff.githubusercontent.com/schittli/DeepSpeed/blob/master/CONTRIBUTING.md
LICENSE https://patch-diff.githubusercontent.com/schittli/DeepSpeed/blob/master/LICENSE
MANIFEST.in https://patch-diff.githubusercontent.com/schittli/DeepSpeed/blob/master/MANIFEST.in
README.md https://patch-diff.githubusercontent.com/schittli/DeepSpeed/blob/master/README.md
SECURITY.md https://patch-diff.githubusercontent.com/schittli/DeepSpeed/blob/master/SECURITY.md
install.sh https://patch-diff.githubusercontent.com/schittli/DeepSpeed/blob/master/install.sh
setup.cfg https://patch-diff.githubusercontent.com/schittli/DeepSpeed/blob/master/setup.cfg
setup.py https://patch-diff.githubusercontent.com/schittli/DeepSpeed/blob/master/setup.py
version.txt https://patch-diff.githubusercontent.com/schittli/DeepSpeed/blob/master/version.txt
https://github.com/microsoft/DeepSpeed/actions
https://pypi.org/project/deepspeed/
https://deepspeed.readthedocs.io/en/latest/?badge=latest
https://github.com/Microsoft/DeepSpeed/blob/master/LICENSE
https://pepy.tech/project/deepspeed
SDE 2 https://careers.microsoft.com/us/en/job/1013160/Software-Engineer-2
Sr. SDE https://careers.microsoft.com/us/en/job/1017151/Senior-Software-Engineer
Sr. Researcher https://careers.microsoft.com/us/en/job/1016440/Senior-Researcher
https://patch-diff.githubusercontent.com/schittli/DeepSpeed#deepspeed-is-hiring-come-join-us-sde-2-sr-sde-sr-researcher
DeepSpeed https://www.deepspeed.ai/
Turing-NLG https://www.microsoft.com/en-us/research/blog/turing-nlg-a-17-billion-parameter-language-model-by-microsoft
AI at Scale https://www.microsoft.com/en-us/research/project/ai-at-scale/
here https://innovation.microsoft.com/en-us/exploring-ai-at-scale
deepspeed.ai https://www.deepspeed.ai/
https://patch-diff.githubusercontent.com/schittli/DeepSpeed#news
DeepSpeed: Accelerating large-scale model inference and training via system optimizations and compression https://www.microsoft.com/en-us/research/blog/deepspeed-accelerating-large-scale-model-inference-and-training-via-system-optimizations-and-compression/
1-bit LAMB: up to 4.6x less communication and 2.8x faster training, together with LAMB's convergence speed at large batch sizes https://www.deepspeed.ai/tutorials/onebit-lamb/
ZeRO-Infinity unlocks unprecedented model scale for deep learning training https://www.microsoft.com/en-us/research/blog/zero-infinity-and-deepspeed-unlocking-unprecedented-model-scale-for-deep-learning-training/
Tutorial on how to use different stages of ZeRO https://www.deepspeed.ai/tutorials/zero/
[DeepSpeed on AzureML] Transformers and CIFAR examples are now available on AzureML GitHub https://github.com/Azure/azureml-examples/tree/main/python-sdk/workflows/train/deepspeed
[PyTorch Lightning Blog] Accessible Multi-Billion Parameter Model Training with PyTorch Lightning + DeepSpeed https://medium.com/pytorch-lightning/accessible-multi-billion-parameter-model-training-with-pytorch-lightning-deepspeed-c9333ac3bb59
1-bit Adam v2: NCCL-based implementation and more https://www.deepspeed.ai/tutorials/onebit-adam/
ZeRO-3 Offload: Scale your models to trillion parameters without code changes while leveraging both CPUs & GPUs https://www.deepspeed.ai/news/2021/03/07/zero3-offload.html
[🤗Hugging Face Blog] Fit More and Train Faster With ZeRO via DeepSpeed and FairScale https://huggingface.co/blog/zero-deepspeed-fairscale
Simplified install, JIT compiled ops, PyPI releases, and reduced dependencies https://patch-diff.githubusercontent.com/schittli/DeepSpeed#installation
Efficient and robust compressed training through progressive layer dropping https://www.deepspeed.ai/news/2020/10/28/progressive-layer-dropping-news.html
DeepSpeed v0.3: Extreme-scale model training for everyone https://www.microsoft.com/en-us/research/blog/deepspeed-extreme-scale-model-training-for-everyone/
https://patch-diff.githubusercontent.com/schittli/DeepSpeed#table-of-contents
Why DeepSpeed? https://patch-diff.githubusercontent.com/schittli/DeepSpeed#why-deepspeed
Install https://patch-diff.githubusercontent.com/schittli/DeepSpeed#installation
Features https://patch-diff.githubusercontent.com/schittli/DeepSpeed#features
Further Reading https://patch-diff.githubusercontent.com/schittli/DeepSpeed#further-reading
Contributing https://patch-diff.githubusercontent.com/schittli/DeepSpeed#contributing
Publications https://patch-diff.githubusercontent.com/schittli/DeepSpeed#publications
Videos https://patch-diff.githubusercontent.com/schittli/DeepSpeed#videos
https://patch-diff.githubusercontent.com/schittli/DeepSpeed#why-deepspeed
https://patch-diff.githubusercontent.com/schittli/DeepSpeed#installation
torch's JIT C++ extension loader that relies on ninja https://pytorch.org/docs/stable/cpp_extension.html
PyTorch https://pytorch.org/
advanced installation instructions https://www.deepspeed.ai/tutorials/advanced-install/
https://patch-diff.githubusercontent.com/schittli/DeepSpeed#features
feature overview https://www.deepspeed.ai/features/
Distributed Training with Mixed Precision https://www.deepspeed.ai/features/#distributed-training-with-mixed-precision
Model Parallelism https://www.deepspeed.ai/features/#model-parallelism
Pipeline Parallelism https://www.deepspeed.ai/tutorials/pipeline/
The Zero Redundancy Optimizer (ZeRO) https://www.deepspeed.ai/tutorials/zero/
ZeRO-Offload https://www.deepspeed.ai/tutorials/zero-offload/
Ultra-fast dense transformer kernels https://www.deepspeed.ai/news/2020/05/18/bert-record.html
Sparse attention https://www.deepspeed.ai/news/2020/09/08/sparse-attention.html
1-bit Adam https://www.deepspeed.ai/news/2020/09/08/onebit-adam-blog-post.html
1-bit LAMB https://www.deepspeed.ai/tutorials/onebit-lamb/
Additional Memory and Bandwidth Optimizations https://www.deepspeed.ai/features/#additional-memory-and-bandwidth-optimizations
Training Features https://www.deepspeed.ai/features/#training-features
Training Optimizers https://www.deepspeed.ai/features/#training-optimizers
Training Agnostic Checkpointing https://www.deepspeed.ai/features/#training-agnostic-checkpointing
Advanced Parameter Search https://www.deepspeed.ai/features/#advanced-parameter-search
Simplified Data Loader https://www.deepspeed.ai/features/#simplified-data-loader
Performance Analysis and Debugging https://www.deepspeed.ai/features/#performance-analysis-and-debugging
https://patch-diff.githubusercontent.com/schittli/DeepSpeed#further-reading
deepspeed.ai https://www.deepspeed.ai/
DeepSpeed Features https://www.deepspeed.ai/features/
Getting Started https://www.deepspeed.ai/getting-started/
DeepSpeed JSON Configuration https://www.deepspeed.ai/docs/config-json/
API Documentation https://deepspeed.readthedocs.io/en/latest/
CIFAR-10 Tutorial https://www.deepspeed.ai/tutorials/cifar-10
Megatron-LM Tutorial https://www.deepspeed.ai/tutorials/megatron/
BERT Pre-training Tutorial https://www.deepspeed.ai/tutorials/bert-pretraining/
Learning Rate Range Test Tutorial https://www.deepspeed.ai/tutorials/lrrt/
1Cycle Tutorial https://www.deepspeed.ai/tutorials/one-cycle/
https://patch-diff.githubusercontent.com/schittli/DeepSpeed#contributing
contributing https://patch-diff.githubusercontent.com/schittli/DeepSpeed/blob/master/CONTRIBUTING.md
https://patch-diff.githubusercontent.com/schittli/DeepSpeed#contributor-license-agreement
https://cla.opensource.microsoft.com
https://patch-diff.githubusercontent.com/schittli/DeepSpeed#code-of-conduct
Microsoft Open Source Code of Conduct https://opensource.microsoft.com/codeofconduct/
Code of Conduct FAQ https://opensource.microsoft.com/codeofconduct/faq/
https://patch-diff.githubusercontent.com/schittli/DeepSpeed#publications
arXiv:1910.02054 https://arxiv.org/abs/1910.02054
In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC '20) https://dl.acm.org/doi/10.5555/3433701.3433727
In Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining (KDD '20, Tutorial) https://dl.acm.org/doi/10.1145/3394486.3406703
arXiv:2010.13369 https://arxiv.org/abs/2010.13369
NeurIPS 2020 https://proceedings.neurips.cc/paper/2020/hash/a1140a3d0df1c81e24ae954d935e8926-Abstract.html
arXiv:2101.06840 https://arxiv.org/abs/2101.06840
arXiv:2102.02888 https://arxiv.org/abs/2102.02888
arXiv:2104.07857 https://arxiv.org/abs/2104.07857
arXiv:2104.06069 https://arxiv.org/abs/2104.06069
https://patch-diff.githubusercontent.com/schittli/DeepSpeed#videos
Overview https://www.youtube.com/watch?v=CaseqC45DNc&list=PLa85ZdUjfWS21mgibJ2vCvLziprjpKoW0&index=29
ZeRO + large model training https://www.youtube.com/watch?v=y4_bCiAsIAk&list=PLa85ZdUjfWS21mgibJ2vCvLziprjpKoW0&index=28
17B T-NLG demo https://www.youtube.com/watch?v=9V-ZbP92drg&list=PLa85ZdUjfWS21mgibJ2vCvLziprjpKoW0&index=27
Fastest BERT training + RScan tuning https://www.youtube.com/watch?v=o1K-ZG9F6u0&list=PLa85ZdUjfWS21mgibJ2vCvLziprjpKoW0&index=26
part 1 https://www.youtube.com/watch?v=_NOk-mBwDYg&list=PLa85ZdUjfWS21mgibJ2vCvLziprjpKoW0&index=92
part 2 https://www.youtube.com/watch?v=sG6_c4VXLww&list=PLa85ZdUjfWS21mgibJ2vCvLziprjpKoW0&index=94
part 3 https://www.youtube.com/watch?v=k9yPkBTayos&list=PLa85ZdUjfWS21mgibJ2vCvLziprjpKoW0&index=93
FAQ https://www.youtube.com/watch?v=nsHu6vEgPew&list=PLa85ZdUjfWS21mgibJ2vCvLziprjpKoW0&index=24
ZeRO & Fastest BERT: Increasing the scale and speed of deep learning training in DeepSpeed https://note.microsoft.com/MSR-Webinar-DeepSpeed-Registration-On-Demand.html
DeepSpeed on AzureML https://youtu.be/yBVXR8G8Bg8
DeepSpeed: All the tricks to scale to gigantic models https://www.youtube.com/watch?v=pDGI668pNg0
Turing-NLG, DeepSpeed and the ZeRO optimizer https://www.youtube.com/watch?v=tC01FRB0M7w
ai https://patch-diff.githubusercontent.com/topics/ai
ki https://patch-diff.githubusercontent.com/topics/ki
0 watching https://patch-diff.githubusercontent.com/schittli/DeepSpeed/watchers
0 forks https://patch-diff.githubusercontent.com/schittli/DeepSpeed/forks
Releases https://patch-diff.githubusercontent.com/schittli/DeepSpeed/releases
21 tags https://patch-diff.githubusercontent.com/schittli/DeepSpeed/tags
Packages 0 https://patch-diff.githubusercontent.com/users/schittli/packages?repo_name=DeepSpeed

Viewport: width=device-width

