René's URL Explorer Experiment


Title: GitHub - echallenge/transformers: 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Open Graph Title: GitHub - echallenge/transformers: 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

X Title: GitHub - echallenge/transformers: 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Description: 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX. - echallenge/transformers

Open Graph Description: 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX. - echallenge/transformers

X Description: 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX. - echallenge/transformers

Open Graph URL: https://github.com/echallenge/transformers

X Site: @github

Domain: patch-diff.githubusercontent.com

route-pattern: /:user_id/:repository
route-controller: files
route-action: disambiguate
fetch-nonce: v2:3a7cd6c1-2ab9-01c2-8929-4367ae73640b
current-catalog-service-hash: f3abb0cc802f3d7b95fc8762b94bdcb13bf39634c40c357301c4aa1d67a256fb
request-id: C9BC:55A65:3F72E:515B7:697F5B96
html-safe-nonce: 13e96f1bf0a48fa82041eb3ddc01e1a407bd4c86139add02431cc388706d65b6
visitor-payload: eyJyZWZlcnJlciI6IiIsInJlcXVlc3RfaWQiOiJDOUJDOjU1QTY1OjNGNzJFOjUxNUI3OjY5N0Y1Qjk2IiwidmlzaXRvcl9pZCI6IjIzNzk4MzYxMjc3Mzg1NTExOTAiLCJyZWdpb25fZWRnZSI6ImlhZCIsInJlZ2lvbl9yZW5kZXIiOiJpYWQifQ==
visitor-hmac: 7a8b588dd61cc400537e9e09a3999129f5fd2ebd8ea9a3b0e6df66555e292469
hovercard-subject-tag: repository:566458262
github-keyboard-shortcuts: repository,copilot
google-site-verification: Apib7-x98H0j5cPqHWwSMm6dNU4GmODRoqxLiDzdx9I
octolytics-url: https://collector.github.com/github/collect
analytics-location: //
fb:app_id: 1401488693436528
apple-itunes-app: app-id=1477376905, app-argument=https://github.com/echallenge/transformers
twitter:image: https://opengraph.githubassets.com/59535837f78800ab46089086bdbfd3190eb2c3f533b0684942b9dba468a3ea65/echallenge/transformers
twitter:card: summary_large_image
og:image: https://opengraph.githubassets.com/59535837f78800ab46089086bdbfd3190eb2c3f533b0684942b9dba468a3ea65/echallenge/transformers
og:image:alt: 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX. - echallenge/transformers
og:image:width: 1200
og:image:height: 600
og:site_name: GitHub
og:type: object
hostname: github.com
expected-hostname: github.com
None: 60279d4097367e16897439d16d6bbe4180663db828c666eeed2656988ffe59f6
turbo-cache-control: no-preview
go-import: github.com/echallenge/transformers git https://github.com/echallenge/transformers.git
octolytics-dimension-user_id: 46498010
octolytics-dimension-user_login: echallenge
octolytics-dimension-repository_id: 566458262
octolytics-dimension-repository_nwo: echallenge/transformers
octolytics-dimension-repository_public: true
octolytics-dimension-repository_is_fork: true
octolytics-dimension-repository_parent_id: 155220641
octolytics-dimension-repository_parent_nwo: huggingface/transformers
octolytics-dimension-repository_network_root_id: 155220641
octolytics-dimension-repository_network_root_nwo: huggingface/transformers
turbo-body-classes: logged-out env-production page-responsive
disable-turbo: false
browser-stats-url: https://api.github.com/_private/browser/stats
browser-errors-url: https://api.github.com/_private/browser/errors
release: 7c85641c598ad130c74f7bcc27f58575cac69551
ui-target: full
theme-color: #1e2327
color-scheme: light dark
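The visitor-payload value captured above is plain Base64-encoded JSON. A minimal sketch of how to inspect it (Python; the string is copied verbatim from the meta tag above):

```python
import base64
import json

# visitor-payload meta value, copied verbatim from the captured page
payload_b64 = (
    "eyJyZWZlcnJlciI6IiIsInJlcXVlc3RfaWQiOiJDOUJDOjU1QTY1OjNGNzJFOjUxNUI3"
    "OjY5N0Y1Qjk2IiwidmlzaXRvcl9pZCI6IjIzNzk4MzYxMjc3Mzg1NTExOTAiLCJyZWdp"
    "b25fZWRnZSI6ImlhZCIsInJlZ2lvbl9yZW5kZXIiOiJpYWQifQ=="
)

payload = json.loads(base64.b64decode(payload_b64))
print(payload["request_id"])   # -> C9BC:55A65:3F72E:515B7:697F5B96 (matches the request-id meta)
print(payload["region_edge"])  # -> iad (the edge region that served this capture)
```

Note that the decoded request_id matches the request-id meta tag captured above, which confirms both fields describe the same page fetch.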

Links:

echallenge: https://patch-diff.githubusercontent.com/echallenge
transformers: https://patch-diff.githubusercontent.com/echallenge/transformers
huggingface/transformers: https://patch-diff.githubusercontent.com/huggingface/transformers
Notifications: https://patch-diff.githubusercontent.com/login?return_to=%2Fechallenge%2Ftransformers
Fork 0: https://patch-diff.githubusercontent.com/login?return_to=%2Fechallenge%2Ftransformers
Star 1: https://patch-diff.githubusercontent.com/login?return_to=%2Fechallenge%2Ftransformers
huggingface.co/transformers: https://huggingface.co/transformers
Apache-2.0 license: https://patch-diff.githubusercontent.com/echallenge/transformers/blob/main/LICENSE
1 star: https://patch-diff.githubusercontent.com/echallenge/transformers/stargazers
31.9k forks: https://patch-diff.githubusercontent.com/echallenge/transformers/forks
Branches: https://patch-diff.githubusercontent.com/echallenge/transformers/branches
Tags: https://patch-diff.githubusercontent.com/echallenge/transformers/tags
Activity: https://patch-diff.githubusercontent.com/echallenge/transformers/activity
Code: https://patch-diff.githubusercontent.com/echallenge/transformers
Pull requests 0: https://patch-diff.githubusercontent.com/echallenge/transformers/pulls
Actions: https://patch-diff.githubusercontent.com/echallenge/transformers/actions
Projects 0: https://patch-diff.githubusercontent.com/echallenge/transformers/projects
Security 0: https://patch-diff.githubusercontent.com/echallenge/transformers/security
Insights: https://patch-diff.githubusercontent.com/echallenge/transformers/pulse
11,313 Commits: https://patch-diff.githubusercontent.com/echallenge/transformers/commits/main/
.circleci: https://patch-diff.githubusercontent.com/echallenge/transformers/tree/main/.circleci
.github: https://patch-diff.githubusercontent.com/echallenge/transformers/tree/main/.github
docker: https://patch-diff.githubusercontent.com/echallenge/transformers/tree/main/docker
docs: https://patch-diff.githubusercontent.com/echallenge/transformers/tree/main/docs
examples: https://patch-diff.githubusercontent.com/echallenge/transformers/tree/main/examples
model_cards: https://patch-diff.githubusercontent.com/echallenge/transformers/tree/main/model_cards
notebooks: https://patch-diff.githubusercontent.com/echallenge/transformers/tree/main/notebooks
scripts: https://patch-diff.githubusercontent.com/echallenge/transformers/tree/main/scripts
src/transformers: https://patch-diff.githubusercontent.com/echallenge/transformers/tree/main/src/transformers
templates: https://patch-diff.githubusercontent.com/echallenge/transformers/tree/main/templates
tests: https://patch-diff.githubusercontent.com/echallenge/transformers/tree/main/tests
utils: https://patch-diff.githubusercontent.com/echallenge/transformers/tree/main/utils
.coveragerc: https://patch-diff.githubusercontent.com/echallenge/transformers/blob/main/.coveragerc
.gitattributes: https://patch-diff.githubusercontent.com/echallenge/transformers/blob/main/.gitattributes
.gitignore: https://patch-diff.githubusercontent.com/echallenge/transformers/blob/main/.gitignore
CITATION.cff: https://patch-diff.githubusercontent.com/echallenge/transformers/blob/main/CITATION.cff
CODE_OF_CONDUCT.md: https://patch-diff.githubusercontent.com/echallenge/transformers/blob/main/CODE_OF_CONDUCT.md
CONTRIBUTING.md: https://patch-diff.githubusercontent.com/echallenge/transformers/blob/main/CONTRIBUTING.md
ISSUES.md: https://patch-diff.githubusercontent.com/echallenge/transformers/blob/main/ISSUES.md
LICENSE: https://patch-diff.githubusercontent.com/echallenge/transformers/blob/main/LICENSE
MANIFEST.in: https://patch-diff.githubusercontent.com/echallenge/transformers/blob/main/MANIFEST.in
Makefile: https://patch-diff.githubusercontent.com/echallenge/transformers/blob/main/Makefile
README.md: https://patch-diff.githubusercontent.com/echallenge/transformers/blob/main/README.md
README_es.md: https://patch-diff.githubusercontent.com/echallenge/transformers/blob/main/README_es.md
README_ja.md: https://patch-diff.githubusercontent.com/echallenge/transformers/blob/main/README_ja.md
README_ko.md: https://patch-diff.githubusercontent.com/echallenge/transformers/blob/main/README_ko.md
README_zh-hans.md: https://patch-diff.githubusercontent.com/echallenge/transformers/blob/main/README_zh-hans.md
README_zh-hant.md: https://patch-diff.githubusercontent.com/echallenge/transformers/blob/main/README_zh-hant.md
conftest.py: https://patch-diff.githubusercontent.com/echallenge/transformers/blob/main/conftest.py
hubconf.py: https://patch-diff.githubusercontent.com/echallenge/transformers/blob/main/hubconf.py
pyproject.toml: https://patch-diff.githubusercontent.com/echallenge/transformers/blob/main/pyproject.toml
setup.cfg: https://patch-diff.githubusercontent.com/echallenge/transformers/blob/main/setup.cfg
setup.py: https://patch-diff.githubusercontent.com/echallenge/transformers/blob/main/setup.py
README: https://patch-diff.githubusercontent.com/echallenge/transformers
Code of conduct: https://patch-diff.githubusercontent.com/echallenge/transformers
Contributing: https://patch-diff.githubusercontent.com/echallenge/transformers
License: https://patch-diff.githubusercontent.com/echallenge/transformers
https://camo.githubusercontent.com/76f57ec96af2ae06fb867b2b30de437e3f6a80cd3537015fc851e1c384a8422e/68747470733a2f2f68756767696e67666163652e636f2f64617461736574732f68756767696e67666163652f646f63756d656e746174696f6e2d696d616765732f7265736f6c76652f6d61696e2f7472616e73666f726d6572735f6c6f676f5f6e616d652e706e67
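camo.githubusercontent.com links like the one above are GitHub's image proxy, and the long hex tail after the digest is simply the hex-encoded origin URL. A small sketch to recover it (Python; the hex string is copied from the link above):

```python
# Hex tail of the camo URL above (everything after the second path segment),
# split at path-segment boundaries purely for readability.
hex_tail = (
    "68747470733a2f2f"                            # https://
    "68756767696e67666163652e636f2f"              # huggingface.co/
    "64617461736574732f"                          # datasets/
    "68756767696e67666163652f"                    # huggingface/
    "646f63756d656e746174696f6e2d696d616765732f"  # documentation-images/
    "7265736f6c76652f6d61696e2f"                  # resolve/main/
    "7472616e73666f726d6572735f6c6f676f5f6e616d652e706e67"  # transformers_logo_name.png
)

origin_url = bytes.fromhex(hex_tail).decode("utf-8")
print(origin_url)
# -> https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/transformers_logo_name.png
```

So this particular camo link resolves to the 🤗 Transformers logo image hosted on the Hugging Face Hub; the other camo links in this log decode the same way.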
https://circleci.com/gh/huggingface/transformers
https://github.com/huggingface/transformers/blob/main/LICENSE
https://huggingface.co/docs/transformers/index
https://github.com/huggingface/transformers/releases
https://github.com/huggingface/transformers/blob/main/CODE_OF_CONDUCT.md
https://zenodo.org/badge/latestdoi/155220641
简体中文 (Simplified Chinese): https://github.com/huggingface/transformers/blob/main/README_zh-hans.md
繁體中文 (Traditional Chinese): https://github.com/huggingface/transformers/blob/main/README_zh-hant.md
한국어 (Korean): https://github.com/huggingface/transformers/blob/main/README_ko.md
Español (Spanish): https://github.com/huggingface/transformers/blob/main/README_es.md
日本語 (Japanese): https://github.com/huggingface/transformers/blob/main/README_ja.md
https://patch-diff.githubusercontent.com/echallenge/transformers#------------english---------简体中文---------繁體中文---------한국어---------español---------日本語----
https://patch-diff.githubusercontent.com/echallenge/transformers#----state-of-the-art-machine-learning-for-jax-pytorch-and-tensorflow
https://hf.co/course
https://patch-diff.githubusercontent.com/echallenge/transformers#----
model hub: https://huggingface.co/models
Jax: https://jax.readthedocs.io/en/latest/
PyTorch: https://pytorch.org/
TensorFlow: https://www.tensorflow.org/
https://patch-diff.githubusercontent.com/echallenge/transformers#online-demos
model hub: https://huggingface.co/models
private model hosting, versioning, & an inference API: https://huggingface.co/pricing
Masked word completion with BERT: https://huggingface.co/bert-base-uncased?text=Paris+is+the+%5BMASK%5D+of+France
Named Entity Recognition with Electra: https://huggingface.co/dbmdz/electra-large-discriminator-finetuned-conll03-english?text=My+name+is+Sarah+and+I+live+in+London+city
Text generation with GPT-2: https://huggingface.co/gpt2?text=A+long+time+ago%2C+
Natural Language Inference with RoBERTa: https://huggingface.co/roberta-large-mnli?text=The+dog+was+lost.+Nobody+lost+any+animal
Summarization with BART: https://huggingface.co/facebook/bart-large-cnn?text=The+tower+is+324+metres+%281%2C063+ft%29+tall%2C+about+the+same+height+as+an+81-storey+building%2C+and+the+tallest+structure+in+Paris.+Its+base+is+square%2C+measuring+125+metres+%28410+ft%29+on+each+side.+During+its+construction%2C+the+Eiffel+Tower+surpassed+the+Washington+Monument+to+become+the+tallest+man-made+structure+in+the+world%2C+a+title+it+held+for+41+years+until+the+Chrysler+Building+in+New+York+City+was+finished+in+1930.+It+was+the+first+structure+to+reach+a+height+of+300+metres.+Due+to+the+addition+of+a+broadcasting+aerial+at+the+top+of+the+tower+in+1957%2C+it+is+now+taller+than+the+Chrysler+Building+by+5.2+metres+%2817+ft%29.+Excluding+transmitters%2C+the+Eiffel+Tower+is+the+second+tallest+free-standing+structure+in+France+after+the+Millau+Viaduct
Question answering with DistilBERT: https://huggingface.co/distilbert-base-uncased-distilled-squad?text=Which+name+is+also+used+to+describe+the+Amazon+rainforest+in+English%3F&context=The+Amazon+rainforest+%28Portuguese%3A+Floresta+Amaz%C3%B4nica+or+Amaz%C3%B4nia%3B+Spanish%3A+Selva+Amaz%C3%B3nica%2C+Amazon%C3%ADa+or+usually+Amazonia%3B+French%3A+For%C3%AAt+amazonienne%3B+Dutch%3A+Amazoneregenwoud%29%2C+also+known+in+English+as+Amazonia+or+the+Amazon+Jungle%2C+is+a+moist+broadleaf+forest+that+covers+most+of+the+Amazon+basin+of+South+America.+This+basin+encompasses+7%2C000%2C000+square+kilometres+%282%2C700%2C000+sq+mi%29%2C+of+which+5%2C500%2C000+square+kilometres+%282%2C100%2C000+sq+mi%29+are+covered+by+the+rainforest.+This+region+includes+territory+belonging+to+nine+nations.+The+majority+of+the+forest+is+contained+within+Brazil%2C+with+60%25+of+the+rainforest%2C+followed+by+Peru+with+13%25%2C+Colombia+with+10%25%2C+and+with+minor+amounts+in+Venezuela%2C+Ecuador%2C+Bolivia%2C+Guyana%2C+Suriname+and+French+Guiana.+States+or+departments+in+four+nations+contain+%22Amazonas%22+in+their+names.+The+Amazon+represents+over+half+of+the+planet%27s+remaining+rainforests%2C+and+comprises+the+largest+and+most+biodiverse+tract+of+tropical+rainforest+in+the+world%2C+with+an+estimated+390+billion+individual+trees+divided+into+16%2C000+species
Translation with T5: https://huggingface.co/t5-base?text=My+name+is+Wolfgang+and+I+live+in+Berlin
Image classification with ViT: https://huggingface.co/google/vit-base-patch16-224
Object Detection with DETR: https://huggingface.co/facebook/detr-resnet-50
Semantic Segmentation with SegFormer: https://huggingface.co/nvidia/segformer-b0-finetuned-ade-512-512
Panoptic Segmentation with DETR: https://huggingface.co/facebook/detr-resnet-50-panoptic
Automatic Speech Recognition with Wav2Vec2: https://huggingface.co/facebook/wav2vec2-base-960h
Keyword Spotting with Wav2Vec2: https://huggingface.co/superb/wav2vec2-base-superb-ks
Visual Question Answering with ViLT: https://huggingface.co/dandelin/vilt-b32-finetuned-vqa
Write With Transformer: https://transformer.huggingface.co
https://patch-diff.githubusercontent.com/echallenge/transformers#if-you-are-looking-for-custom-support-from-the-hugging-face-team
https://huggingface.co/support
https://patch-diff.githubusercontent.com/echallenge/transformers#quick-tour
https://camo.githubusercontent.com/4153c3f6ae91d9b2d21065c7ff7596b0e24b0c5c23febf3e66eb503324eed1d3/68747470733a2f2f68756767696e67666163652e636f2f64617461736574732f68756767696e67666163652f646f63756d656e746174696f6e2d696d616765732f7265736f6c76652f6d61696e2f636f636f5f73616d706c652e706e67
https://camo.githubusercontent.com/c8821fb97a1b525d5ea9b5f67057b37392c430ee7b5915b4d6ad481202f410a8/68747470733a2f2f68756767696e67666163652e636f2f64617461736574732f68756767696e67666163652f646f63756d656e746174696f6e2d696d616765732f7265736f6c76652f6d61696e2f636f636f5f73616d706c655f706f73745f70726f6365737365642e706e67
https://patch-diff.githubusercontent.com/echallenge/transformers#--------
this tutorial: https://huggingface.co/docs/transformers/task_summary
Pytorch nn.Module: https://pytorch.org/docs/stable/nn.html#torch.nn.Module
TensorFlow tf.keras.Model: https://www.tensorflow.org/api_docs/python/tf/keras/Model
This tutorial: https://huggingface.co/docs/transformers/training
https://patch-diff.githubusercontent.com/echallenge/transformers#why-should-i-use-transformers
https://patch-diff.githubusercontent.com/echallenge/transformers#why-shouldnt-i-use-transformers
Accelerate: https://huggingface.co/docs/accelerate
examples folder: https://github.com/huggingface/transformers/tree/main/examples
https://patch-diff.githubusercontent.com/echallenge/transformers#installation
https://patch-diff.githubusercontent.com/echallenge/transformers#with-pip
virtual environment: https://docs.python.org/3/library/venv.html
user guide: https://packaging.python.org/guides/installing-using-pip-and-virtual-environments/
TensorFlow installation page: https://www.tensorflow.org/install/
PyTorch installation page: https://pytorch.org/get-started/locally/#start-locally
Flax: https://github.com/google/flax#quick-install
Jax: https://github.com/google/jax#installation
install the library from source: https://huggingface.co/docs/transformers/installation#installing-from-source
https://patch-diff.githubusercontent.com/echallenge/transformers#with-conda
this issue: https://github.com/huggingface/huggingface_hub/issues/1062
https://patch-diff.githubusercontent.com/echallenge/transformers#model-architectures
All the model checkpoints: https://huggingface.co/models
model hub: https://huggingface.co
users: https://huggingface.co/users
organizations: https://huggingface.co/organizations
https://camo.githubusercontent.com/f36a36c84f2ff8605938db0f71595cdfebb5ebc941833aeb2591205f220bc9d2/68747470733a2f2f696d672e736869656c64732e696f2f656e64706f696e743f75726c3d68747470733a2f2f68756767696e67666163652e636f2f6170692f736869656c64732f6d6f64656c7326636f6c6f723d627269676874677265656e
here: https://huggingface.co/docs/transformers/model_summary
ALBERThttps://huggingface.co/docs/transformers/model_doc/albert
ALBERT: A Lite BERT for Self-supervised Learning of Language Representationshttps://arxiv.org/abs/1909.11942
BARThttps://huggingface.co/docs/transformers/model_doc/bart
BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehensionhttps://arxiv.org/abs/1910.13461
BARThezhttps://huggingface.co/docs/transformers/model_doc/barthez
BARThez: a Skilled Pretrained French Sequence-to-Sequence Modelhttps://arxiv.org/abs/2010.12321
BARTphohttps://huggingface.co/docs/transformers/model_doc/bartpho
BARTpho: Pre-trained Sequence-to-Sequence Models for Vietnamesehttps://arxiv.org/abs/2109.09701
BEiThttps://huggingface.co/docs/transformers/model_doc/beit
BEiT: BERT Pre-Training of Image Transformershttps://arxiv.org/abs/2106.08254
BERThttps://huggingface.co/docs/transformers/model_doc/bert
BERT: Pre-training of Deep Bidirectional Transformers for Language Understandinghttps://arxiv.org/abs/1810.04805
BERT For Sequence Generationhttps://huggingface.co/docs/transformers/model_doc/bert-generation
Leveraging Pre-trained Checkpoints for Sequence Generation Taskshttps://arxiv.org/abs/1907.12461
BERTweethttps://huggingface.co/docs/transformers/model_doc/bertweet
BERTweet: A pre-trained language model for English Tweetshttps://aclanthology.org/2020.emnlp-demos.2/
BigBird-Pegasushttps://huggingface.co/docs/transformers/model_doc/bigbird_pegasus
Big Bird: Transformers for Longer Sequenceshttps://arxiv.org/abs/2007.14062
BigBird-RoBERTahttps://huggingface.co/docs/transformers/model_doc/big_bird
Big Bird: Transformers for Longer Sequenceshttps://arxiv.org/abs/2007.14062
Blenderbothttps://huggingface.co/docs/transformers/model_doc/blenderbot
Recipes for building an open-domain chatbothttps://arxiv.org/abs/2004.13637
BlenderbotSmallhttps://huggingface.co/docs/transformers/model_doc/blenderbot-small
Recipes for building an open-domain chatbothttps://arxiv.org/abs/2004.13637
BLOOMhttps://huggingface.co/docs/transformers/model_doc/bloom
BigSicence Workshophttps://bigscience.huggingface.co/
BORThttps://huggingface.co/docs/transformers/model_doc/bort
Optimal Subarchitecture Extraction For BERThttps://arxiv.org/abs/2010.10499
ByT5https://huggingface.co/docs/transformers/model_doc/byt5
ByT5: Towards a token-free future with pre-trained byte-to-byte modelshttps://arxiv.org/abs/2105.13626
CamemBERThttps://huggingface.co/docs/transformers/model_doc/camembert
CamemBERT: a Tasty French Language Modelhttps://arxiv.org/abs/1911.03894
CANINEhttps://huggingface.co/docs/transformers/model_doc/canine
CANINE: Pre-training an Efficient Tokenization-Free Encoder for Language Representationhttps://arxiv.org/abs/2103.06874
CLIPhttps://huggingface.co/docs/transformers/model_doc/clip
Learning Transferable Visual Models From Natural Language Supervisionhttps://arxiv.org/abs/2103.00020
CLIPSeghttps://huggingface.co/docs/transformers/main/model_doc/clipseg
Image Segmentation Using Text and Image Promptshttps://arxiv.org/abs/2112.10003
CodeGenhttps://huggingface.co/docs/transformers/model_doc/codegen
A Conversational Paradigm for Program Synthesishttps://arxiv.org/abs/2203.13474
Conditional DETRhttps://huggingface.co/docs/transformers/model_doc/conditional_detr
Conditional DETR for Fast Training Convergencehttps://arxiv.org/abs/2108.06152
ConvBERThttps://huggingface.co/docs/transformers/model_doc/convbert
ConvBERT: Improving BERT with Span-based Dynamic Convolutionhttps://arxiv.org/abs/2008.02496
ConvNeXThttps://huggingface.co/docs/transformers/model_doc/convnext
A ConvNet for the 2020shttps://arxiv.org/abs/2201.03545
CPMhttps://huggingface.co/docs/transformers/model_doc/cpm
CPM: A Large-scale Generative Chinese Pre-trained Language Modelhttps://arxiv.org/abs/2012.00413
CTRLhttps://huggingface.co/docs/transformers/model_doc/ctrl
CTRL: A Conditional Transformer Language Model for Controllable Generationhttps://arxiv.org/abs/1909.05858
CvThttps://huggingface.co/docs/transformers/model_doc/cvt
CvT: Introducing Convolutions to Vision Transformershttps://arxiv.org/abs/2103.15808
Data2Vechttps://huggingface.co/docs/transformers/model_doc/data2vec
Data2Vec: A General Framework for Self-supervised Learning in Speech, Vision and Languagehttps://arxiv.org/abs/2202.03555
DeBERTahttps://huggingface.co/docs/transformers/model_doc/deberta
DeBERTa: Decoding-enhanced BERT with Disentangled Attentionhttps://arxiv.org/abs/2006.03654
DeBERTa-v2https://huggingface.co/docs/transformers/model_doc/deberta-v2
- [DeBERTa: Decoding-enhanced BERT with Disentangled Attention](https://arxiv.org/abs/2006.03654)
- **[Decision Transformer](https://huggingface.co/docs/transformers/model_doc/decision_transformer)** released with the paper [Decision Transformer: Reinforcement Learning via Sequence Modeling](https://arxiv.org/abs/2106.01345)
- **[Deformable DETR](https://huggingface.co/docs/transformers/model_doc/deformable_detr)** released with the paper [Deformable DETR: Deformable Transformers for End-to-End Object Detection](https://arxiv.org/abs/2010.04159)
- **[DeiT](https://huggingface.co/docs/transformers/model_doc/deit)** released with the paper [Training data-efficient image transformers & distillation through attention](https://arxiv.org/abs/2012.12877)
- **[DETR](https://huggingface.co/docs/transformers/model_doc/detr)** released with the paper [End-to-End Object Detection with Transformers](https://arxiv.org/abs/2005.12872)
- **[DialoGPT](https://huggingface.co/docs/transformers/model_doc/dialogpt)** released with the paper [DialoGPT: Large-Scale Generative Pre-training for Conversational Response Generation](https://arxiv.org/abs/1911.00536)
- **[DistilBERT](https://huggingface.co/docs/transformers/model_doc/distilbert)** released with the paper [DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter](https://arxiv.org/abs/1910.01108)
- **[DistilGPT2](https://github.com/huggingface/transformers/tree/main/examples/research_projects/distillation)**
- **[DistilRoBERTa](https://github.com/huggingface/transformers/tree/main/examples/research_projects/distillation)**
- **[DistilmBERT](https://github.com/huggingface/transformers/tree/main/examples/research_projects/distillation)**
- **[DiT](https://huggingface.co/docs/transformers/model_doc/dit)** released with the paper [DiT: Self-supervised Pre-training for Document Image Transformer](https://arxiv.org/abs/2203.02378)
- **[Donut](https://huggingface.co/docs/transformers/model_doc/donut)** released with the paper [OCR-free Document Understanding Transformer](https://arxiv.org/abs/2111.15664)
- **[DPR](https://huggingface.co/docs/transformers/model_doc/dpr)** released with the paper [Dense Passage Retrieval for Open-Domain Question Answering](https://arxiv.org/abs/2004.04906)
- **[DPT](https://huggingface.co/docs/transformers/master/model_doc/dpt)** released with the paper [Vision Transformers for Dense Prediction](https://arxiv.org/abs/2103.13413)
- **[ELECTRA](https://huggingface.co/docs/transformers/model_doc/electra)** released with the paper [ELECTRA: Pre-training text encoders as discriminators rather than generators](https://arxiv.org/abs/2003.10555)
- **[EncoderDecoder](https://huggingface.co/docs/transformers/model_doc/encoder-decoder)** released with the paper [Leveraging Pre-trained Checkpoints for Sequence Generation Tasks](https://arxiv.org/abs/1907.12461)
- **[ERNIE](https://huggingface.co/docs/transformers/model_doc/ernie)** released with the paper [ERNIE: Enhanced Representation through Knowledge Integration](https://arxiv.org/abs/1904.09223)
- **[ESM](https://huggingface.co/docs/transformers/model_doc/esm)** released with the papers [Biological structure and function emerge from scaling unsupervised learning to 250 million protein sequences](https://www.pnas.org/content/118/15/e2016239118), [Language models enable zero-shot prediction of the effects of mutations on protein function](https://doi.org/10.1101/2021.07.09.450648), and [Language models of protein sequences at the scale of evolution enable accurate structure prediction](https://doi.org/10.1101/2022.07.20.500902)
- **[FLAN-T5](https://huggingface.co/docs/transformers/model_doc/flan-t5)** released in the repository [google-research/t5x](https://github.com/google-research/t5x/blob/main/docs/models.md#flan-t5-checkpoints)
- **[FlauBERT](https://huggingface.co/docs/transformers/model_doc/flaubert)** released with the paper [FlauBERT: Unsupervised Language Model Pre-training for French](https://arxiv.org/abs/1912.05372)
- **[FLAVA](https://huggingface.co/docs/transformers/model_doc/flava)** released with the paper [FLAVA: A Foundational Language And Vision Alignment Model](https://arxiv.org/abs/2112.04482)
- **[FNet](https://huggingface.co/docs/transformers/model_doc/fnet)** released with the paper [FNet: Mixing Tokens with Fourier Transforms](https://arxiv.org/abs/2105.03824)
- **[Funnel Transformer](https://huggingface.co/docs/transformers/model_doc/funnel)** released with the paper [Funnel-Transformer: Filtering out Sequential Redundancy for Efficient Language Processing](https://arxiv.org/abs/2006.03236)
- **[GLPN](https://huggingface.co/docs/transformers/model_doc/glpn)** released with the paper [Global-Local Path Networks for Monocular Depth Estimation with Vertical CutDepth](https://arxiv.org/abs/2201.07436)
- **[GPT](https://huggingface.co/docs/transformers/model_doc/openai-gpt)** released with the paper [Improving Language Understanding by Generative Pre-Training](https://blog.openai.com/language-unsupervised/)
- **[GPT Neo](https://huggingface.co/docs/transformers/model_doc/gpt_neo)** released in the repository [EleutherAI/gpt-neo](https://github.com/EleutherAI/gpt-neo)
- **[GPT NeoX](https://huggingface.co/docs/transformers/model_doc/gpt_neox)** released with the paper [GPT-NeoX-20B: An Open-Source Autoregressive Language Model](https://arxiv.org/abs/2204.06745)
- **[GPT NeoX Japanese](https://huggingface.co/docs/transformers/model_doc/gpt_neox_japanese)**
- **[GPT-2](https://huggingface.co/docs/transformers/model_doc/gpt2)** released with the paper [Language Models are Unsupervised Multitask Learners](https://blog.openai.com/better-language-models/)
- **[GPT-J](https://huggingface.co/docs/transformers/model_doc/gptj)** released in the repository [kingoflolz/mesh-transformer-jax](https://github.com/kingoflolz/mesh-transformer-jax/)
- **[GroupViT](https://huggingface.co/docs/transformers/model_doc/groupvit)** released with the paper [GroupViT: Semantic Segmentation Emerges from Text Supervision](https://arxiv.org/abs/2202.11094)
- **[Hubert](https://huggingface.co/docs/transformers/model_doc/hubert)** released with the paper [HuBERT: Self-Supervised Speech Representation Learning by Masked Prediction of Hidden Units](https://arxiv.org/abs/2106.07447)
- **[I-BERT](https://huggingface.co/docs/transformers/model_doc/ibert)** released with the paper [I-BERT: Integer-only BERT Quantization](https://arxiv.org/abs/2101.01321)
- **[ImageGPT](https://huggingface.co/docs/transformers/model_doc/imagegpt)** released with the paper [Generative Pretraining from Pixels](https://openai.com/blog/image-gpt/)
- **[Jukebox](https://huggingface.co/docs/transformers/main/model_doc/jukebox)** released with the paper [Jukebox: A Generative Model for Music](https://arxiv.org/pdf/2005.00341.pdf)
- **[LayoutLM](https://huggingface.co/docs/transformers/model_doc/layoutlm)** released with the paper [LayoutLM: Pre-training of Text and Layout for Document Image Understanding](https://arxiv.org/abs/1912.13318)
- **[LayoutLMv2](https://huggingface.co/docs/transformers/model_doc/layoutlmv2)** released with the paper [LayoutLMv2: Multi-modal Pre-training for Visually-Rich Document Understanding](https://arxiv.org/abs/2012.14740)
- **[LayoutLMv3](https://huggingface.co/docs/transformers/model_doc/layoutlmv3)** released with the paper [LayoutLMv3: Pre-training for Document AI with Unified Text and Image Masking](https://arxiv.org/abs/2204.08387)
- **[LayoutXLM](https://huggingface.co/docs/transformers/model_doc/layoutxlm)** released with the paper [LayoutXLM: Multimodal Pre-training for Multilingual Visually-rich Document Understanding](https://arxiv.org/abs/2104.08836)
- **[LED](https://huggingface.co/docs/transformers/model_doc/led)** released with the paper [Longformer: The Long-Document Transformer](https://arxiv.org/abs/2004.05150)
- **[LeViT](https://huggingface.co/docs/transformers/model_doc/levit)** released with the paper [LeViT: A Vision Transformer in ConvNet's Clothing for Faster Inference](https://arxiv.org/abs/2104.01136)
- **[LiLT](https://huggingface.co/docs/transformers/model_doc/lilt)** released with the paper [LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding](https://arxiv.org/abs/2202.13669)
- **[Longformer](https://huggingface.co/docs/transformers/model_doc/longformer)** released with the paper [Longformer: The Long-Document Transformer](https://arxiv.org/abs/2004.05150)
- **[LongT5](https://huggingface.co/docs/transformers/model_doc/longt5)** released with the paper [LongT5: Efficient Text-To-Text Transformer for Long Sequences](https://arxiv.org/abs/2112.07916)
- **[LUKE](https://huggingface.co/docs/transformers/model_doc/luke)** released with the paper [LUKE: Deep Contextualized Entity Representations with Entity-aware Self-attention](https://arxiv.org/abs/2010.01057)
- **[LXMERT](https://huggingface.co/docs/transformers/model_doc/lxmert)** released with the paper [LXMERT: Learning Cross-Modality Encoder Representations from Transformers for Open-Domain Question Answering](https://arxiv.org/abs/1908.07490)
- **[M-CTC-T](https://huggingface.co/docs/transformers/model_doc/mctct)** released with the paper [Pseudo-Labeling For Massively Multilingual Speech Recognition](https://arxiv.org/abs/2111.00161)
- **[M2M100](https://huggingface.co/docs/transformers/model_doc/m2m_100)** released with the paper [Beyond English-Centric Multilingual Machine Translation](https://arxiv.org/abs/2010.11125)
- **[MarianMT](https://huggingface.co/docs/transformers/model_doc/marian)** machine translation models trained using [OPUS](http://opus.nlpl.eu/) data; see the [Marian Framework](https://marian-nmt.github.io/)
- **[MarkupLM](https://huggingface.co/docs/transformers/model_doc/markuplm)** released with the paper [MarkupLM: Pre-training of Text and Markup Language for Visually-rich Document Understanding](https://arxiv.org/abs/2110.08518)
- **[MaskFormer](https://huggingface.co/docs/transformers/model_doc/maskformer)** released with the paper [Per-Pixel Classification is Not All You Need for Semantic Segmentation](https://arxiv.org/abs/2107.06278)
- **[mBART](https://huggingface.co/docs/transformers/model_doc/mbart)** released with the paper [Multilingual Denoising Pre-training for Neural Machine Translation](https://arxiv.org/abs/2001.08210)
- **[mBART-50](https://huggingface.co/docs/transformers/model_doc/mbart)** released with the paper [Multilingual Translation with Extensible Multilingual Pretraining and Finetuning](https://arxiv.org/abs/2008.00401)
- **[Megatron-BERT](https://huggingface.co/docs/transformers/model_doc/megatron-bert)** released with the paper [Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism](https://arxiv.org/abs/1909.08053)
- **[Megatron-GPT2](https://huggingface.co/docs/transformers/model_doc/megatron_gpt2)** released with the paper [Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism](https://arxiv.org/abs/1909.08053)
- **[mLUKE](https://huggingface.co/docs/transformers/model_doc/mluke)** released with the paper [mLUKE: The Power of Entity Representations in Multilingual Pretrained Language Models](https://arxiv.org/abs/2110.08151)
- **[MobileBERT](https://huggingface.co/docs/transformers/model_doc/mobilebert)** released with the paper [MobileBERT: a Compact Task-Agnostic BERT for Resource-Limited Devices](https://arxiv.org/abs/2004.02984)
- **[MobileNetV2](https://huggingface.co/docs/transformers/model_doc/mobilenet_v2)** released with the paper [MobileNetV2: Inverted Residuals and Linear Bottlenecks](https://arxiv.org/abs/1801.04381)
- **[MobileViT](https://huggingface.co/docs/transformers/model_doc/mobilevit)** released with the paper [MobileViT: Light-weight, General-purpose, and Mobile-friendly Vision Transformer](https://arxiv.org/abs/2110.02178)
- **[MPNet](https://huggingface.co/docs/transformers/model_doc/mpnet)** released with the paper [MPNet: Masked and Permuted Pre-training for Language Understanding](https://arxiv.org/abs/2004.09297)
- **[MT5](https://huggingface.co/docs/transformers/model_doc/mt5)** released with the paper [mT5: A massively multilingual pre-trained text-to-text transformer](https://arxiv.org/abs/2010.11934)
- **[MVP](https://huggingface.co/docs/transformers/model_doc/mvp)** released with the paper [MVP: Multi-task Supervised Pre-training for Natural Language Generation](https://arxiv.org/abs/2206.12131)
- **[Nezha](https://huggingface.co/docs/transformers/model_doc/nezha)** released with the paper [NEZHA: Neural Contextualized Representation for Chinese Language Understanding](https://arxiv.org/abs/1909.00204)
- **[NLLB](https://huggingface.co/docs/transformers/model_doc/nllb)** released with the paper [No Language Left Behind: Scaling Human-Centered Machine Translation](https://arxiv.org/abs/2207.04672)
- **[Nyströmformer](https://huggingface.co/docs/transformers/model_doc/nystromformer)** released with the paper [Nyströmformer: A Nyström-Based Algorithm for Approximating Self-Attention](https://arxiv.org/abs/2102.03902)
- **[OPT](https://huggingface.co/docs/transformers/master/model_doc/opt)** released with the paper [OPT: Open Pre-trained Transformer Language Models](https://arxiv.org/abs/2205.01068)
- **[OWL-ViT](https://huggingface.co/docs/transformers/model_doc/owlvit)** released with the paper [Simple Open-Vocabulary Object Detection with Vision Transformers](https://arxiv.org/abs/2205.06230)
- **[Pegasus](https://huggingface.co/docs/transformers/model_doc/pegasus)** released with the paper [PEGASUS: Pre-training with Extracted Gap-sentences for Abstractive Summarization](https://arxiv.org/abs/1912.08777)
- **[PEGASUS-X](https://huggingface.co/docs/transformers/model_doc/pegasus_x)** released with the paper [Investigating Efficiently Extending Transformers for Long Input Summarization](https://arxiv.org/abs/2208.04347)
- **[Perceiver IO](https://huggingface.co/docs/transformers/model_doc/perceiver)** released with the paper [Perceiver IO: A General Architecture for Structured Inputs & Outputs](https://arxiv.org/abs/2107.14795)
- **[PhoBERT](https://huggingface.co/docs/transformers/model_doc/phobert)** released with the paper [PhoBERT: Pre-trained language models for Vietnamese](https://www.aclweb.org/anthology/2020.findings-emnlp.92/)
- **[PLBart](https://huggingface.co/docs/transformers/model_doc/plbart)** released with the paper [Unified Pre-training for Program Understanding and Generation](https://arxiv.org/abs/2103.06333)
- **[PoolFormer](https://huggingface.co/docs/transformers/model_doc/poolformer)** released with the paper [MetaFormer is Actually What You Need for Vision](https://arxiv.org/abs/2111.11418)
- **[ProphetNet](https://huggingface.co/docs/transformers/model_doc/prophetnet)** released with the paper [ProphetNet: Predicting Future N-gram for Sequence-to-Sequence Pre-training](https://arxiv.org/abs/2001.04063)
- **[QDQBert](https://huggingface.co/docs/transformers/model_doc/qdqbert)** released with the paper [Integer Quantization for Deep Learning Inference: Principles and Empirical Evaluation](https://arxiv.org/abs/2004.09602)
- **[RAG](https://huggingface.co/docs/transformers/model_doc/rag)** released with the paper [Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks](https://arxiv.org/abs/2005.11401)
- **[REALM](https://huggingface.co/docs/transformers/model_doc/realm.html)** released with the paper [REALM: Retrieval-Augmented Language Model Pre-Training](https://arxiv.org/abs/2002.08909)
- **[Reformer](https://huggingface.co/docs/transformers/model_doc/reformer)** released with the paper [Reformer: The Efficient Transformer](https://arxiv.org/abs/2001.04451)
- **[RegNet](https://huggingface.co/docs/transformers/model_doc/regnet)** released with the paper [Designing Network Design Spaces](https://arxiv.org/abs/2003.13678)
- **[RemBERT](https://huggingface.co/docs/transformers/model_doc/rembert)** released with the paper [Rethinking embedding coupling in pre-trained language models](https://arxiv.org/abs/2010.12821)
- **[ResNet](https://huggingface.co/docs/transformers/model_doc/resnet)** released with the paper [Deep Residual Learning for Image Recognition](https://arxiv.org/abs/1512.03385)
- **[RoBERTa](https://huggingface.co/docs/transformers/model_doc/roberta)** released with the paper [RoBERTa: A Robustly Optimized BERT Pretraining Approach](https://arxiv.org/abs/1907.11692)
- **[RoCBert](https://huggingface.co/docs/transformers/main/model_doc/roc_bert)** released with the paper [RoCBert: Robust Chinese Bert with Multimodal Contrastive Pretraining](https://aclanthology.org/2022.acl-long.65.pdf)
- **[RoFormer](https://huggingface.co/docs/transformers/model_doc/roformer)** released with the paper [RoFormer: Enhanced Transformer with Rotary Position Embedding](https://arxiv.org/abs/2104.09864)
- **[SegFormer](https://huggingface.co/docs/transformers/model_doc/segformer)** released with the paper [SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers](https://arxiv.org/abs/2105.15203)
- **[SEW](https://huggingface.co/docs/transformers/model_doc/sew)** released with the paper [Performance-Efficiency Trade-offs in Unsupervised Pre-training for Speech Recognition](https://arxiv.org/abs/2109.06870)
- **[SEW-D](https://huggingface.co/docs/transformers/model_doc/sew_d)** released with the paper [Performance-Efficiency Trade-offs in Unsupervised Pre-training for Speech Recognition](https://arxiv.org/abs/2109.06870)
- **[SpeechToTextTransformer](https://huggingface.co/docs/transformers/model_doc/speech_to_text)** released with the paper [fairseq S2T: Fast Speech-to-Text Modeling with fairseq](https://arxiv.org/abs/2010.05171)
- **[SpeechToTextTransformer2](https://huggingface.co/docs/transformers/model_doc/speech_to_text_2)** released with the paper [Large-Scale Self- and Semi-Supervised Learning for Speech Translation](https://arxiv.org/abs/2104.06678)
- **[Splinter](https://huggingface.co/docs/transformers/model_doc/splinter)** released with the paper [Few-Shot Question Answering by Pretraining Span Selection](https://arxiv.org/abs/2101.00438)
- **[SqueezeBERT](https://huggingface.co/docs/transformers/model_doc/squeezebert)** released with the paper [SqueezeBERT: What can computer vision teach NLP about efficient neural networks?](https://arxiv.org/abs/2006.11316)
- **[Swin Transformer](https://huggingface.co/docs/transformers/model_doc/swin)** released with the paper [Swin Transformer: Hierarchical Vision Transformer using Shifted Windows](https://arxiv.org/abs/2103.14030)
- **[Swin Transformer V2](https://huggingface.co/docs/transformers/model_doc/swinv2)** released with the paper [Swin Transformer V2: Scaling Up Capacity and Resolution](https://arxiv.org/abs/2111.09883)
- **[SwitchTransformers](https://huggingface.co/docs/transformers/main/model_doc/switch_transformers)** released with the paper [Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient Sparsity](https://arxiv.org/abs/2101.03961)
- **[T5](https://huggingface.co/docs/transformers/model_doc/t5)** released with the paper [Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer](https://arxiv.org/abs/1910.10683)
- **[T5v1.1](https://huggingface.co/docs/transformers/model_doc/t5v1.1)** released in the repository [google-research/text-to-text-transfer-transformer](https://github.com/google-research/text-to-text-transfer-transformer/blob/main/released_checkpoints.md#t511)
- **[Table Transformer](https://huggingface.co/docs/transformers/model_doc/table-transformer)** released with the paper [PubTables-1M: Towards Comprehensive Table Extraction From Unstructured Documents](https://arxiv.org/abs/2110.00061)
- **[TAPAS](https://huggingface.co/docs/transformers/model_doc/tapas)** released with the paper [TAPAS: Weakly Supervised Table Parsing via Pre-training](https://arxiv.org/abs/2004.02349)
- **[TAPEX](https://huggingface.co/docs/transformers/model_doc/tapex)** released with the paper [TAPEX: Table Pre-training via Learning a Neural SQL Executor](https://arxiv.org/abs/2107.07653)
- **[Time Series Transformer](https://huggingface.co/docs/transformers/model_doc/time_series_transformer)**
- **[Trajectory Transformer](https://huggingface.co/docs/transformers/model_doc/trajectory_transformer)** released with the paper [Offline Reinforcement Learning as One Big Sequence Modeling Problem](https://arxiv.org/abs/2106.02039)
- **[Transformer-XL](https://huggingface.co/docs/transformers/model_doc/transfo-xl)** released with the paper [Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context](https://arxiv.org/abs/1901.02860)
- **[TrOCR](https://huggingface.co/docs/transformers/model_doc/trocr)** released with the paper [TrOCR: Transformer-based Optical Character Recognition with Pre-trained Models](https://arxiv.org/abs/2109.10282)
- **[UL2](https://huggingface.co/docs/transformers/model_doc/ul2)** released with the paper [Unifying Language Learning Paradigms](https://arxiv.org/abs/2205.05131v1)
- **[UniSpeech](https://huggingface.co/docs/transformers/model_doc/unispeech)** released with the paper [UniSpeech: Unified Speech Representation Learning with Labeled and Unlabeled Data](https://arxiv.org/abs/2101.07597)
- **[UniSpeechSat](https://huggingface.co/docs/transformers/model_doc/unispeech-sat)** released with the paper [UniSpeech-SAT: Universal Speech Representation Learning with Speaker Aware Pre-Training](https://arxiv.org/abs/2110.05752)
- **[VAN](https://huggingface.co/docs/transformers/model_doc/van)** released with the paper [Visual Attention Network](https://arxiv.org/abs/2202.09741)
- **[VideoMAE](https://huggingface.co/docs/transformers/model_doc/videomae)** released with the paper [VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training](https://arxiv.org/abs/2203.12602)
- **[ViLT](https://huggingface.co/docs/transformers/model_doc/vilt)** released with the paper [ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision](https://arxiv.org/abs/2102.03334)
- **[Vision Transformer (ViT)](https://huggingface.co/docs/transformers/model_doc/vit)** released with the paper [An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale](https://arxiv.org/abs/2010.11929)
- **[VisualBERT](https://huggingface.co/docs/transformers/model_doc/visual_bert)** released with the paper [VisualBERT: A Simple and Performant Baseline for Vision and Language](https://arxiv.org/pdf/1908.03557)
- **[ViTMAE](https://huggingface.co/docs/transformers/model_doc/vit_mae)** released with the paper [Masked Autoencoders Are Scalable Vision Learners](https://arxiv.org/abs/2111.06377)
- **[ViTMSN](https://huggingface.co/docs/transformers/model_doc/vit_msn)** released with the paper [Masked Siamese Networks for Label-Efficient Learning](https://arxiv.org/abs/2204.07141)
- **[Wav2Vec2](https://huggingface.co/docs/transformers/model_doc/wav2vec2)** released with the paper [wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations](https://arxiv.org/abs/2006.11477)
- **[Wav2Vec2-Conformer](https://huggingface.co/docs/transformers/model_doc/wav2vec2-conformer)** released with the paper [FAIRSEQ S2T: Fast Speech-to-Text Modeling with FAIRSEQ](https://arxiv.org/abs/2010.05171)
- **[Wav2Vec2Phoneme](https://huggingface.co/docs/transformers/model_doc/wav2vec2_phoneme)** released with the paper [Simple and Effective Zero-shot Cross-lingual Phoneme Recognition](https://arxiv.org/abs/2109.11680)
- **[WavLM](https://huggingface.co/docs/transformers/model_doc/wavlm)** released with the paper [WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing](https://arxiv.org/abs/2110.13900)
- **[Whisper](https://huggingface.co/docs/transformers/model_doc/whisper)** released with the paper [Robust Speech Recognition via Large-Scale Weak Supervision](https://cdn.openai.com/papers/whisper.pdf)
- **[X-CLIP](https://huggingface.co/docs/transformers/model_doc/xclip)** released with the paper [Expanding Language-Image Pretrained Models for General Video Recognition](https://arxiv.org/abs/2208.02816)
- **[XGLM](https://huggingface.co/docs/transformers/model_doc/xglm)** released with the paper [Few-shot Learning with Multilingual Language Models](https://arxiv.org/abs/2112.10668)
- **[XLM](https://huggingface.co/docs/transformers/model_doc/xlm)** released with the paper [Cross-lingual Language Model Pretraining](https://arxiv.org/abs/1901.07291)
- **[XLM-ProphetNet](https://huggingface.co/docs/transformers/model_doc/xlm-prophetnet)** released with the paper [ProphetNet: Predicting Future N-gram for Sequence-to-Sequence Pre-training](https://arxiv.org/abs/2001.04063)
- **[XLM-RoBERTa](https://huggingface.co/docs/transformers/model_doc/xlm-roberta)** released with the paper [Unsupervised Cross-lingual Representation Learning at Scale](https://arxiv.org/abs/1911.02116)
- **[XLM-RoBERTa-XL](https://huggingface.co/docs/transformers/model_doc/xlm-roberta-xl)** released with the paper [Larger-Scale Transformers for Multilingual Masked Language Modeling](https://arxiv.org/abs/2105.00572)
- **[XLNet](https://huggingface.co/docs/transformers/model_doc/xlnet)** released with the paper [XLNet: Generalized Autoregressive Pretraining for Language Understanding](https://arxiv.org/abs/1906.08237)
- **[XLS-R](https://huggingface.co/docs/transformers/model_doc/xls_r)** released with the paper [XLS-R: Self-supervised Cross-lingual Speech Representation Learning at Scale](https://arxiv.org/abs/2111.09296)
- **[XLSR-Wav2Vec2](https://huggingface.co/docs/transformers/model_doc/xlsr_wav2vec2)** released with the paper [Unsupervised Cross-Lingual Representation Learning For Speech Recognition](https://arxiv.org/abs/2006.13979)
- **[YOLOS](https://huggingface.co/docs/transformers/model_doc/yolos)** released with the paper [You Only Look at One Sequence: Rethinking Transformer in Vision through Object Detection](https://arxiv.org/abs/2106.00666)
- **[YOSO](https://huggingface.co/docs/transformers/model_doc/yoso)** released with the paper [You Only Sample (Almost) Once: Linear Cost Self-Attention Via Bernoulli Sampling](https://arxiv.org/abs/2111.09714)
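All of the models listed above expose the same configuration/model API. A minimal sketch, assuming `transformers` and `torch` are installed (the tiny hyperparameters here are illustrative, chosen so the model builds instantly and offline with random weights rather than downloading a pretrained checkpoint):

```python
import torch
from transformers import BertConfig, BertModel

# Define a deliberately tiny BERT; every architecture in the list has an
# analogous Config/Model pair (e.g. RobertaConfig/RobertaModel).
config = BertConfig(
    hidden_size=64,
    num_hidden_layers=2,
    num_attention_heads=2,
    intermediate_size=128,
)
model = BertModel(config)  # randomly initialized, no network access needed

# A dummy batch of one sequence of four token ids.
input_ids = torch.tensor([[101, 2054, 2003, 102]])
with torch.no_grad():
    outputs = model(input_ids)

# Hidden states have shape (batch, sequence_length, hidden_size).
print(tuple(outputs.last_hidden_state.shape))  # (1, 4, 64)
```

For pretrained weights you would instead call `BertModel.from_pretrained(...)` (or the `AutoModel`/`AutoTokenizer` classes) with a checkpoint name from the Hub.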
- [templates](https://patch-diff.githubusercontent.com/echallenge/transformers/blob/main/templates)
- [contributing guidelines](https://patch-diff.githubusercontent.com/echallenge/transformers/blob/main/CONTRIBUTING.md)
- [this table](https://huggingface.co/docs/transformers/index#supported-frameworks) (supported frameworks)
- [documentation](https://github.com/huggingface/transformers/tree/main/examples)

**[Learn more](https://patch-diff.githubusercontent.com/echallenge/transformers#learn-more)**

- [Documentation](https://huggingface.co/docs/transformers/)
- [Task summary](https://huggingface.co/docs/transformers/task_summary)
- [Preprocessing tutorial](https://huggingface.co/docs/transformers/preprocessing)
- [Training and fine-tuning](https://huggingface.co/docs/transformers/training)
- [Quick tour: Fine-tuning/usage scripts](https://github.com/huggingface/transformers/tree/main/examples)
- [Model sharing and uploading](https://huggingface.co/docs/transformers/model_sharing)
- [Migration](https://huggingface.co/docs/transformers/migration)

**[Citation](https://patch-diff.githubusercontent.com/echallenge/transformers#citation)**

- [paper](https://www.aclweb.org/anthology/2020.emnlp-demos.6/)
- [huggingface.co/transformers](https://huggingface.co/transformers)