| Skip to content | https://patch-diff.githubusercontent.com/texonom/transformers.js#start-of-content |
|
| https://patch-diff.githubusercontent.com/ |
|
Sign in
| https://patch-diff.githubusercontent.com/login?return_to=https%3A%2F%2Fgithub.com%2Ftexonom%2Ftransformers.js |
| GitHub CopilotWrite better code with AI | https://github.com/features/copilot |
| GitHub SparkBuild and deploy intelligent apps | https://github.com/features/spark |
| GitHub ModelsManage and compare prompts | https://github.com/features/models |
| MCP RegistryNewIntegrate external tools | https://github.com/mcp |
| ActionsAutomate any workflow | https://github.com/features/actions |
| CodespacesInstant dev environments | https://github.com/features/codespaces |
| IssuesPlan and track work | https://github.com/features/issues |
| Code ReviewManage code changes | https://github.com/features/code-review |
| GitHub Advanced SecurityFind and fix vulnerabilities | https://github.com/security/advanced-security |
| Code securitySecure your code as you build | https://github.com/security/advanced-security/code-security |
| Secret protectionStop leaks before they start | https://github.com/security/advanced-security/secret-protection |
| Why GitHub | https://github.com/why-github |
| Documentation | https://docs.github.com |
| Blog | https://github.blog |
| Changelog | https://github.blog/changelog |
| Marketplace | https://github.com/marketplace |
| View all features | https://github.com/features |
| Enterprises | https://github.com/enterprise |
| Small and medium teams | https://github.com/team |
| Startups | https://github.com/enterprise/startups |
| Nonprofits | https://github.com/solutions/industry/nonprofits |
| App Modernization | https://github.com/solutions/use-case/app-modernization |
| DevSecOps | https://github.com/solutions/use-case/devsecops |
| DevOps | https://github.com/solutions/use-case/devops |
| CI/CD | https://github.com/solutions/use-case/ci-cd |
| View all use cases | https://github.com/solutions/use-case |
| Healthcare | https://github.com/solutions/industry/healthcare |
| Financial services | https://github.com/solutions/industry/financial-services |
| Manufacturing | https://github.com/solutions/industry/manufacturing |
| Government | https://github.com/solutions/industry/government |
| View all industries | https://github.com/solutions/industry |
| View all solutions | https://github.com/solutions |
| AI | https://github.com/resources/articles?topic=ai |
| Software Development | https://github.com/resources/articles?topic=software-development |
| DevOps | https://github.com/resources/articles?topic=devops |
| Security | https://github.com/resources/articles?topic=security |
| View all topics | https://github.com/resources/articles |
| Customer stories | https://github.com/customer-stories |
| Events & webinars | https://github.com/resources/events |
| Ebooks & reports | https://github.com/resources/whitepapers |
| Business insights | https://github.com/solutions/executive-insights |
| GitHub Skills | https://skills.github.com |
| Documentation | https://docs.github.com |
| Customer support | https://support.github.com |
| Community forum | https://github.com/orgs/community/discussions |
| Trust center | https://github.com/trust-center |
| Partners | https://github.com/partners |
| GitHub SponsorsFund open source developers | https://github.com/sponsors |
| Security Lab | https://securitylab.github.com |
| Maintainer Community | https://maintainers.github.com |
| Accelerator | https://github.com/accelerator |
| Archive Program | https://archiveprogram.github.com |
| Topics | https://github.com/topics |
| Trending | https://github.com/trending |
| Collections | https://github.com/collections |
| Enterprise platformAI-powered developer platform | https://github.com/enterprise |
| GitHub Advanced SecurityEnterprise-grade security features | https://github.com/security/advanced-security |
| Copilot for BusinessEnterprise-grade AI features | https://github.com/features/copilot/copilot-business |
| Premium SupportEnterprise-grade 24/7 support | https://github.com/premium-support |
| Pricing | https://github.com/pricing |
| Search syntax tips | https://docs.github.com/search-github/github-code-search/understanding-github-code-search-syntax |
| documentation | https://docs.github.com/search-github/github-code-search/understanding-github-code-search-syntax |
|
Sign in
| https://patch-diff.githubusercontent.com/login?return_to=https%3A%2F%2Fgithub.com%2Ftexonom%2Ftransformers.js |
|
Sign up
| https://patch-diff.githubusercontent.com/signup?ref_cta=Sign+up&ref_loc=header+logged+out&ref_page=%2F%3Cuser-name%3E%2F%3Crepo-name%3E&source=header-repo&source_repo=texonom%2Ftransformers.js |
| Reload | https://patch-diff.githubusercontent.com/texonom/transformers.js |
| Reload | https://patch-diff.githubusercontent.com/texonom/transformers.js |
| Reload | https://patch-diff.githubusercontent.com/texonom/transformers.js |
|
texonom
| https://patch-diff.githubusercontent.com/texonom |
| transformers.js | https://patch-diff.githubusercontent.com/texonom/transformers.js |
| huggingface/transformers.js | https://patch-diff.githubusercontent.com/huggingface/transformers.js |
|
Notifications
| https://patch-diff.githubusercontent.com/login?return_to=%2Ftexonom%2Ftransformers.js |
|
Fork
0
| https://patch-diff.githubusercontent.com/login?return_to=%2Ftexonom%2Ftransformers.js |
|
Star
0
| https://patch-diff.githubusercontent.com/login?return_to=%2Ftexonom%2Ftransformers.js |
| huggingface.co/docs/transformers.js | https://huggingface.co/docs/transformers.js |
|
Apache-2.0 license
| https://patch-diff.githubusercontent.com/texonom/transformers.js/blob/main/LICENSE |
|
0
stars
| https://patch-diff.githubusercontent.com/texonom/transformers.js/stargazers |
|
1.1k
forks
| https://patch-diff.githubusercontent.com/texonom/transformers.js/forks |
|
Branches
| https://patch-diff.githubusercontent.com/texonom/transformers.js/branches |
|
Tags
| https://patch-diff.githubusercontent.com/texonom/transformers.js/tags |
|
Activity
| https://patch-diff.githubusercontent.com/texonom/transformers.js/activity |
|
Star
| https://patch-diff.githubusercontent.com/login?return_to=%2Ftexonom%2Ftransformers.js |
|
Notifications
| https://patch-diff.githubusercontent.com/login?return_to=%2Ftexonom%2Ftransformers.js |
|
Code
| https://patch-diff.githubusercontent.com/texonom/transformers.js |
|
Pull requests
0
| https://patch-diff.githubusercontent.com/texonom/transformers.js/pulls |
|
Actions
| https://patch-diff.githubusercontent.com/texonom/transformers.js/actions |
|
Projects
0
| https://patch-diff.githubusercontent.com/texonom/transformers.js/projects |
|
Security
0
| https://patch-diff.githubusercontent.com/texonom/transformers.js/security |
|
Insights
| https://patch-diff.githubusercontent.com/texonom/transformers.js/pulse |
|
Code
| https://patch-diff.githubusercontent.com/texonom/transformers.js |
|
Pull requests
| https://patch-diff.githubusercontent.com/texonom/transformers.js/pulls |
|
Actions
| https://patch-diff.githubusercontent.com/texonom/transformers.js/actions |
|
Projects
| https://patch-diff.githubusercontent.com/texonom/transformers.js/projects |
|
Security
| https://patch-diff.githubusercontent.com/texonom/transformers.js/security |
|
Insights
| https://patch-diff.githubusercontent.com/texonom/transformers.js/pulse |
| Branches | https://patch-diff.githubusercontent.com/texonom/transformers.js/branches |
| Tags | https://patch-diff.githubusercontent.com/texonom/transformers.js/tags |
| https://patch-diff.githubusercontent.com/texonom/transformers.js/branches |
| https://patch-diff.githubusercontent.com/texonom/transformers.js/tags |
| 1,639 Commits | https://patch-diff.githubusercontent.com/texonom/transformers.js/commits/main/ |
| https://patch-diff.githubusercontent.com/texonom/transformers.js/commits/main/ |
| .github | https://patch-diff.githubusercontent.com/texonom/transformers.js/tree/main/.github |
| .github | https://patch-diff.githubusercontent.com/texonom/transformers.js/tree/main/.github |
| docs | https://patch-diff.githubusercontent.com/texonom/transformers.js/tree/main/docs |
| docs | https://patch-diff.githubusercontent.com/texonom/transformers.js/tree/main/docs |
| examples | https://patch-diff.githubusercontent.com/texonom/transformers.js/tree/main/examples |
| examples | https://patch-diff.githubusercontent.com/texonom/transformers.js/tree/main/examples |
| scripts | https://patch-diff.githubusercontent.com/texonom/transformers.js/tree/main/scripts |
| scripts | https://patch-diff.githubusercontent.com/texonom/transformers.js/tree/main/scripts |
| src | https://patch-diff.githubusercontent.com/texonom/transformers.js/tree/main/src |
| src | https://patch-diff.githubusercontent.com/texonom/transformers.js/tree/main/src |
| tests | https://patch-diff.githubusercontent.com/texonom/transformers.js/tree/main/tests |
| tests | https://patch-diff.githubusercontent.com/texonom/transformers.js/tree/main/tests |
| .gitattributes | https://patch-diff.githubusercontent.com/texonom/transformers.js/blob/main/.gitattributes |
| .gitattributes | https://patch-diff.githubusercontent.com/texonom/transformers.js/blob/main/.gitattributes |
| .gitignore | https://patch-diff.githubusercontent.com/texonom/transformers.js/blob/main/.gitignore |
| .gitignore | https://patch-diff.githubusercontent.com/texonom/transformers.js/blob/main/.gitignore |
| .prettierignore | https://patch-diff.githubusercontent.com/texonom/transformers.js/blob/main/.prettierignore |
| .prettierignore | https://patch-diff.githubusercontent.com/texonom/transformers.js/blob/main/.prettierignore |
| .prettierrc | https://patch-diff.githubusercontent.com/texonom/transformers.js/blob/main/.prettierrc |
| .prettierrc | https://patch-diff.githubusercontent.com/texonom/transformers.js/blob/main/.prettierrc |
| LICENSE | https://patch-diff.githubusercontent.com/texonom/transformers.js/blob/main/LICENSE |
| LICENSE | https://patch-diff.githubusercontent.com/texonom/transformers.js/blob/main/LICENSE |
| README.md | https://patch-diff.githubusercontent.com/texonom/transformers.js/blob/main/README.md |
| README.md | https://patch-diff.githubusercontent.com/texonom/transformers.js/blob/main/README.md |
| jest.config.mjs | https://patch-diff.githubusercontent.com/texonom/transformers.js/blob/main/jest.config.mjs |
| jest.config.mjs | https://patch-diff.githubusercontent.com/texonom/transformers.js/blob/main/jest.config.mjs |
| jsconfig.json | https://patch-diff.githubusercontent.com/texonom/transformers.js/blob/main/jsconfig.json |
| jsconfig.json | https://patch-diff.githubusercontent.com/texonom/transformers.js/blob/main/jsconfig.json |
| package-lock.json | https://patch-diff.githubusercontent.com/texonom/transformers.js/blob/main/package-lock.json |
| package-lock.json | https://patch-diff.githubusercontent.com/texonom/transformers.js/blob/main/package-lock.json |
| package.json | https://patch-diff.githubusercontent.com/texonom/transformers.js/blob/main/package.json |
| package.json | https://patch-diff.githubusercontent.com/texonom/transformers.js/blob/main/package.json |
| webpack.config.js | https://patch-diff.githubusercontent.com/texonom/transformers.js/blob/main/webpack.config.js |
| webpack.config.js | https://patch-diff.githubusercontent.com/texonom/transformers.js/blob/main/webpack.config.js |
| README | https://patch-diff.githubusercontent.com/texonom/transformers.js |
| License | https://patch-diff.githubusercontent.com/texonom/transformers.js |
| https://www.npmjs.com/package/@huggingface/transformers |
| https://www.npmjs.com/package/@huggingface/transformers |
| https://www.jsdelivr.com/package/npm/@huggingface/transformers |
| https://github.com/huggingface/transformers.js/blob/main/LICENSE |
| https://huggingface.co/docs/transformers.js/index |
| https://patch-diff.githubusercontent.com/texonom/transformers.js#--state-of-the-art-machine-learning-for-the-web |
| transformers | https://github.com/huggingface/transformers |
| ONNX Runtime | https://onnxruntime.ai/ |
| convert | https://patch-diff.githubusercontent.com/texonom/transformers.js#convert-your-models-to-onnx |
| π€ Optimum | https://github.com/huggingface/optimum#onnx--onnx-runtime |
| documentation | https://huggingface.co/docs/transformers.js |
| https://patch-diff.githubusercontent.com/texonom/transformers.js#installation |
| NPM | https://www.npmjs.com/package/@huggingface/transformers |
| ES Modules | https://developer.mozilla.org/en-US/docs/Web/JavaScript/Guide/Modules |
| https://patch-diff.githubusercontent.com/texonom/transformers.js#quick-tour |
| WebGPU guide | https://huggingface.co/docs/transformers.js/guides/webgpu |
| bug report | https://github.com/huggingface/transformers.js/issues/new?title=%5BWebGPU%5D%20Error%20running%20MODEL_ID_GOES_HERE&assignees=&labels=bug,webgpu&projects=&template=1_bug-report.yml |
| quantization guide | https://huggingface.co/docs/transformers.js/guides/dtypes |
| https://patch-diff.githubusercontent.com/texonom/transformers.js#examples |
| here | https://github.com/huggingface/transformers.js-examples |
| code | https://github.com/xenova/whisper-web |
| demo | https://huggingface.co/spaces/Xenova/whisper-web |
| blog | https://huggingface.co/blog/ml-web-games |
| code | https://github.com/xenova/doodle-dash |
| demo | https://huggingface.co/spaces/Xenova/doodle-dash |
| code | https://github.com/huggingface/transformers.js/tree/main/examples/code-completion/ |
| demo | https://huggingface.co/spaces/Xenova/ai-code-playground |
| code | https://github.com/huggingface/transformers.js/tree/main/examples/semantic-image-search-client/ |
| demo | https://huggingface.co/spaces/Xenova/semantic-image-search-client |
| code | https://github.com/huggingface/transformers.js/tree/main/examples/semantic-image-search/ |
| demo | https://huggingface.co/spaces/Xenova/semantic-image-search |
| video | https://scrimba.com/scrim/cKm9bDAg |
| code | https://github.com/huggingface/transformers.js/tree/main/examples/vanilla-js/ |
| demo | https://huggingface.co/spaces/Scrimba/vanilla-js-object-detector |
| code | https://github.com/huggingface/transformers.js/tree/main/examples/react-translator/ |
| demo | https://huggingface.co/spaces/Xenova/react-translator |
| code | https://github.com/huggingface/transformers.js/tree/main/examples/text-to-speech-client/ |
| demo | https://huggingface.co/spaces/Xenova/text-to-speech-client |
| code | https://github.com/huggingface/transformers.js/tree/main/examples/extension/ |
| code | https://github.com/huggingface/transformers.js/tree/main/examples/electron/ |
| code | https://github.com/huggingface/transformers.js/tree/main/examples/next-client/ |
| demo | https://huggingface.co/spaces/Xenova/next-example-app |
| code | https://github.com/huggingface/transformers.js/tree/main/examples/next-server/ |
| demo | https://huggingface.co/spaces/Xenova/next-server-example-app |
| code | https://github.com/huggingface/transformers.js/tree/main/examples/node/ |
| code | https://github.com/huggingface/transformers.js/tree/main/examples/demo-site/ |
| demo | https://xenova.github.io/transformers.js/ |
| template | https://huggingface.co/new-space?template=static-templates%2Ftransformers.js |
| https://patch-diff.githubusercontent.com/texonom/transformers.js#custom-usage |
| hosted pretrained models | https://huggingface.co/models?library=transformers.js |
| precompiled WASM binaries | https://cdn.jsdelivr.net/npm/@huggingface/transformers@3.1.1/dist/ |
| https://patch-diff.githubusercontent.com/texonom/transformers.js#settings |
| API Reference | https://huggingface.co/docs/transformers.js/api/env |
| https://patch-diff.githubusercontent.com/texonom/transformers.js#convert-your-models-to-onnx |
| conversion script | https://github.com/huggingface/transformers.js/blob/main/scripts/convert.py |
| π€ Optimum | https://huggingface.co/docs/optimum |
| bert-base-uncased | https://huggingface.co/bert-base-uncased |
| Optimum documentation | https://huggingface.co/docs/optimum/main/en/exporters/onnx/overview |
| https://patch-diff.githubusercontent.com/texonom/transformers.js#supported-tasksmodels |
| here | https://github.com/huggingface/transformers.js/issues/new/choose |
| this link | https://huggingface.co/models?library=transformers.js |
| text-classification | https://huggingface.co/models?pipeline_tag=text-classification&library=transformers.js |
| https://patch-diff.githubusercontent.com/texonom/transformers.js#tasks |
| https://patch-diff.githubusercontent.com/texonom/transformers.js#natural-language-processing |
| Fill-Mask | https://huggingface.co/tasks/fill-mask |
| (docs) | https://huggingface.co/docs/transformers.js/api/pipelines#module_pipelines.FillMaskPipeline |
| (models) | https://huggingface.co/models?pipeline_tag=fill-mask&library=transformers.js |
| Question Answering | https://huggingface.co/tasks/question-answering |
| (docs) | https://huggingface.co/docs/transformers.js/api/pipelines#module_pipelines.QuestionAnsweringPipeline |
| (models) | https://huggingface.co/models?pipeline_tag=question-answering&library=transformers.js |
| Sentence Similarity | https://huggingface.co/tasks/sentence-similarity |
| (docs) | https://huggingface.co/docs/transformers.js/api/pipelines#module_pipelines.FeatureExtractionPipeline |
| (models) | https://huggingface.co/models?pipeline_tag=sentence-similarity&library=transformers.js |
| Summarization | https://huggingface.co/tasks/summarization |
| (docs) | https://huggingface.co/docs/transformers.js/api/pipelines#module_pipelines.SummarizationPipeline |
| (models) | https://huggingface.co/models?pipeline_tag=summarization&library=transformers.js |
| Table Question Answering | https://huggingface.co/tasks/table-question-answering |
| Text Classification | https://huggingface.co/tasks/text-classification |
| (docs) | https://huggingface.co/docs/transformers.js/api/pipelines#module_pipelines.TextClassificationPipeline |
| (models) | https://huggingface.co/models?pipeline_tag=text-classification&library=transformers.js |
| Text Generation | https://huggingface.co/tasks/text-generation#completion-generation-models |
| (docs) | https://huggingface.co/docs/transformers.js/api/pipelines#module_pipelines.TextGenerationPipeline |
| (models) | https://huggingface.co/models?pipeline_tag=text-generation&library=transformers.js |
| Text-to-text Generation | https://huggingface.co/tasks/text-generation#text-to-text-generation-models |
| (docs) | https://huggingface.co/docs/transformers.js/api/pipelines#module_pipelines.Text2TextGenerationPipeline |
| (models) | https://huggingface.co/models?pipeline_tag=text2text-generation&library=transformers.js |
| Token Classification | https://huggingface.co/tasks/token-classification |
| (docs) | https://huggingface.co/docs/transformers.js/api/pipelines#module_pipelines.TokenClassificationPipeline |
| (models) | https://huggingface.co/models?pipeline_tag=token-classification&library=transformers.js |
| Translation | https://huggingface.co/tasks/translation |
| (docs) | https://huggingface.co/docs/transformers.js/api/pipelines#module_pipelines.TranslationPipeline |
| (models) | https://huggingface.co/models?pipeline_tag=translation&library=transformers.js |
| Zero-Shot Classification | https://huggingface.co/tasks/zero-shot-classification |
| (docs) | https://huggingface.co/docs/transformers.js/api/pipelines#module_pipelines.ZeroShotClassificationPipeline |
| (models) | https://huggingface.co/models?pipeline_tag=zero-shot-classification&library=transformers.js |
| Feature Extraction | https://huggingface.co/tasks/feature-extraction |
| (docs) | https://huggingface.co/docs/transformers.js/api/pipelines#module_pipelines.FeatureExtractionPipeline |
| (models) | https://huggingface.co/models?pipeline_tag=feature-extraction&library=transformers.js |
| https://patch-diff.githubusercontent.com/texonom/transformers.js#vision |
| Depth Estimation | https://huggingface.co/tasks/depth-estimation |
| (docs) | https://huggingface.co/docs/transformers.js/api/pipelines#module_pipelines.DepthEstimationPipeline |
| (models) | https://huggingface.co/models?pipeline_tag=depth-estimation&library=transformers.js |
| Image Classification | https://huggingface.co/tasks/image-classification |
| (docs) | https://huggingface.co/docs/transformers.js/api/pipelines#module_pipelines.ImageClassificationPipeline |
| (models) | https://huggingface.co/models?pipeline_tag=image-classification&library=transformers.js |
| Image Segmentation | https://huggingface.co/tasks/image-segmentation |
| (docs) | https://huggingface.co/docs/transformers.js/api/pipelines#module_pipelines.ImageSegmentationPipeline |
| (models) | https://huggingface.co/models?pipeline_tag=image-segmentation&library=transformers.js |
| Image-to-Image | https://huggingface.co/tasks/image-to-image |
| (docs) | https://huggingface.co/docs/transformers.js/api/pipelines#module_pipelines.ImageToImagePipeline |
| (models) | https://huggingface.co/models?pipeline_tag=image-to-image&library=transformers.js |
| Mask Generation | https://huggingface.co/tasks/mask-generation |
| Object Detection | https://huggingface.co/tasks/object-detection |
| (docs) | https://huggingface.co/docs/transformers.js/api/pipelines#module_pipelines.ObjectDetectionPipeline |
| (models) | https://huggingface.co/models?pipeline_tag=object-detection&library=transformers.js |
| Video Classification | https://huggingface.co/tasks/video-classification |
| Unconditional Image Generation | https://huggingface.co/tasks/unconditional-image-generation |
| Image Feature Extraction | https://huggingface.co/tasks/image-feature-extraction |
| (docs) | https://huggingface.co/docs/transformers.js/api/pipelines#module_pipelines.ImageFeatureExtractionPipeline |
| (models) | https://huggingface.co/models?pipeline_tag=image-feature-extraction&library=transformers.js |
| https://patch-diff.githubusercontent.com/texonom/transformers.js#audio |
| Audio Classification | https://huggingface.co/tasks/audio-classification |
| (docs) | https://huggingface.co/docs/transformers.js/api/pipelines#module_pipelines.AudioClassificationPipeline |
| (models) | https://huggingface.co/models?pipeline_tag=audio-classification&library=transformers.js |
| Audio-to-Audio | https://huggingface.co/tasks/audio-to-audio |
| Automatic Speech Recognition | https://huggingface.co/tasks/automatic-speech-recognition |
| (docs) | https://huggingface.co/docs/transformers.js/api/pipelines#module_pipelines.AutomaticSpeechRecognitionPipeline |
| (models) | https://huggingface.co/models?pipeline_tag=automatic-speech-recognition&library=transformers.js |
| Text-to-Speech | https://huggingface.co/tasks/text-to-speech |
| (docs) | https://huggingface.co/docs/transformers.js/api/pipelines#module_pipelines.TextToAudioPipeline |
| (models) | https://huggingface.co/models?pipeline_tag=text-to-audio&library=transformers.js |
| https://patch-diff.githubusercontent.com/texonom/transformers.js#tabular |
| Tabular Classification | https://huggingface.co/tasks/tabular-classification |
| Tabular Regression | https://huggingface.co/tasks/tabular-regression |
| https://patch-diff.githubusercontent.com/texonom/transformers.js#multimodal |
| Document Question Answering | https://huggingface.co/tasks/document-question-answering |
| (docs) | https://huggingface.co/docs/transformers.js/api/pipelines#module_pipelines.DocumentQuestionAnsweringPipeline |
| (models) | https://huggingface.co/models?pipeline_tag=document-question-answering&library=transformers.js |
| Image-to-Text | https://huggingface.co/tasks/image-to-text |
| (docs) | https://huggingface.co/docs/transformers.js/api/pipelines#module_pipelines.ImageToTextPipeline |
| (models) | https://huggingface.co/models?pipeline_tag=image-to-text&library=transformers.js |
| Text-to-Image | https://huggingface.co/tasks/text-to-image |
| Visual Question Answering | https://huggingface.co/tasks/visual-question-answering |
| Zero-Shot Audio Classification | https://huggingface.co/learn/audio-course/chapter4/classification_models#zero-shot-audio-classification |
| (docs) | https://huggingface.co/docs/transformers.js/api/pipelines#module_pipelines.ZeroShotAudioClassificationPipeline |
| (models) | https://huggingface.co/models?other=zero-shot-audio-classification&library=transformers.js |
| Zero-Shot Image Classification | https://huggingface.co/tasks/zero-shot-image-classification |
| (docs) | https://huggingface.co/docs/transformers.js/api/pipelines#module_pipelines.ZeroShotImageClassificationPipeline |
| (models) | https://huggingface.co/models?pipeline_tag=zero-shot-image-classification&library=transformers.js |
| Zero-Shot Object Detection | https://huggingface.co/tasks/zero-shot-object-detection |
| (docs) | https://huggingface.co/docs/transformers.js/api/pipelines#module_pipelines.ZeroShotObjectDetectionPipeline |
| (models) | https://huggingface.co/models?other=zero-shot-object-detection&library=transformers.js |
| https://patch-diff.githubusercontent.com/texonom/transformers.js#reinforcement-learning |
| Reinforcement Learning | https://huggingface.co/tasks/reinforcement-learning |
| https://patch-diff.githubusercontent.com/texonom/transformers.js#models |
| ALBERT | https://huggingface.co/docs/transformers/model_doc/albert |
| ALBERT: A Lite BERT for Self-supervised Learning of Language Representations | https://arxiv.org/abs/1909.11942 |
| Audio Spectrogram Transformer | https://huggingface.co/docs/transformers/model_doc/audio-spectrogram-transformer |
| AST: Audio Spectrogram Transformer | https://arxiv.org/abs/2104.01778 |
| BART | https://huggingface.co/docs/transformers/model_doc/bart |
| BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension | https://arxiv.org/abs/1910.13461 |
| BEiT | https://huggingface.co/docs/transformers/model_doc/beit |
| BEiT: BERT Pre-Training of Image Transformers | https://arxiv.org/abs/2106.08254 |
| BERT | https://huggingface.co/docs/transformers/model_doc/bert |
| BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding | https://arxiv.org/abs/1810.04805 |
| Blenderbot | https://huggingface.co/docs/transformers/model_doc/blenderbot |
| Recipes for building an open-domain chatbot | https://arxiv.org/abs/2004.13637 |
| BlenderbotSmall | https://huggingface.co/docs/transformers/model_doc/blenderbot-small |
| Recipes for building an open-domain chatbot | https://arxiv.org/abs/2004.13637 |
| BLOOM | https://huggingface.co/docs/transformers/model_doc/bloom |
| BigScience Workshop | https://bigscience.huggingface.co/ |
| CamemBERT | https://huggingface.co/docs/transformers/model_doc/camembert |
| CamemBERT: a Tasty French Language Model | https://arxiv.org/abs/1911.03894 |
| Chinese-CLIP | https://huggingface.co/docs/transformers/model_doc/chinese_clip |
| Chinese CLIP: Contrastive Vision-Language Pretraining in Chinese | https://arxiv.org/abs/2211.01335 |
| CLAP | https://huggingface.co/docs/transformers/model_doc/clap |
| Large-scale Contrastive Language-Audio Pretraining with Feature Fusion and Keyword-to-Caption Augmentation | https://arxiv.org/abs/2211.06687 |
| CLIP | https://huggingface.co/docs/transformers/model_doc/clip |
| Learning Transferable Visual Models From Natural Language Supervision | https://arxiv.org/abs/2103.00020 |
| CLIPSeg | https://huggingface.co/docs/transformers/model_doc/clipseg |
| Image Segmentation Using Text and Image Prompts | https://arxiv.org/abs/2112.10003 |
| CodeGen | https://huggingface.co/docs/transformers/model_doc/codegen |
| A Conversational Paradigm for Program Synthesis | https://arxiv.org/abs/2203.13474 |
| CodeLlama | https://huggingface.co/docs/transformers/model_doc/llama_code |
| Code Llama: Open Foundation Models for Code | https://ai.meta.com/research/publications/code-llama-open-foundation-models-for-code/ |
| Cohere | https://huggingface.co/docs/transformers/main/model_doc/cohere |
| Command-R: Retrieval Augmented Generation at Production Scale | https://txt.cohere.com/command-r/ |
| ConvBERT | https://huggingface.co/docs/transformers/model_doc/convbert |
| ConvBERT: Improving BERT with Span-based Dynamic Convolution | https://arxiv.org/abs/2008.02496 |
| ConvNeXT | https://huggingface.co/docs/transformers/model_doc/convnext |
| A ConvNet for the 2020s | https://arxiv.org/abs/2201.03545 |
| ConvNeXTV2 | https://huggingface.co/docs/transformers/model_doc/convnextv2 |
| ConvNeXt V2: Co-designing and Scaling ConvNets with Masked Autoencoders | https://arxiv.org/abs/2301.00808 |
| DeBERTa | https://huggingface.co/docs/transformers/model_doc/deberta |
| DeBERTa: Decoding-enhanced BERT with Disentangled Attention | https://arxiv.org/abs/2006.03654 |
| DeBERTa-v2 | https://huggingface.co/docs/transformers/model_doc/deberta-v2 |
| DeBERTa: Decoding-enhanced BERT with Disentangled Attention | https://arxiv.org/abs/2006.03654 |
| Decision Transformer | https://huggingface.co/docs/transformers/model_doc/decision_transformer |
| Decision Transformer: Reinforcement Learning via Sequence Modeling | https://arxiv.org/abs/2106.01345 |
| DeiT | https://huggingface.co/docs/transformers/model_doc/deit |
| Training data-efficient image transformers & distillation through attention | https://arxiv.org/abs/2012.12877 |
| Depth Anything | https://huggingface.co/docs/transformers/main/model_doc/depth_anything |
| Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data | https://arxiv.org/abs/2401.10891 |
| Depth Pro: Sharp Monocular Metric Depth in Less Than a Second | https://arxiv.org/abs/2410.02073 |
| DETR | https://huggingface.co/docs/transformers/model_doc/detr |
| End-to-End Object Detection with Transformers | https://arxiv.org/abs/2005.12872 |
| DINOv2 | https://huggingface.co/docs/transformers/model_doc/dinov2 |
| DINOv2: Learning Robust Visual Features without Supervision | https://arxiv.org/abs/2304.07193 |
| DistilBERT | https://huggingface.co/docs/transformers/model_doc/distilbert |
| DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter | https://arxiv.org/abs/1910.01108 |
| DistilGPT2 | https://github.com/huggingface/transformers/tree/main/examples/research_projects/distillation |
| DistilRoBERTa | https://github.com/huggingface/transformers/tree/main/examples/research_projects/distillation |
| DistilmBERT | https://github.com/huggingface/transformers/tree/main/examples/research_projects/distillation |
| DiT | https://huggingface.co/docs/transformers/model_doc/dit |
| DiT: Self-supervised Pre-training for Document Image Transformer | https://arxiv.org/abs/2203.02378 |
| Donut | https://huggingface.co/docs/transformers/model_doc/donut |
| OCR-free Document Understanding Transformer | https://arxiv.org/abs/2111.15664 |
| DPT | https://huggingface.co/docs/transformers/master/model_doc/dpt |
| Vision Transformers for Dense Prediction | https://arxiv.org/abs/2103.13413 |
| EfficientNet | https://huggingface.co/docs/transformers/model_doc/efficientnet |
| EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks | https://arxiv.org/abs/1905.11946 |
| ELECTRA | https://huggingface.co/docs/transformers/model_doc/electra |
| ELECTRA: Pre-training text encoders as discriminators rather than generators | https://arxiv.org/abs/2003.10555 |
| ESM | https://huggingface.co/docs/transformers/model_doc/esm |
| Biological structure and function emerge from scaling unsupervised learning to 250 million protein sequences | https://www.pnas.org/content/118/15/e2016239118 |
| Language models enable zero-shot prediction of the effects of mutations on protein function | https://doi.org/10.1101/2021.07.09.450648 |
| Language models of protein sequences at the scale of evolution enable accurate structure prediction | https://doi.org/10.1101/2022.07.20.500902 |
| Falcon | https://huggingface.co/docs/transformers/model_doc/falcon |
| FastViT: A Fast Hybrid Vision Transformer using Structural Reparameterization | https://arxiv.org/abs/2303.14189 |
| FLAN-T5 | https://huggingface.co/docs/transformers/model_doc/flan-t5 |
| google-research/t5x | https://github.com/google-research/t5x/blob/main/docs/models.md#flan-t5-checkpoints |
| Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks | https://arxiv.org/abs/2311.06242 |
| Gemma | https://huggingface.co/docs/transformers/main/model_doc/gemma |
| Gemma: Open Models Based on Gemini Technology and Research | https://blog.google/technology/developers/gemma-open-models/ |
| Gemma2 | https://huggingface.co/docs/transformers/main/model_doc/gemma2 |
| Gemma2: Open Models Based on Gemini Technology and Research | https://blog.google/technology/developers/google-gemma-2/ |
| GLPN | https://huggingface.co/docs/transformers/model_doc/glpn |
| Global-Local Path Networks for Monocular Depth Estimation with Vertical CutDepth | https://arxiv.org/abs/2201.07436 |
| GPT Neo | https://huggingface.co/docs/transformers/model_doc/gpt_neo |
| EleutherAI/gpt-neo | https://github.com/EleutherAI/gpt-neo |
| GPT NeoX | https://huggingface.co/docs/transformers/model_doc/gpt_neox |
| GPT-NeoX-20B: An Open-Source Autoregressive Language Model | https://arxiv.org/abs/2204.06745 |
| GPT-2 | https://huggingface.co/docs/transformers/model_doc/gpt2 |
| Language Models are Unsupervised Multitask Learners | https://blog.openai.com/better-language-models/ |
| GPT-J | https://huggingface.co/docs/transformers/model_doc/gptj |
| kingoflolz/mesh-transformer-jax | https://github.com/kingoflolz/mesh-transformer-jax/ |
| GPTBigCode | https://huggingface.co/docs/transformers/model_doc/gpt_bigcode |
| SantaCoder: don't reach for the stars! | https://arxiv.org/abs/2301.03988 |
| Granite | https://huggingface.co/docs/transformers/main/model_doc/granite |
| Power Scheduler: A Batch Size and Token Number Agnostic Learning Rate Scheduler | https://arxiv.org/abs/2408.13359 |
| GroupViT | https://huggingface.co/docs/transformers/model_doc/groupvit |
| GroupViT: Semantic Segmentation Emerges from Text Supervision | https://arxiv.org/abs/2202.11094 |
| HerBERT | https://huggingface.co/docs/transformers/model_doc/herbert |
| KLEJ: Comprehensive Benchmark for Polish Language Understanding | https://www.aclweb.org/anthology/2020.acl-main.111.pdf |
| Hiera | https://huggingface.co/docs/transformers/model_doc/hiera |
| Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles | https://arxiv.org/pdf/2306.00989 |
| Hubert | https://huggingface.co/docs/transformers/model_doc/hubert |
| HuBERT: Self-Supervised Speech Representation Learning by Masked Prediction of Hidden Units | https://arxiv.org/abs/2106.07447 |
| Idefics3 | https://huggingface.co/docs/transformers/model_doc/idefics3 |
| Building and better understanding vision-language models: insights and future directions | https://arxiv.org/abs/2408.12637 |
| Jais and Jais-chat: Arabic-Centric Foundation and Instruction-Tuned Open Generative Large Language Models | https://arxiv.org/pdf/2308.16149 |
| Janus: Decoupling Visual Encoding for Unified Multimodal Understanding and Generation | https://arxiv.org/abs/2410.13848 |
| Jina CLIP: Your CLIP Model Is Also Your Text Retriever | https://arxiv.org/abs/2405.20204 |
| LongT5 | https://huggingface.co/docs/transformers/model_doc/longt5 |
| LongT5: Efficient Text-To-Text Transformer for Long Sequences | https://arxiv.org/abs/2112.07916 |
| LLaMA | https://huggingface.co/docs/transformers/model_doc/llama |
| LLaMA: Open and Efficient Foundation Language Models | https://arxiv.org/abs/2302.13971 |
| Llama2 | https://huggingface.co/docs/transformers/model_doc/llama2 |
| Llama2: Open Foundation and Fine-Tuned Chat Models | https://ai.meta.com/research/publications/llama-2-open-foundation-and-fine-tuned-chat-models/XXX |
| LLaVa | https://huggingface.co/docs/transformers/model_doc/llava |
| Visual Instruction Tuning | https://arxiv.org/abs/2304.08485 |
| LLaVA-OneVision | https://huggingface.co/docs/transformers/model_doc/llava_onevision |
| LLaVA-OneVision: Easy Visual Task Transfer | https://arxiv.org/abs/2408.03326 |
| M2M100 | https://huggingface.co/docs/transformers/model_doc/m2m_100 |
| Beyond English-Centric Multilingual Machine Translation | https://arxiv.org/abs/2010.11125 |
| MarianMT | https://huggingface.co/docs/transformers/model_doc/marian |
| OPUS | http://opus.nlpl.eu/ |
| Marian Framework | https://marian-nmt.github.io/ |
| MaskFormer | https://huggingface.co/docs/transformers/model_doc/maskformer |
| Per-Pixel Classification is Not All You Need for Semantic Segmentation | https://arxiv.org/abs/2107.06278 |
| mBART | https://huggingface.co/docs/transformers/model_doc/mbart |
| Multilingual Denoising Pre-training for Neural Machine Translation | https://arxiv.org/abs/2001.08210 |
| mBART-50 | https://huggingface.co/docs/transformers/model_doc/mbart |
| Multilingual Translation with Extensible Multilingual Pretraining and Finetuning | https://arxiv.org/abs/2008.00401 |
| MusicGen | https://huggingface.co/docs/transformers/model_doc/musicgen |
| Simple and Controllable Music Generation | https://arxiv.org/abs/2306.05284 |
| MGP-STR | https://huggingface.co/docs/transformers/model_doc/mgp-str |
| Multi-Granularity Prediction for Scene Text Recognition | https://arxiv.org/abs/2209.03592 |
| Mistral | https://huggingface.co/docs/transformers/model_doc/mistral |
| Mistral AI | https://mistral.ai |
| MMS | https://huggingface.co/docs/transformers/model_doc/mms |
| Scaling Speech Technology to 1,000+ Languages | https://arxiv.org/abs/2305.13516 |
| MobileBERT | https://huggingface.co/docs/transformers/model_doc/mobilebert |
| MobileBERT: a Compact Task-Agnostic BERT for Resource-Limited Devices | https://arxiv.org/abs/2004.02984 |
| MobileCLIP: Fast Image-Text Models through Multi-Modal Reinforced Training | https://arxiv.org/abs/2311.17049 |
| MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases | https://arxiv.org/abs/2402.14905 |
| MobileNetV1 | https://huggingface.co/docs/transformers/model_doc/mobilenet_v1 |
| MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications | https://arxiv.org/abs/1704.04861 |
| MobileNetV2 | https://huggingface.co/docs/transformers/model_doc/mobilenet_v2 |
| MobileNetV2: Inverted Residuals and Linear Bottlenecks | https://arxiv.org/abs/1801.04381 |
| Searching for MobileNetV3 | https://arxiv.org/abs/1905.02244 |
| MobileNetV4 - Universal Models for the Mobile Ecosystem | https://arxiv.org/abs/2404.10518 |
| MobileViT | https://huggingface.co/docs/transformers/model_doc/mobilevit |
| MobileViT: Light-weight, General-purpose, and Mobile-friendly Vision Transformer | https://arxiv.org/abs/2110.02178 |
| MobileViTV2 | https://huggingface.co/docs/transformers/model_doc/mobilevitv2 |
| Separable Self-attention for Mobile Vision Transformers | https://arxiv.org/abs/2206.02680 |
| moondream | https://github.com/vikhyat/moondream |
| MPNet | https://huggingface.co/docs/transformers/model_doc/mpnet |
| MPNet: Masked and Permuted Pre-training for Language Understanding | https://arxiv.org/abs/2004.09297 |
| MPT | https://huggingface.co/docs/transformers/model_doc/mpt |
| llm-foundry | https://github.com/mosaicml/llm-foundry/ |
| MT5 | https://huggingface.co/docs/transformers/model_doc/mt5 |
| mT5: A massively multilingual pre-trained text-to-text transformer | https://arxiv.org/abs/2010.11934 |
| NLLB | https://huggingface.co/docs/transformers/model_doc/nllb |
| No Language Left Behind: Scaling Human-Centered Machine Translation | https://arxiv.org/abs/2207.04672 |
| Nougat | https://huggingface.co/docs/transformers/model_doc/nougat |
| Nougat: Neural Optical Understanding for Academic Documents | https://arxiv.org/abs/2308.13418 |
| OLMo | https://huggingface.co/docs/transformers/master/model_doc/olmo |
| OLMo: Accelerating the Science of Language Models | https://arxiv.org/abs/2402.00838 |
| OpenELM: An Efficient Language Model Family with Open-source Training and Inference Framework | https://arxiv.org/abs/2404.14619 |
| OPT | https://huggingface.co/docs/transformers/master/model_doc/opt |
| OPT: Open Pre-trained Transformer Language Models | https://arxiv.org/abs/2205.01068 |
| OWL-ViT | https://huggingface.co/docs/transformers/model_doc/owlvit |
| Simple Open-Vocabulary Object Detection with Vision Transformers | https://arxiv.org/abs/2205.06230 |
| OWLv2 | https://huggingface.co/docs/transformers/model_doc/owlv2 |
| Scaling Open-Vocabulary Object Detection | https://arxiv.org/abs/2306.09683 |
| PatchTSMixer | https://huggingface.co/docs/transformers/main/model_doc/patchtsmixer |
| TSMixer: Lightweight MLP-Mixer Model for Multivariate Time Series Forecasting | https://arxiv.org/abs/2306.09364 |
| PatchTST | https://huggingface.co/docs/transformers/main/model_doc/patchtst |
| A Time Series is Worth 64 Words: Long-term Forecasting with Transformers | https://arxiv.org/abs/2211.14730 |
| Phi | https://huggingface.co/docs/transformers/main/model_doc/phi |
| Textbooks Are All You Need | https://arxiv.org/abs/2306.11644 |
| Textbooks Are All You Need II: phi-1.5 technical report | https://arxiv.org/abs/2309.05463 |
| Phi3 | https://huggingface.co/docs/transformers/main/model_doc/phi3 |
| Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone | https://arxiv.org/abs/2404.14219 |
| PVT | https://huggingface.co/docs/transformers/main/model_doc/pvt |
| Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions | https://arxiv.org/pdf/2102.12122.pdf |
| pyannote/pyannote-audio | https://github.com/pyannote/pyannote-audio |
| Qwen2 | https://huggingface.co/docs/transformers/model_doc/qwen2 |
| Qwen Technical Report | https://arxiv.org/abs/2309.16609 |
| Qwen2-VL | https://huggingface.co/docs/transformers/model_doc/qwen2_vl |
| Qwen-VL: A Versatile Vision-Language Model for Understanding, Localization, Text Reading, and Beyond | https://arxiv.org/abs/2308.12966 |
| ResNet | https://huggingface.co/docs/transformers/model_doc/resnet |
| Deep Residual Learning for Image Recognition | https://arxiv.org/abs/1512.03385 |
| RoBERTa | https://huggingface.co/docs/transformers/model_doc/roberta |
| RoBERTa: A Robustly Optimized BERT Pretraining Approach | https://arxiv.org/abs/1907.11692 |
| RoFormer | https://huggingface.co/docs/transformers/model_doc/roformer |
| RoFormer: Enhanced Transformer with Rotary Position Embedding | https://arxiv.org/abs/2104.09864 |
| RT-DETR | https://huggingface.co/docs/transformers/model_doc/rt_detr |
| DETRs Beat YOLOs on Real-time Object Detection | https://arxiv.org/abs/2304.08069 |
| Sapiens: Foundation for Human Vision Models | https://arxiv.org/pdf/2408.12569 |
| SegFormer | https://huggingface.co/docs/transformers/model_doc/segformer |
| SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers | https://arxiv.org/abs/2105.15203 |
| Segment Anything | https://huggingface.co/docs/transformers/model_doc/sam |
| Segment Anything | https://arxiv.org/pdf/2304.02643v1.pdf |
| SigLIP | https://huggingface.co/docs/transformers/main/model_doc/siglip |
| Sigmoid Loss for Language Image Pre-Training | https://arxiv.org/abs/2303.15343 |
| SpeechT5 | https://huggingface.co/docs/transformers/model_doc/speecht5 |
| SpeechT5: Unified-Modal Encoder-Decoder Pre-Training for Spoken Language Processing | https://arxiv.org/abs/2110.07205 |
| SqueezeBERT | https://huggingface.co/docs/transformers/model_doc/squeezebert |
| SqueezeBERT: What can computer vision teach NLP about efficient neural networks? | https://arxiv.org/abs/2006.11316 |
| StableLm | https://huggingface.co/docs/transformers/model_doc/stablelm |
| StableLM 3B 4E1T (Technical Report) | https://stability.wandb.io/stability-llm/stable-lm/reports/StableLM-3B-4E1T--VmlldzoyMjU4?accessToken=u3zujipenkx5g7rtcj9qojjgxpconyjktjkli2po09nffrffdhhchq045vp0wyfo |
| Starcoder2 | https://huggingface.co/docs/transformers/main/model_doc/starcoder2 |
| StarCoder 2 and The Stack v2: The Next Generation | https://arxiv.org/abs/2402.19173 |
| Swin Transformer | https://huggingface.co/docs/transformers/model_doc/swin |
| Swin Transformer: Hierarchical Vision Transformer using Shifted Windows | https://arxiv.org/abs/2103.14030 |
| Swin2SR | https://huggingface.co/docs/transformers/model_doc/swin2sr |
| Swin2SR: SwinV2 Transformer for Compressed Image Super-Resolution and Restoration | https://arxiv.org/abs/2209.11345 |
| T5 | https://huggingface.co/docs/transformers/model_doc/t5 |
| Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer | https://arxiv.org/abs/1910.10683 |
| T5v1.1 | https://huggingface.co/docs/transformers/model_doc/t5v1.1 |
| google-research/text-to-text-transfer-transformer | https://github.com/google-research/text-to-text-transfer-transformer/blob/main/released_checkpoints.md#t511 |
| Table Transformer | https://huggingface.co/docs/transformers/model_doc/table-transformer |
| PubTables-1M: Towards Comprehensive Table Extraction From Unstructured Documents | https://arxiv.org/abs/2110.00061 |
| TrOCR | https://huggingface.co/docs/transformers/model_doc/trocr |
| TrOCR: Transformer-based Optical Character Recognition with Pre-trained Models | https://arxiv.org/abs/2109.10282 |
| UniSpeech | https://huggingface.co/docs/transformers/model_doc/unispeech |
| UniSpeech: Unified Speech Representation Learning with Labeled and Unlabeled Data | https://arxiv.org/abs/2101.07597 |
| UniSpeechSat | https://huggingface.co/docs/transformers/model_doc/unispeech-sat |
| UNISPEECH-SAT: UNIVERSAL SPEECH REPRESENTATION LEARNING WITH SPEAKER AWARE PRE-TRAINING | https://arxiv.org/abs/2110.05752 |
| Vision Transformer (ViT) | https://huggingface.co/docs/transformers/model_doc/vit |
| An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale | https://arxiv.org/abs/2010.11929 |
| ViTMAE | https://huggingface.co/docs/transformers/model_doc/vit_mae |
| Masked Autoencoders Are Scalable Vision Learners | https://arxiv.org/abs/2111.06377 |
| ViTMatte | https://huggingface.co/docs/transformers/model_doc/vitmatte |
| ViTMatte: Boosting Image Matting with Pretrained Plain Vision Transformers | https://arxiv.org/abs/2305.15272 |
| ViTMSN | https://huggingface.co/docs/transformers/model_doc/vit_msn |
| Masked Siamese Networks for Label-Efficient Learning | https://arxiv.org/abs/2204.07141 |
| ViTPose | https://huggingface.co/docs/transformers/model_doc/vitpose |
| ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation | https://arxiv.org/abs/2204.12484 |
| VITS | https://huggingface.co/docs/transformers/model_doc/vits |
| Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech | https://arxiv.org/abs/2106.06103 |
| Wav2Vec2 | https://huggingface.co/docs/transformers/model_doc/wav2vec2 |
| wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations | https://arxiv.org/abs/2006.11477 |
| Wav2Vec2-BERT | https://huggingface.co/docs/transformers/main/model_doc/wav2vec2-bert |
| Seamless: Multilingual Expressive and Streaming Speech Translation | https://ai.meta.com/research/publications/seamless-multilingual-expressive-and-streaming-speech-translation/ |
| WavLM | https://huggingface.co/docs/transformers/model_doc/wavlm |
| WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing | https://arxiv.org/abs/2110.13900 |
| Whisper | https://huggingface.co/docs/transformers/model_doc/whisper |
| Robust Speech Recognition via Large-Scale Weak Supervision | https://cdn.openai.com/papers/whisper.pdf |
| XLM | https://huggingface.co/docs/transformers/model_doc/xlm |
| Cross-lingual Language Model Pretraining | https://arxiv.org/abs/1901.07291 |
| XLM-RoBERTa | https://huggingface.co/docs/transformers/model_doc/xlm-roberta |
| Unsupervised Cross-lingual Representation Learning at Scale | https://arxiv.org/abs/1911.02116 |
| YOLOS | https://huggingface.co/docs/transformers/model_doc/yolos |
| You Only Look at One Sequence: Rethinking Transformer in Vision through Object Detection | https://arxiv.org/abs/2106.00666 |
| huggingface.co/docs/transformers.js | https://huggingface.co/docs/transformers.js |
|
Readme
| https://patch-diff.githubusercontent.com/texonom/transformers.js#readme-ov-file |
|
Apache-2.0 license
| https://patch-diff.githubusercontent.com/texonom/transformers.js#Apache-2.0-1-ov-file |
| Please reload this page | https://patch-diff.githubusercontent.com/texonom/transformers.js |
|
Activity | https://patch-diff.githubusercontent.com/texonom/transformers.js/activity |
|
Custom properties | https://patch-diff.githubusercontent.com/texonom/transformers.js/custom-properties |
|
0
stars | https://patch-diff.githubusercontent.com/texonom/transformers.js/stargazers |
|
0
watching | https://patch-diff.githubusercontent.com/texonom/transformers.js/watchers |
|
0
forks | https://patch-diff.githubusercontent.com/texonom/transformers.js/forks |
|
Report repository
| https://patch-diff.githubusercontent.com/contact/report-content?content_url=https%3A%2F%2Fgithub.com%2Ftexonom%2Ftransformers.js&report=texonom+%28user%29 |
| Releases | https://patch-diff.githubusercontent.com/texonom/transformers.js/releases |
| Packages
0 | https://patch-diff.githubusercontent.com/orgs/texonom/packages?repo_name=transformers.js |
|
| https://github.com |
| Terms | https://docs.github.com/site-policy/github-terms/github-terms-of-service |
| Privacy | https://docs.github.com/site-policy/privacy-policies/github-privacy-statement |
| Security | https://github.com/security |
| Status | https://www.githubstatus.com/ |
| Community | https://github.community/ |
| Docs | https://docs.github.com/ |
| Contact | https://support.github.com?tags=dotcom-footer |