René's URL Explorer Experiment


Title: GitHub - Tongjilibo/bert4torch: An elegant PyTorch implementation of transformers

Description: An elegant PyTorch implementation of transformers. Contribute to Tongjilibo/bert4torch development by creating an account on GitHub.

Opengraph URL: https://github.com/Tongjilibo/bert4torch

X: @github

Domain: patch-diff.githubusercontent.com

Links:

Tongjilibo https://patch-diff.githubusercontent.com/Tongjilibo
bert4torchhttps://patch-diff.githubusercontent.com/Tongjilibo/bert4torch
bert4torch.readthedocs.io/https://bert4torch.readthedocs.io/
MIT license https://patch-diff.githubusercontent.com/Tongjilibo/bert4torch/blob/master/LICENSE
1.3k stars https://patch-diff.githubusercontent.com/Tongjilibo/bert4torch/stargazers
169 forks https://patch-diff.githubusercontent.com/Tongjilibo/bert4torch/forks
Branches https://patch-diff.githubusercontent.com/Tongjilibo/bert4torch/branches
Tags https://patch-diff.githubusercontent.com/Tongjilibo/bert4torch/tags
Activity https://patch-diff.githubusercontent.com/Tongjilibo/bert4torch/activity
Code https://patch-diff.githubusercontent.com/Tongjilibo/bert4torch
Issues 3 https://patch-diff.githubusercontent.com/Tongjilibo/bert4torch/issues
Pull requests 0 https://patch-diff.githubusercontent.com/Tongjilibo/bert4torch/pulls
Discussions https://patch-diff.githubusercontent.com/Tongjilibo/bert4torch/discussions
Actions https://patch-diff.githubusercontent.com/Tongjilibo/bert4torch/actions
Projects 0 https://patch-diff.githubusercontent.com/Tongjilibo/bert4torch/projects
Wiki https://patch-diff.githubusercontent.com/Tongjilibo/bert4torch/wiki
Security 0 https://patch-diff.githubusercontent.com/Tongjilibo/bert4torch/security
Insights https://patch-diff.githubusercontent.com/Tongjilibo/bert4torch/pulse
1,361 Commitshttps://patch-diff.githubusercontent.com/Tongjilibo/bert4torch/commits/master/
.githubhttps://patch-diff.githubusercontent.com/Tongjilibo/bert4torch/tree/master/.github
bert4torchhttps://patch-diff.githubusercontent.com/Tongjilibo/bert4torch/tree/master/bert4torch
datahttps://patch-diff.githubusercontent.com/Tongjilibo/bert4torch/tree/master/data
docshttps://patch-diff.githubusercontent.com/Tongjilibo/bert4torch/tree/master/docs
exampleshttps://patch-diff.githubusercontent.com/Tongjilibo/bert4torch/tree/master/examples
testhttps://patch-diff.githubusercontent.com/Tongjilibo/bert4torch/tree/master/test
tutorialshttps://patch-diff.githubusercontent.com/Tongjilibo/bert4torch/tree/master/tutorials
.gitignorehttps://patch-diff.githubusercontent.com/Tongjilibo/bert4torch/blob/master/.gitignore
LICENSEhttps://patch-diff.githubusercontent.com/Tongjilibo/bert4torch/blob/master/LICENSE
MANIFEST.inhttps://patch-diff.githubusercontent.com/Tongjilibo/bert4torch/blob/master/MANIFEST.in
README.mdhttps://patch-diff.githubusercontent.com/Tongjilibo/bert4torch/blob/master/README.md
requirements.txthttps://patch-diff.githubusercontent.com/Tongjilibo/bert4torch/blob/master/requirements.txt
setup.pyhttps://patch-diff.githubusercontent.com/Tongjilibo/bert4torch/blob/master/setup.py
https://patch-diff.githubusercontent.com/Tongjilibo/bert4torch/blob/master/docs/pics/bert4torch.png
https://github.com/Tongjilibo/bert4torch/blob/master/LICENSE
https://github.com/Tongjilibo/bert4torch/releases
https://pypi.org/project/bert4torch/
https://pypistats.org/packages/bert4torch
https://github.com/Tongjilibo/bert4torch
https://github.com/Tongjilibo/bert4torch/issues
https://github.com/Tongjilibo/bert4torch/blob/master/docs/pics/wechat_group.jpg
Documentationhttps://bert4torch.readthedocs.io
Torch4kerashttps://github.com/Tongjilibo/torch4keras
Exampleshttps://github.com/Tongjilibo/bert4torch/blob/master/examples
build_MiniLLM_from_scratchhttps://github.com/Tongjilibo/build_MiniLLM_from_scratch
bert4vectorhttps://github.com/Tongjilibo/bert4vector
https://patch-diff.githubusercontent.com/Tongjilibo/bert4torch#目录
Table of contents https://patch-diff.githubusercontent.com/Tongjilibo/bert4torch#%E7%9B%AE%E5%BD%95
1. Download and installation https://patch-diff.githubusercontent.com/Tongjilibo/bert4torch#1-%E4%B8%8B%E8%BD%BD%E5%AE%89%E8%A3%85
2. Features https://patch-diff.githubusercontent.com/Tongjilibo/bert4torch#2-%E5%8A%9F%E8%83%BD
3. Quick start https://patch-diff.githubusercontent.com/Tongjilibo/bert4torch#3-%E5%BF%AB%E9%80%9F%E4%B8%8A%E6%89%8B
3.1 Getting-started tutorials https://patch-diff.githubusercontent.com/Tongjilibo/bert4torch#31-%E4%B8%8A%E6%89%8B%E6%95%99%E7%A8%8B
3.2 Quickly deploying LLM services from the command line https://patch-diff.githubusercontent.com/Tongjilibo/bert4torch#32-%E5%91%BD%E4%BB%A4%E8%A1%8C%E5%BF%AB%E9%80%9F%E9%83%A8%E7%BD%B2%E5%A4%A7%E6%A8%A1%E5%9E%8B%E6%9C%8D%E5%8A%A1
4. Versions and update history https://patch-diff.githubusercontent.com/Tongjilibo/bert4torch#4-%E7%89%88%E6%9C%AC%E5%92%8C%E6%9B%B4%E6%96%B0%E5%8E%86%E5%8F%B2
4.1 Version history https://patch-diff.githubusercontent.com/Tongjilibo/bert4torch#41-%E7%89%88%E6%9C%AC%E5%8E%86%E5%8F%B2
4.2 Update history https://patch-diff.githubusercontent.com/Tongjilibo/bert4torch#42-%E6%9B%B4%E6%96%B0%E5%8E%86%E5%8F%B2
5. Pretrained weights https://patch-diff.githubusercontent.com/Tongjilibo/bert4torch#5-%E9%A2%84%E8%AE%AD%E7%BB%83%E6%9D%83%E9%87%8D
6. Acknowledgements https://patch-diff.githubusercontent.com/Tongjilibo/bert4torch#6-%E9%B8%A3%E8%B0%A2
7. Citation https://patch-diff.githubusercontent.com/Tongjilibo/bert4torch#7-%E5%BC%95%E7%94%A8
8. Miscellaneous https://patch-diff.githubusercontent.com/Tongjilibo/bert4torch#8-%E5%85%B6%E4%BB%96
https://patch-diff.githubusercontent.com/Tongjilibo/bert4torch#1-下载安装
https://patch-diff.githubusercontent.com/Tongjilibo/bert4torch#2-功能
Rich examples https://github.com/Tongjilibo/bert4torch/blob/master/examples/
llmhttps://github.com/Tongjilibo/bert4torch/blob/master/examples/llm
pretrainhttps://github.com/Tongjilibo/bert4torch/blob/master/examples/pretrain
sentence_classficationhttps://github.com/Tongjilibo/bert4torch/blob/master/examples/sentence_classfication
sentence_embeddinghttps://github.com/Tongjilibo/bert4torch/tree/master/examples/sentence_embedding
sequence_labelinghttps://github.com/Tongjilibo/bert4torch/blob/master/examples/sequence_labeling
relation_extractionhttps://github.com/Tongjilibo/bert4torch/blob/master/examples/relation_extraction
seq2seqhttps://github.com/Tongjilibo/bert4torch/blob/master/examples/seq2seq
servinghttps://github.com/Tongjilibo/bert4torch/blob/master/examples/serving/
Datasets for the examples https://github.com/Tongjilibo/bert4torch/blob/master/data/README.md
Experiment metrics https://github.com/Tongjilibo/bert4torch/blob/master/examples/Experiments.md
trickhttps://github.com/Tongjilibo/bert4torch/blob/master/examples/training_trick
Loading transformers library models https://github.com/Tongjilibo/bert4torch/blob/master//tutorials/tutorials_load_transformers_model.py
https://patch-diff.githubusercontent.com/Tongjilibo/bert4torch/blob/master/docs/pics/training_process.gif
https://patch-diff.githubusercontent.com/Tongjilibo/bert4torch#3-快速上手
https://patch-diff.githubusercontent.com/Tongjilibo/bert4torch#31-上手教程
Quick-Starthttps://bert4torch.readthedocs.io/en/latest//Quick-Start.html
Quick-start tutorial https://github.com/Tongjilibo/bert4torch/blob/master//tutorials/README.md
Tutorial examples https://github.com/Tongjilibo/bert4torch/blob/master//tutorials
Hands-on examples https://github.com/Tongjilibo/bert4torch/blob/master/examples
Introduction to bert4torch (Zhihu) https://zhuanlan.zhihu.com/p/486329434
bert4torch quick start (Zhihu) https://zhuanlan.zhihu.com/p/508890807
bert4torch has been updated yet again (Zhihu) https://zhuanlan.zhihu.com/p/560885427
https://patch-diff.githubusercontent.com/Tongjilibo/bert4torch#32-命令行快速部署大模型服务
https://patch-diff.githubusercontent.com/Tongjilibo/bert4torch/blob/master/docs/pics/cli_chat.gif
https://patch-diff.githubusercontent.com/Tongjilibo/bert4torch#4-版本和更新历史
https://patch-diff.githubusercontent.com/Tongjilibo/bert4torch#41-版本历史
More versions https://github.com/Tongjilibo/bert4torch/blob/master/docs/Update.md
https://patch-diff.githubusercontent.com/Tongjilibo/bert4torch#42-更新历史
More history https://github.com/Tongjilibo/bert4torch/blob/master/docs/History.md
https://patch-diff.githubusercontent.com/Tongjilibo/bert4torch#5-预训练权重
google-bert/bert-base-chinesehttps://huggingface.co/google-bert/bert-base-chinese
google-bert/bert-base-chinesehttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/google-bert/bert-base-chinese/bert4torch_config.json
chinese_L-12_H-768_A-12https://github.com/google-research/bert
TF weights https://storage.googleapis.com/bert_models/2018_11_03/chinese_L-12_H-768_A-12.zip
Tongjilibo/bert-chinese_L-12_H-768_A-12https://huggingface.co/Tongjilibo/bert-chinese_L-12_H-768_A-12
chinese-bert-wwm-exthttps://github.com/ymcui/Chinese-BERT-wwm
hfl/chinese-bert-wwm-exthttps://huggingface.co/hfl/chinese-bert-wwm-ext
hfl/chinese-bert-wwm-exthttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/hfl/chinese-bert-wwm-ext/bert4torch_config.json
google-bert/bert-base-multilingual-casedhttps://huggingface.co/google-bert/bert-base-multilingual-cased
google-bert/bert-base-multilingual-casedhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/google-bert/bert-base-multilingual-cased/bert4torch_config.json
google-bert/bert-base-casedhttps://huggingface.co/google-bert/bert-base-cased
google-bert/bert-base-casedhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/google-bert/bert-base-cased/bert4torch_config.json
google-bert/bert-base-uncasedhttps://huggingface.co/google-bert/bert-base-uncased
google-bert/bert-base-uncasedhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/google-bert/bert-base-uncased/bert4torch_config.json
MacBERThttps://github.com/ymcui/MacBERT
hfl/chinese-macbert-basehttps://huggingface.co/hfl/chinese-macbert-base
hfl/chinese-macbert-largehttps://huggingface.co/hfl/chinese-macbert-large
hfl/chinese-macbert-basehttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/hfl/chinese-macbert-base/bert4torch_config.json
hfl/chinese-macbert-largehttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/hfl/chinese-macbert-large/bert4torch_config.json
WoBERThttps://github.com/ZhuiyiTechnology/WoBERT
junnyu/wobert_chinese_basehttps://huggingface.co/junnyu/wobert_chinese_base
junnyu/wobert_chinese_plus_basehttps://huggingface.co/junnyu/wobert_chinese_plus_base
junnyu/wobert_chinese_basehttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/junnyu/wobert_chinese_base/bert4torch_config.json
junnyu/wobert_chinese_plus_basehttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/junnyu/wobert_chinese_plus_base/bert4torch_config.json
chinese-roberta-wwm-exthttps://github.com/ymcui/Chinese-BERT-wwm
hfl/chinese-roberta-wwm-exthttps://huggingface.co/hfl/chinese-roberta-wwm-ext
hfl/chinese-roberta-wwm-ext-largehttps://huggingface.co/hfl/chinese-roberta-wwm-ext-large
hfl/chinese-roberta-wwm-exthttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/hfl/chinese-roberta-wwm-ext/bert4torch_config.json
hfl/chinese-roberta-wwm-ext-largehttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/hfl/chinese-roberta-wwm-ext-large/bert4torch_config.json
roberta-small/tinyhttps://github.com/ZhuiyiTechnology/pretrained-models
Tongjilibo/chinese_roberta_L-4_H-312_A-12https://huggingface.co/Tongjilibo/chinese_roberta_L-4_H-312_A-12
Tongjilibo/chinese_roberta_L-6_H-384_A-12https://huggingface.co/Tongjilibo/chinese_roberta_L-6_H-384_A-12
roberta-basehttps://github.com/facebookresearch/fairseq/tree/main/examples/roberta
FacebookAI/roberta-basehttps://huggingface.co/FacebookAI/roberta-base
FacebookAI/roberta-basehttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/FacebookAI/roberta-base/bert4torch_config.json
guwenberthttps://github.com/Ethan-yt/guwenbert
ethanyt/guwenbert-basehttps://huggingface.co/ethanyt/guwenbert-base
ethanyt/guwenbert-basehttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/ethanyt/guwenbert-base/bert4torch_config.json
albert_zhhttps://github.com/brightmart/albert_zh
albert_pytorchhttps://github.com/lonePatient/albert_pytorch
voidful/albert_chinese_tinyhttps://huggingface.co/voidful/albert_chinese_tiny
voidful/albert_chinese_smallhttps://huggingface.co/voidful/albert_chinese_small
voidful/albert_chinese_basehttps://huggingface.co/voidful/albert_chinese_base
voidful/albert_chinese_largehttps://huggingface.co/voidful/albert_chinese_large
voidful/albert_chinese_xlargehttps://huggingface.co/voidful/albert_chinese_xlarge
voidful/albert_chinese_xxlargehttps://huggingface.co/voidful/albert_chinese_xxlarge
voidful/albert_chinese_tinyhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/voidful/albert_chinese_tiny/bert4torch_config.json
voidful/albert_chinese_smallhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/voidful/albert_chinese_small/bert4torch_config.json
voidful/albert_chinese_basehttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/voidful/albert_chinese_base/bert4torch_config.json
voidful/albert_chinese_largehttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/voidful/albert_chinese_large/bert4torch_config.json
voidful/albert_chinese_xlargehttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/voidful/albert_chinese_xlarge/bert4torch_config.json
voidful/albert_chinese_xxlargehttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/voidful/albert_chinese_xxlarge/bert4torch_config.json
NEZHAhttps://github.com/huawei-noah/Pretrained-Language-Model/tree/master/NEZHA-PyTorch
NeZha_Chinese_PyTorchhttps://github.com/lonePatient/NeZha_Chinese_PyTorch
sijunhe/nezha-cn-basehttps://huggingface.co/sijunhe/nezha-cn-base
sijunhe/nezha-cn-largehttps://huggingface.co/sijunhe/nezha-cn-large
sijunhe/nezha-base-wwmhttps://huggingface.co/sijunhe/nezha-base-wwm
sijunhe/nezha-large-wwmhttps://huggingface.co/sijunhe/nezha-large-wwm
sijunhe/nezha-cn-basehttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/sijunhe/nezha-cn-base/bert4torch_config.json
sijunhe/nezha-cn-largehttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/sijunhe/nezha-cn-large/bert4torch_config.json
sijunhe/nezha-base-wwmhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/sijunhe/nezha-base-wwm/bert4torch_config.json
sijunhe/nezha-large-wwmhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/sijunhe/nezha-large-wwm/bert4torch_config.json
nezha_gpt_dialoghttps://github.com/bojone/nezha_gpt_dialog
Tongjilibo/nezha_gpt_dialoghttps://huggingface.co/Tongjilibo/nezha_gpt_dialog
Chinese-XLNethttps://github.com/ymcui/Chinese-XLNet
hfl/chinese-xlnet-basehttps://huggingface.co/hfl/chinese-xlnet-base
hfl/chinese-xlnet-basehttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/hfl/chinese-xlnet-base/bert4torch_config.json
tranformer_xlhttps://github.com/kimiyoung/transformer-xl
transfo-xl/transfo-xl-wt103https://huggingface.co/transfo-xl/transfo-xl-wt103
transfo-xl/transfo-xl-wt103https://huggingface.co/Tongjilibo/bert4torch_config/blob/main/transfo-xl/transfo-xl-wt103/bert4torch_config.json
Erlangshen-DeBERTa-v2https://github.com/IDEA-CCNL/Fengshenbang-LM
IDEA-CCNL/Erlangshen-DeBERTa-v2-97M-Chinesehttps://huggingface.co/IDEA-CCNL/Erlangshen-DeBERTa-v2-97M-Chinese
IDEA-CCNL/Erlangshen-DeBERTa-v2-320M-Chinesehttps://huggingface.co/IDEA-CCNL/Erlangshen-DeBERTa-v2-320M-Chinese
IDEA-CCNL/Erlangshen-DeBERTa-v2-710M-Chinesehttps://huggingface.co/IDEA-CCNL/Erlangshen-DeBERTa-v2-710M-Chinese
IDEA-CCNL/Erlangshen-DeBERTa-v2-97M-Chinesehttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/IDEA-CCNL/Erlangshen-DeBERTa-v2-97M-Chinese/bert4torch_config.json
IDEA-CCNL/Erlangshen-DeBERTa-v2-320M-Chinesehttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/IDEA-CCNL/Erlangshen-DeBERTa-v2-320M-Chinese/bert4torch_config.json
IDEA-CCNL/Erlangshen-DeBERTa-v2-710M-Chinesehttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/IDEA-CCNL/Erlangshen-DeBERTa-v2-710M-Chinese/bert4torch_config.json
Chinese-ELECTRAhttps://github.com/ymcui/Chinese-ELECTRA
hfl/chinese-electra-base-discriminatorhttps://huggingface.co/hfl/chinese-electra-base-discriminator
hfl/chinese-electra-base-discriminatorhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/hfl/chinese-electra-base-discriminator/bert4torch_config.json
erniehttps://github.com/PaddlePaddle/ERNIE
nghuyong/ernie-1.0-base-zhhttps://huggingface.co/nghuyong/ernie-1.0-base-zh
nghuyong/ernie-3.0-base-zhhttps://huggingface.co/nghuyong/ernie-3.0-base-zh
nghuyong/ernie-1.0-base-zhhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/nghuyong/ernie-1.0-base-zh/bert4torch_config.json
nghuyong/ernie-3.0-base-zhhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/nghuyong/ernie-3.0-base-zh/bert4torch_config.json
roformerhttps://github.com/ZhuiyiTechnology/roformer
junnyu/roformer_chinese_basehttps://huggingface.co/junnyu/roformer_chinese_base
junnyu/roformer_chinese_basehttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/junnyu/roformer_chinese_base/bert4torch_config.json
roformer_v2https://github.com/ZhuiyiTechnology/roformer-v2
junnyu/roformer_v2_chinese_char_basehttps://huggingface.co/junnyu/roformer_v2_chinese_char_base
junnyu/roformer_v2_chinese_char_basehttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/junnyu/roformer_v2_chinese_char_base/bert4torch_config.json
simberthttps://github.com/ZhuiyiTechnology/simbert
Tongjilibo/simbert-chinese-basehttps://huggingface.co/Tongjilibo/simbert-chinese-base
Tongjilibo/simbert-chinese-smallhttps://huggingface.co/Tongjilibo/simbert-chinese-small
Tongjilibo/simbert-chinese-tinyhttps://huggingface.co/Tongjilibo/simbert-chinese-tiny
simbert_v2/roformer-simhttps://github.com/ZhuiyiTechnology/roformer-sim
junnyu/roformer_chinese_sim_char_basehttps://huggingface.co/junnyu/roformer_chinese_sim_char_base
junnyu/roformer_chinese_sim_char_ft_basehttps://huggingface.co/junnyu/roformer_chinese_sim_char_ft_base
junnyu/roformer_chinese_sim_char_smallhttps://huggingface.co/junnyu/roformer_chinese_sim_char_small
junnyu/roformer_chinese_sim_char_ft_smallhttps://huggingface.co/junnyu/roformer_chinese_sim_char_ft_small
junnyu/roformer_chinese_sim_char_basehttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/junnyu/roformer_chinese_sim_char_base/bert4torch_config.json
junnyu/roformer_chinese_sim_char_ft_basehttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/junnyu/roformer_chinese_sim_char_ft_base/bert4torch_config.json
junnyu/roformer_chinese_sim_char_smallhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/junnyu/roformer_chinese_sim_char_small/bert4torch_config.json
junnyu/roformer_chinese_sim_char_ft_smallhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/junnyu/roformer_chinese_sim_char_ft_small/bert4torch_config.json
GAU-alphahttps://github.com/ZhuiyiTechnology/GAU-alpha
Tongjilibo/chinese_GAU-alpha-char_L-24_H-768https://huggingface.co/Tongjilibo/chinese_GAU-alpha-char_L-24_H-768
ModernBERThttps://huggingface.co/collections/answerdotai/modernbert-67627ad707a4acbf33c41deb
answerdotai/ModernBERT-basehttps://huggingface.co/answerdotai/ModernBERT-base
answerdotai/ModernBERT-largehttps://huggingface.co/answerdotai/ModernBERT-large
answerdotai/ModernBERT-basehttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/answerdotai/ModernBERT-base/bert4torch_config.json
answerdotai/ModernBERT-largehttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/answerdotai/ModernBERT-large/bert4torch_config.json
uiehttps://github.com/universal-ie/UIE
uie_pytorchhttps://github.com/HUSTAI/uie_pytorch
Tongjilibo/uie-basehttps://huggingface.co/Tongjilibo/uie-base
CDial-GPThttps://github.com/thu-coai/CDial-GPT
thu-coai/CDial-GPT_LCCC-basehttps://huggingface.co/thu-coai/CDial-GPT_LCCC-base
thu-coai/CDial-GPT_LCCC-largehttps://huggingface.co/thu-coai/CDial-GPT_LCCC-large
thu-coai/CDial-GPT_LCCC-basehttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/thu-coai/CDial-GPT_LCCC-base/bert4torch_config.json
thu-coai/CDial-GPT_LCCC-largehttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/thu-coai/CDial-GPT_LCCC-large/bert4torch_config.json
cmp_lm (2.6 billion) https://github.com/TsinghuaAI/CPM-1-Generate
TsinghuaAI/CPM-Generatehttps://huggingface.co/TsinghuaAI/CPM-Generate
TsinghuaAI/CPM-Generatehttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/TsinghuaAI/CPM-Generate/bert4torch_config.json
nezha_genhttps://github.com/huawei-noah/Pretrained-Language-Model/tree/master/NEZHA-Gen-TensorFlow
Tongjilibo/chinese_nezha_gpt_L-12_H-768_A-12https://huggingface.co/Tongjilibo/chinese_nezha_gpt_L-12_H-768_A-12
gpt2-chinese-cluecorpussmallhttps://github.com/dbiir/UER-py/wiki/Modelzoo
uer/gpt2-chinese-cluecorpussmallhttps://huggingface.co/uer/gpt2-chinese-cluecorpussmall
uer/gpt2-chinese-cluecorpussmallhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/uer/gpt2-chinese-cluecorpussmall/bert4torch_config.json
gpt2-mlhttps://github.com/imcaspar/gpt2-ml
Tongjilibo/gpt2-ml_15g_corpushttps://huggingface.co/Tongjilibo/gpt2-ml_15g_corpus
Tongjilibo/gpt2-ml_30g_corpushttps://huggingface.co/Tongjilibo/gpt2-ml_30g_corpus
torchhttps://github.com/ghosthamlet/gpt2-ml-torch
BaiduYun(84dh)https://pan.baidu.com/s/16tL4Bmoh6jPy0cOND0YyeA
bart_base_chinesehttps://github.com/fastnlp/CPT
fnlp/bart-base-chinesehttps://huggingface.co/fnlp/bart-base-chinese
v1.0https://huggingface.co/fnlp/bart-base-chinese/tree/v1.0
fnlp/bart-base-chinesehttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/fnlp/bart-base-chinese/bert4torch_config.json
fnlp/bart-base-chinese-v1.0https://huggingface.co/Tongjilibo/bert4torch_config/blob/main/fnlp/bart-base-chinese-v1.0/bert4torch_config.json
t5https://github.com/dbiir/UER-py/wiki/Modelzoo
uer/t5-small-chinese-cluecorpussmallhttps://huggingface.co/uer/t5-small-chinese-cluecorpussmall
uer/t5-base-chinese-cluecorpussmallhttps://huggingface.co/uer/t5-base-chinese-cluecorpussmall
uer/t5-base-chinese-cluecorpussmallhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/uer/t5-base-chinese-cluecorpussmall/bert4torch_config.json
uer/t5-small-chinese-cluecorpussmallhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/uer/t5-small-chinese-cluecorpussmall/bert4torch_config.json
google/mt5-basehttps://huggingface.co/google/mt5-base
google/mt5-basehttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/google/mt5-base/bert4torch_config.json
t5_pegasushttps://github.com/ZhuiyiTechnology/t5-pegasus
Tongjilibo/chinese_t5_pegasus_smallhttps://huggingface.co/Tongjilibo/chinese_t5_pegasus_small
Tongjilibo/chinese_t5_pegasus_basehttps://huggingface.co/Tongjilibo/chinese_t5_pegasus_base
chatyuanhttps://github.com/clue-ai/ChatYuan
ClueAI/ChatYuan-large-v1https://huggingface.co/ClueAI/ChatYuan-large-v1
ClueAI/ChatYuan-large-v2https://huggingface.co/ClueAI/ChatYuan-large-v2
ClueAI/ChatYuan-large-v1https://huggingface.co/Tongjilibo/bert4torch_config/blob/main/ClueAI/ChatYuan-large-v1/bert4torch_config.json
ClueAI/ChatYuan-large-v2https://huggingface.co/Tongjilibo/bert4torch_config/blob/main/ClueAI/ChatYuan-large-v2/bert4torch_config.json
PromptCLUEhttps://github.com/clue-ai/PromptCLUE
ClueAI/PromptCLUE-basehttps://huggingface.co/ClueAI/PromptCLUE-base
ClueAI/PromptCLUE-basehttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/ClueAI/PromptCLUE-base/bert4torch_config.json
ChatGLM-6Bhttps://github.com/THUDM/ChatGLM-6B
THUDM/chatglm-6bhttps://huggingface.co/THUDM/chatglm-6b
THUDM/chatglm-6b-int8https://huggingface.co/THUDM/chatglm-6b-int8
THUDM/chatglm-6b-int4https://huggingface.co/THUDM/chatglm-6b-int4
v0.1.0https://huggingface.co/THUDM/chatglm-6b/tree/v0.1.0
THUDM/chatglm-6bhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/THUDM/chatglm-6b/bert4torch_config.json
THUDM/chatglm-6b-int8https://huggingface.co/Tongjilibo/bert4torch_config/blob/main/THUDM/chatglm-6b-int8/bert4torch_config.json
THUDM/chatglm-6b-int4https://huggingface.co/Tongjilibo/bert4torch_config/blob/main/THUDM/chatglm-6b-int4/bert4torch_config.json
THUDM/chatglm-6b-v0.1.0https://huggingface.co/Tongjilibo/bert4torch_config/blob/main/THUDM/chatglm-6b-v0.1.0/bert4torch_config.json
ChatGLM2-6Bhttps://github.com/THUDM/ChatGLM2-6B
THUDM/chatglm2-6bhttps://huggingface.co/THUDM/chatglm2-6b
THUDM/chatglm2-6b-int4https://huggingface.co/THUDM/chatglm2-6b-int4
THUDM/chatglm2-6b-32khttps://huggingface.co/THUDM/chatglm2-6b-32k
THUDM/chatglm2-6bhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/THUDM/chatglm2-6b/bert4torch_config.json
THUDM/chatglm2-6b-int4https://huggingface.co/Tongjilibo/bert4torch_config/blob/main/THUDM/chatglm2-6b-int4/bert4torch_config.json
THUDM/chatglm2-6b-32khttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/THUDM/chatglm2-6b-32k/bert4torch_config.json
ChatGLM3https://github.com/THUDM/ChatGLM3
THUDM/chatglm3-6bhttps://huggingface.co/THUDM/chatglm3-6b
THUDM/chatglm3-6b-32khttps://huggingface.co/THUDM/chatglm3-6b-32k
THUDM/chatglm3-6bhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/THUDM/chatglm3-6b/bert4torch_config.json
THUDM/chatglm3-6b-32khttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/THUDM/chatglm3-6b-32k/bert4torch_config.json
GLM-4https://github.com/THUDM/GLM-4
THUDM/glm-4-9bhttps://huggingface.co/THUDM/glm-4-9b
THUDM/glm-4-9b-chathttps://huggingface.co/THUDM/glm-4-9b-chat
THUDM/glm-4-9b-chat-1mhttps://huggingface.co/THUDM/glm-4-9b-chat-1m
THUDM/glm-4v-9bhttps://huggingface.co/THUDM/glm-4v-9b
THUDM/GLM-4-9B-0414https://huggingface.co/THUDM/GLM-4-9B-0414
THUDM/GLM-Z1-9B-0414https://huggingface.co/THUDM/GLM-Z1-9B-0414
THUDM/glm-4-9bhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/THUDM/glm-4-9b/bert4torch_config.json
THUDM/glm-4-9b-chathttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/THUDM/glm-4-9b-chat/bert4torch_config.json
THUDM/glm-4-9b-chat-1mhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/THUDM/glm-4-9b-chat-1m/bert4torch_config.json
THUDM/glm-4v-9bhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/THUDM/glm-4v-9b/bert4torch_config.json
llamahttps://github.com/facebookresearch/llama
meta-llama/llama-7bhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/meta-llama/llama-7b/bert4torch_config.json
meta-llama/llama-13bhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/meta-llama/llama-13b/bert4torch_config.json
llama-2https://github.com/facebookresearch/llama
meta-llama/Llama-2-7b-hfhttps://huggingface.co/meta-llama/Llama-2-7b-hf
meta-llama/Llama-2-7b-chat-hfhttps://huggingface.co/meta-llama/Llama-2-7b-chat-hf
meta-llama/Llama-2-13b-hfhttps://huggingface.co/meta-llama/Llama-2-13b-hf
meta-llama/Llama-2-13b-chat-hfhttps://huggingface.co/meta-llama/Llama-2-13b-chat-hf
meta-llama/Llama-2-7b-hfhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/meta-llama/Llama-2-7b-hf/bert4torch_config.json
meta-llama/Llama-2-7b-chat-hfhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/meta-llama/Llama-2-7b-chat-hf/bert4torch_config.json
meta-llama/Llama-2-13b-hfhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/meta-llama/Llama-2-13b-hf/bert4torch_config.json
meta-llama/Llama-2-13b-chat-hfhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/meta-llama/Llama-2-13b-chat-hf/bert4torch_config.json
llama-3https://github.com/meta-llama/llama3
meta-llama/Meta-Llama-3-8Bhttps://huggingface.co/meta-llama/Meta-Llama-3-8B
meta-llama/Meta-Llama-3-8B-Instructhttps://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct
meta-llama/Meta-Llama-3-8Bhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/meta-llama/Meta-Llama-3-8B/bert4torch_config.json
meta-llama/Meta-Llama-3-8B-Instructhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/meta-llama/Meta-Llama-3-8B-Instruct/bert4torch_config.json
llama-3.1https://github.com/meta-llama/llama-models
meta-llama/Meta-Llama-3.1-8Bhttps://huggingface.co/meta-llama/Meta-Llama-3.1-8B
meta-llama/Meta-Llama-3.1-8B-Instructhttps://huggingface.co/meta-llama/Meta-Llama-3.1-8B-Instruct
meta-llama/Meta-Llama-3.1-8Bhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/meta-llama/Meta-Llama-3.1-8B/bert4torch_config.json
meta-llama/Meta-Llama-3.1-8B-Instructhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/meta-llama/Meta-Llama-3.1-8B-Instruct/bert4torch_config.json
llama-3.2https://github.com/meta-llama/llama-models
meta-llama/Llama-3.2-1Bhttps://huggingface.co/meta-llama/Llama-3.2-1B
meta-llama/Llama-3.2-1B-Instructhttps://huggingface.co/meta-llama/Llama-3.2-1B-Instruct
meta-llama/Llama-3.2-3Bhttps://huggingface.co/meta-llama/Llama-3.2-3B
meta-llama/Llama-3.2-3B-Instructhttps://huggingface.co/meta-llama/Llama-3.2-3B-Instruct
meta-llama/Llama-3.2-1Bhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/meta-llama/Llama-3.2-1B/bert4torch_config.json
meta-llama/Llama-3.2-1B-Instructhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/meta-llama/Llama-3.2-1B-Instruct/bert4torch_config.json
meta-llama/Llama-3.2-3Bhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/meta-llama/Llama-3.2-3B/bert4torch_config.json
meta-llama/Llama-3.2-3B-Instructhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/meta-llama/Llama-3.2-3B-Instruct/bert4torch_config.json
llama-3.2-visionhttps://github.com/meta-llama/llama-models
meta-llama/Llama-3.2-11B-Visionhttps://huggingface.co/meta-llama/Llama-3.2-11B-Vision
meta-llama/Llama-3.2-11B-Vision-Instructhttps://huggingface.co/meta-llama/Llama-3.2-11B-Vision-Instruct
meta-llama/Llama-3.2-11B-Visionhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/meta-llama/Llama-3.2-11B-Vision/bert4torch_config.json
meta-llama/Llama-3.2-11B-Vision-Instructhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/meta-llama/Llama-3.2-11B-Vision-Instruct/bert4torch_config.json
Chinese-LLaMA-Alpacahttps://github.com/ymcui/Chinese-LLaMA-Alpaca
hfl/chinese-alpaca-plus-lora-7bhttps://huggingface.co/hfl/chinese-alpaca-plus-lora-7b
hfl/chinese-llama-plus-lora-7bhttps://huggingface.co/hfl/chinese-llama-plus-lora-7b
hfl/chinese-alpaca-plus-7bhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/hfl/chinese-alpaca-plus-7b/bert4torch_config.json
hfl/chinese-llama-plus-7bhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/hfl/chinese-llama-plus-7b/bert4torch_config.json
Chinese-LLaMA-Alpaca-2https://github.com/ymcui/Chinese-LLaMA-Alpaca-2
Chinese-LLaMA-Alpaca-3https://github.com/ymcui/Chinese-LLaMA-Alpaca-3
Belle_llamahttps://github.com/LianjiaTech/BELLE
BelleGroup/BELLE-LLaMA-7B-2M-enchttps://huggingface.co/BelleGroup/BELLE-LLaMA-7B-2M-enc
Merge instructionshttps://github.com/LianjiaTech/BELLE/tree/main/models
BelleGroup/BELLE-LLaMA-7B-2M-enchttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/BelleGroup/BELLE-LLaMA-7B-2M-enc
Ziyahttps://github.com/IDEA-CCNL/Fengshenbang-LM
IDEA-CCNL/Ziya-LLaMA-13B-v1https://huggingface.co/IDEA-CCNL/Ziya-LLaMA-13B-v1
IDEA-CCNL/Ziya-LLaMA-13B-v1.1https://huggingface.co/IDEA-CCNL/Ziya-LLaMA-13B-v1.1
IDEA-CCNL/Ziya-LLaMA-13B-Pretrain-v1https://huggingface.co/IDEA-CCNL/Ziya-LLaMA-13B-Pretrain-v1
IDEA-CCNL/Ziya-LLaMA-13B-v1https://huggingface.co/Tongjilibo/bert4torch_config/blob/main/IDEA-CCNL/Ziya-LLaMA-13B-v1/bert4torch_config.json
IDEA-CCNL/Ziya-LLaMA-13B-v1.1https://huggingface.co/Tongjilibo/bert4torch_config/blob/main/IDEA-CCNL/Ziya-LLaMA-13B-v1.1/bert4torch_config.json
vicunahttps://github.com/lm-sys/FastChat
lmsys/vicuna-7b-v1.5https://huggingface.co/lmsys/vicuna-7b-v1.5
lmsys/vicuna-7b-v1.5https://huggingface.co/Tongjilibo/bert4torch_config/blob/main/lmsys/vicuna-7b-v1.5/bert4torch_config.json
Baichuanhttps://github.com/baichuan-inc/Baichuan
baichuan-inc/Baichuan-7Bhttps://huggingface.co/baichuan-inc/Baichuan-7B
baichuan-inc/Baichuan-13B-Basehttps://huggingface.co/baichuan-inc/Baichuan-13B-Base
baichuan-inc/Baichuan-13B-Chathttps://huggingface.co/baichuan-inc/Baichuan-13B-Chat
baichuan-inc/Baichuan-7Bhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/baichuan-inc/Baichuan-7B/bert4torch_config.json
baichuan-inc/Baichuan-13B-Basehttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/baichuan-inc/Baichuan-13B-Base/bert4torch_config.json
baichuan-inc/Baichuan-13B-Chathttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/baichuan-inc/Baichuan-13B-Chat/bert4torch_config.json
Baichuan2https://github.com/baichuan-inc/Baichuan2
baichuan-inc/Baichuan2-7B-Basehttps://huggingface.co/baichuan-inc/Baichuan2-7B-Base
baichuan-inc/Baichuan2-7B-Chathttps://huggingface.co/baichuan-inc/Baichuan2-7B-Chat
baichuan-inc/Baichuan2-13B-Basehttps://huggingface.co/baichuan-inc/Baichuan2-13B-Base
baichuan-inc/Baichuan2-13B-Chathttps://huggingface.co/baichuan-inc/Baichuan2-13B-Chat
baichuan-inc/Baichuan2-7B-Basehttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/baichuan-inc/Baichuan2-7B-Base/bert4torch_config.json
baichuan-inc/Baichuan2-7B-Chathttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/baichuan-inc/Baichuan2-7B-Chat/bert4torch_config.json
baichuan-inc/Baichuan2-13B-Basehttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/baichuan-inc/Baichuan2-13B-Base/bert4torch_config.json
baichuan-inc/Baichuan2-13B-Chathttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/baichuan-inc/Baichuan2-13B-Chat/bert4torch_config.json
Yihttps://github.com/01-ai/Yi
01-ai/Yi-6Bhttps://huggingface.co/01-ai/Yi-6B
01-ai/Yi-6B-200Khttps://huggingface.co/01-ai/Yi-6B-200K
01-ai/Yi-9Bhttps://huggingface.co/01-ai/Yi-9B
01-ai/Yi-9B-200Khttps://huggingface.co/01-ai/Yi-9B-200K
01-ai/Yi-6Bhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/01-ai/Yi-6B/bert4torch_config.json
01-ai/Yi-6B-200Khttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/01-ai/Yi-6B-200K/bert4torch_config.json
01-ai/Yi-9Bhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/01-ai/Yi-9B/bert4torch_config.json
01-ai/Yi-9B-200Khttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/01-ai/Yi-9B-200K/bert4torch_config.json
Yi-1.5https://github.com/01-ai/Yi-1.5
01-ai/Yi-1.5-6Bhttps://huggingface.co/01-ai/Yi-1.5-6B
01-ai/Yi-1.5-6B-Chathttps://huggingface.co/01-ai/Yi-1.5-6B-Chat
01-ai/Yi-1.5-9Bhttps://huggingface.co/01-ai/Yi-1.5-9B
01-ai/Yi-1.5-9B-32Khttps://huggingface.co/01-ai/Yi-1.5-9B-32K
01-ai/Yi-1.5-9B-Chathttps://huggingface.co/01-ai/Yi-1.5-9B-Chat
01-ai/Yi-1.5-9B-Chat-16Khttps://huggingface.co/01-ai/Yi-1.5-9B-Chat-16K
01-ai/Yi-1.5-6Bhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/01-ai/Yi-1.5-6B/bert4torch_config.json
01-ai/Yi-1.5-6B-Chathttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/01-ai/Yi-1.5-6B-Chat/bert4torch_config.json
01-ai/Yi-1.5-9Bhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/01-ai/Yi-1.5-9B/bert4torch_config.json
01-ai/Yi-1.5-9B-32Khttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/01-ai/Yi-1.5-9B-32K/bert4torch_config.json
01-ai/Yi-1.5-9B-Chathttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/01-ai/Yi-1.5-9B-Chat
01-ai/Yi-1.5-9B-Chat-16Khttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/01-ai/Yi-1.5-9B-Chat-16K/bert4torch_config.json
bloomhttps://github.com/bigscience-workshop/xmtf
bigscience/bloom-560mhttps://huggingface.co/bigscience/bloom-560m
bigscience/bloomz-560mhttps://huggingface.co/bigscience/bloomz-560m
bigscience/bloom-560mhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/bigscience/bloom-560m/bert4torch_config.json
bigscience/bloomz-560mhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/bigscience/bloomz-560m/bert4torch_config.json
Qwenhttps://github.com/QwenLM/Qwen
Qwen/Qwen-1_8Bhttps://huggingface.co/Qwen/Qwen-1_8B
Qwen/Qwen-1_8B-Chathttps://huggingface.co/Qwen/Qwen-1_8B-Chat
Qwen/Qwen-7Bhttps://huggingface.co/Qwen/Qwen-7B
Qwen/Qwen-7B-Chathttps://huggingface.co/Qwen/Qwen-7B-Chat
Qwen/Qwen-14Bhttps://huggingface.co/Qwen/Qwen-14B
Qwen/Qwen-14B-Chathttps://huggingface.co/Qwen/Qwen-14B-Chat
Qwen/Qwen-1_8Bhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/Qwen/Qwen-1_8B/bert4torch_config.json
Qwen/Qwen-1_8B-Chathttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/Qwen/Qwen-1_8B-Chat/bert4torch_config.json
Qwen/Qwen-7Bhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/Qwen/Qwen-7B/bert4torch_config.json
Qwen/Qwen-7B-Chathttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/Qwen/Qwen-7B-Chat/bert4torch_config.json
Qwen/Qwen-14Bhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/Qwen/Qwen-14B/bert4torch_config.json
Qwen/Qwen-14B-Chathttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/Qwen/Qwen-14B-Chat/bert4torch_config.json
Qwen1.5https://github.com/QwenLM/Qwen1.5
Qwen/Qwen1.5-0.5Bhttps://huggingface.co/Qwen/Qwen1.5-0.5B
Qwen/Qwen1.5-0.5B-Chathttps://huggingface.co/Qwen/Qwen1.5-0.5B-Chat
Qwen/Qwen1.5-1.8Bhttps://huggingface.co/Qwen/Qwen1.5-1.8B
Qwen/Qwen1.5-1.8B-Chathttps://huggingface.co/Qwen/Qwen1.5-1.8B-Chat
Qwen/Qwen1.5-7Bhttps://huggingface.co/Qwen/Qwen1.5-7B
Qwen/Qwen1.5-7B-Chathttps://huggingface.co/Qwen/Qwen1.5-7B-Chat
Qwen/Qwen1.5-14Bhttps://huggingface.co/Qwen/Qwen1.5-14B
Qwen/Qwen1.5-14B-Chathttps://huggingface.co/Qwen/Qwen1.5-14B-Chat
Qwen/Qwen1.5-0.5Bhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/Qwen/Qwen1.5-0.5B/bert4torch_config.json
Qwen/Qwen1.5-0.5B-Chathttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/Qwen/Qwen1.5-0.5B-Chat/bert4torch_config.json
Qwen/Qwen1.5-1.8Bhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/Qwen/Qwen1.5-1.8B/bert4torch_config.json
Qwen/Qwen1.5-1.8B-Chathttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/Qwen/Qwen1.5-1.8B-Chat/bert4torch_config.json
Qwen/Qwen1.5-7Bhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/Qwen/Qwen1.5-7B/bert4torch_config.json
Qwen/Qwen1.5-7B-Chathttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/Qwen/Qwen1.5-7B-Chat/bert4torch_config.json
Qwen/Qwen1.5-14Bhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/Qwen/Qwen1.5-14B/bert4torch_config.json
Qwen/Qwen1.5-14B-Chathttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/Qwen/Qwen1.5-14B-Chat/bert4torch_config.json
Qwen2https://github.com/QwenLM/Qwen2
Qwen/Qwen2-0.5Bhttps://huggingface.co/Qwen/Qwen2-0.5B
Qwen/Qwen2-0.5B-Instructhttps://huggingface.co/Qwen/Qwen2-0.5B-Instruct
Qwen/Qwen2-1.5Bhttps://huggingface.co/Qwen/Qwen2-1.5B
Qwen/Qwen2-1.5B-Instructhttps://huggingface.co/Qwen/Qwen2-1.5B-Instruct
Qwen/Qwen2-7Bhttps://huggingface.co/Qwen/Qwen2-7B
Qwen/Qwen2-7B-Instructhttps://huggingface.co/Qwen/Qwen2-7B-Instruct
Qwen/Qwen2-0.5Bhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/Qwen/Qwen2-0.5B/bert4torch_config.json
Qwen/Qwen2-0.5B-Instructhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/Qwen/Qwen2-0.5B-Instruct/bert4torch_config.json
Qwen/Qwen2-1.5Bhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/Qwen/Qwen2-1.5B/bert4torch_config.json
Qwen/Qwen2-1.5B-Instructhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/Qwen/Qwen2-1.5B-Instruct/bert4torch_config.json
Qwen/Qwen2-7Bhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/Qwen/Qwen2-7B/bert4torch_config.json
Qwen/Qwen2-7B-Instructhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/Qwen/Qwen2-7B-Instruct/bert4torch_config.json
Qwen2-VLhttps://github.com/QwenLM/Qwen2-VL
Qwen/Qwen2-VL-2B-Instructhttps://huggingface.co/Qwen/Qwen2-VL-2B-Instruct
Qwen/Qwen2-VL-7B-Instructhttps://huggingface.co/Qwen/Qwen2-VL-7B-Instruct
Qwen/Qwen2-VL-2B-Instructhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/Qwen/Qwen2-VL-2B-Instruct/bert4torch_config.json
Qwen/Qwen2-VL-7B-Instructhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/Qwen/Qwen2-VL-7B-Instruct/bert4torch_config.json
Qwen2.5https://github.com/QwenLM/Qwen2.5
Qwen/Qwen2.5-0.5Bhttps://huggingface.co/Qwen/Qwen2.5-0.5B
Qwen/Qwen2.5-0.5B-Instructhttps://huggingface.co/Qwen/Qwen2.5-0.5B-Instruct
Qwen/Qwen2.5-1.5Bhttps://huggingface.co/Qwen/Qwen2.5-1.5B
Qwen/Qwen2.5-1.5B-Instructhttps://huggingface.co/Qwen/Qwen2.5-1.5B-Instruct
Qwen/Qwen2.5-3Bhttps://huggingface.co/Qwen/Qwen2.5-3B
Qwen/Qwen2.5-3B-Instructhttps://huggingface.co/Qwen/Qwen2.5-3B-Instruct
Qwen/Qwen2.5-7Bhttps://huggingface.co/Qwen/Qwen2.5-7B
Qwen/Qwen2.5-7B-Instructhttps://huggingface.co/Qwen/Qwen2.5-7B-Instruct
Qwen/Qwen2.5-14Bhttps://huggingface.co/Qwen/Qwen2.5-14B
Qwen/Qwen2.5-14B-Instructhttps://huggingface.co/Qwen/Qwen2.5-14B-Instruct
Qwen/Qwen2.5-0.5Bhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/Qwen/Qwen2.5-0.5B/bert4torch_config.json
Qwen/Qwen2.5-0.5B-Instructhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/Qwen/Qwen2.5-0.5B-Instruct/bert4torch_config.json
Qwen/Qwen2.5-1.5Bhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/Qwen/Qwen2.5-1.5B/bert4torch_config.json
Qwen/Qwen2.5-1.5B-Instructhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/Qwen/Qwen2.5-1.5B-Instruct/bert4torch_config.json
Qwen/Qwen2.5-3Bhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/Qwen/Qwen2.5-3B/bert4torch_config.json
Qwen/Qwen2.5-3B-Instructhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/Qwen/Qwen2.5-3B-Instruct/bert4torch_config.json
Qwen/Qwen2.5-7Bhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/Qwen/Qwen2.5-7B/bert4torch_config.json
Qwen/Qwen2.5-7B-Instructhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/Qwen/Qwen2.5-7B-Instruct/bert4torch_config.json
Qwen/Qwen2.5-14Bhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/Qwen/Qwen2.5-14B/bert4torch_config.json
Qwen/Qwen2.5-14B-Instructhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/Qwen/Qwen2.5-14B-Instruct/bert4torch_config.json
Qwen2.5-VLhttps://github.com/QwenLM/Qwen2.5-VL
Qwen/Qwen2.5-VL-3B-Instructhttps://huggingface.co/Qwen/Qwen2.5-VL-3B-Instruct
Qwen/Qwen2.5-VL-7B-Instructhttps://huggingface.co/Qwen/Qwen2.5-VL-7B-Instruct
Qwen/Qwen2.5-VL-3B-Instructhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/Qwen/Qwen2.5-VL-3B-Instruct/bert4torch_config.json
Qwen/Qwen2.5-VL-7B-Instructhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/Qwen/Qwen2.5-VL-7B-Instruct/bert4torch_config.json
Qwen3https://github.com/QwenLM/Qwen3
Qwen/Qwen3-0.6B-Basehttps://huggingface.co/Qwen/Qwen3-0.6B-Base
Qwen/Qwen3-0.6Bhttps://huggingface.co/Qwen/Qwen3-0.6B
Qwen/Qwen3-0.6B-GPTQ-Int8https://huggingface.co/Qwen/Qwen3-0.6B-GPTQ-Int8
Qwen/Qwen3-1.7B-Basehttps://huggingface.co/Qwen/Qwen3-1.7B-Base
Qwen/Qwen3-1.7Bhttps://huggingface.co/Qwen/Qwen3-1.7B
Qwen/Qwen3-4B-Basehttps://huggingface.co/Qwen/Qwen3-4B-Base
Qwen/Qwen3-4Bhttps://huggingface.co/Qwen/Qwen3-4B
Qwen/Qwen3-4B-AWQhttps://huggingface.co/Qwen/Qwen3-4B-AWQ
Qwen/Qwen3-8B-Basehttps://huggingface.co/Qwen/Qwen3-8B-Base
Qwen/Qwen3-8Bhttps://huggingface.co/Qwen/Qwen3-8B
Qwen/Qwen3-14B-Basehttps://huggingface.co/Qwen/Qwen3-14B-Base
Qwen/Qwen3-14Bhttps://huggingface.co/Qwen/Qwen3-14B
Qwen/Qwen3-32Bhttps://huggingface.co/Qwen/Qwen3-32B
Qwen/Qwen3-4B-Instruct-2507https://huggingface.co/Qwen/Qwen3-4B-Instruct-2507
Qwen/Qwen3-4B-Thinking-2507https://huggingface.co/Qwen/Qwen3-4B-Thinking-2507
Qwen/Qwen3-30B-A3B-Instruct-2507https://huggingface.co/Qwen/Qwen3-30B-A3B-Instruct-2507
Qwen/Qwen3-30B-A3B-Thinking-2507https://huggingface.co/Qwen/Qwen3-30B-A3B-Thinking-2507
Qwen/Qwen3-0.6B-Basehttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/Qwen/Qwen3-0.6B-Base/bert4torch_config.json
Qwen/Qwen3-0.6Bhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/Qwen/Qwen3-0.6B/bert4torch_config.json
Qwen/Qwen3-0.6B-GPTQ-Int8https://huggingface.co/Tongjilibo/bert4torch_config/blob/main/Qwen/Qwen3-0.6B-GPTQ-Int8/bert4torch_config.json
Qwen/Qwen3-1.7B-Basehttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/Qwen/Qwen3-1.7B-Base/bert4torch_config.json
Qwen/Qwen3-1.7Bhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/Qwen/Qwen3-1.7B/bert4torch_config.json
Qwen/Qwen3-4B-Basehttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/Qwen/Qwen3-4B-Base/bert4torch_config.json
Qwen/Qwen3-4Bhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/Qwen/Qwen3-4B/bert4torch_config.json
Qwen/Qwen3-4B-AWQhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/Qwen/Qwen3-4B-AWQ/bert4torch_config.json
Qwen/Qwen3-8B-Basehttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/Qwen/Qwen3-8B-Base/bert4torch_config.json
Qwen/Qwen3-8Bhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/Qwen/Qwen3-8B/bert4torch_config.json
Qwen/Qwen3-14B-Basehttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/Qwen/Qwen3-14B-Base/bert4torch_config.json
Qwen/Qwen3-14Bhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/Qwen/Qwen3-14B/bert4torch_config.json
Qwen/Qwen3-32Bhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/Qwen/Qwen3-32B/bert4torch_config.json
Qwen/Qwen3-4B-Instruct-2507https://huggingface.co/Tongjilibo/bert4torch_config/blob/main/Qwen/Qwen3-4B-Instruct-2507/bert4torch_config.json
Qwen/Qwen3-4B-Thinking-2507https://huggingface.co/Tongjilibo/bert4torch_config/blob/main/Qwen/Qwen3-4B-Thinking-2507/bert4torch_config.json
Qwen/Qwen3-30B-A3B-Instruct-2507https://huggingface.co/Tongjilibo/bert4torch_config/blob/main/Qwen/Qwen3-30B-A3B-Instruct-2507/bert4torch_config.json
Qwen/Qwen3-30B-A3B-Thinking-2507https://huggingface.co/Tongjilibo/bert4torch_config/blob/main/Qwen/Qwen3-30B-A3B-Thinking-2507/bert4torch_config.json
Qwen3-VLhttps://huggingface.co/collections/Qwen/qwen3-vl
Qwen/Qwen3-VL-2B-Instructhttps://huggingface.co/Qwen/Qwen3-VL-2B-Instruct
Qwen/Qwen3-VL-2B-Thinkinghttps://huggingface.co/Qwen/Qwen3-VL-2B-Thinking
Qwen/Qwen3-VL-4B-Instructhttps://huggingface.co/Qwen/Qwen3-VL-4B-Instruct
Qwen/Qwen3-VL-4B-Thinkinghttps://huggingface.co/Qwen/Qwen3-VL-4B-Thinking
Qwen/Qwen3-VL-8B-Instructhttps://huggingface.co/Qwen/Qwen3-VL-8B-Instruct
Qwen/Qwen3-VL-8B-Thinkinghttps://huggingface.co/Qwen/Qwen3-VL-8B-Thinking
Qwen/Qwen3-VL-30B-A3B-Instructhttps://huggingface.co/Qwen/Qwen3-VL-30B-A3B-Instruct
Qwen/Qwen3-VL-30B-A3B-Thinkinghttps://huggingface.co/Qwen/Qwen3-VL-30B-A3B-Thinking
Qwen/Qwen3-VL-32B-Instructhttps://huggingface.co/Qwen/Qwen3-VL-32B-Instruct
Qwen/Qwen3-VL-32B-Thinkinghttps://huggingface.co/Qwen/Qwen3-VL-32B-Thinking
Qwen3-Embeddinghttps://github.com/QwenLM/Qwen3
Qwen/Qwen3-Embedding-0.6Bhttps://huggingface.co/Qwen/Qwen3-Embedding-0.6B
Qwen/Qwen3-Embedding-4Bhttps://huggingface.co/Qwen/Qwen3-Embedding-4B
Qwen/Qwen3-Embedding-8Bhttps://huggingface.co/Qwen/Qwen3-Embedding-8B
Qwen/Qwen3-Embedding-0.6Bhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/Qwen/Qwen3-Embedding-0.6B/bert4torch_config.json
Qwen/Qwen3-Embedding-4Bhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/Qwen/Qwen3-Embedding-4B/bert4torch_config.json
Qwen/Qwen3-Embedding-8Bhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/Qwen/Qwen3-Embedding-8B/bert4torch_config.json
Qwen3-Rerankerhttps://github.com/QwenLM/Qwen3
Qwen/Qwen3-Reranker-0.6Bhttps://huggingface.co/Qwen/Qwen3-Reranker-0.6B
Qwen/Qwen3-Reranker-4Bhttps://huggingface.co/Qwen/Qwen3-Reranker-4B
Qwen/Qwen3-Reranker-8Bhttps://huggingface.co/Qwen/Qwen3-Reranker-8B
Qwen/Qwen3-Reranker-0.6Bhttps://huggingface.co/Tongjilibo/bert4torch_config/tree/main/Qwen/Qwen3-Reranker-0.6B
Qwen/Qwen3-Reranker-4Bhttps://huggingface.co/Tongjilibo/bert4torch_config/tree/main/Qwen/Qwen3-Reranker-4B
Qwen/Qwen3-Reranker-8Bhttps://huggingface.co/Tongjilibo/bert4torch_config/tree/main/Qwen/Qwen3-Reranker-8B
InternLMhttps://github.com/InternLM/InternLM
internlm/internlm-7bhttps://huggingface.co/internlm/internlm-7b
internlm/internlm-chat-7bhttps://huggingface.co/internlm/internlm-chat-7b
internlm/internlm-7bhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/internlm/internlm-7b/bert4torch_config.json
internlm/internlm-chat-7bhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/internlm/internlm-chat-7b/bert4torch_config.json
InternLM2https://huggingface.co/collections/internlm/internlm2-65b0ce04970888799707893c
internlm/internlm2-1_8bhttps://huggingface.co/internlm/internlm2-1_8b
internlm/internlm2-chat-1_8bhttps://huggingface.co/internlm/internlm2-chat-1_8b
internlm/internlm2-7bhttps://huggingface.co/internlm/internlm2-7b
internlm/internlm2-chat-7bhttps://huggingface.co/internlm/internlm2-chat-7b
internlm/internlm2-20bhttps://huggingface.co/internlm/internlm2-20b
internlm/internlm2-chat-20bhttps://huggingface.co/internlm/internlm2-chat-20b
internlm/internlm2-1_8bhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/internlm/internlm2-1_8b/bert4torch_config.json
internlm/internlm2-chat-1_8bhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/internlm/internlm2-chat-1_8b/bert4torch_config.json
internlm/internlm2-7bhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/internlm/internlm2-7b/bert4torch_config.json
internlm/internlm2-chat-7bhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/internlm/internlm2-chat-7b/bert4torch_config.json
InternLM2.5https://huggingface.co/collections/internlm/internlm25-66853f32717072d17581bc13
internlm/internlm2_5-7bhttps://huggingface.co/internlm/internlm2_5-7b
internlm/internlm2_5-7b-chathttps://huggingface.co/internlm/internlm2_5-7b-chat
internlm/internlm2_5-7b-chat-1mhttps://huggingface.co/internlm/internlm2_5-7b-chat-1m
internlm/internlm2_5-7bhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/internlm/internlm2_5-7b/bert4torch_config.json
internlm/internlm2_5-7b-chathttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/internlm/internlm2_5-7b-chat/bert4torch_config.json
internlm/internlm2_5-7b-chat-1mhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/internlm/internlm2_5-7b-chat-1m/bert4torch_config.json
InternLM3https://huggingface.co/collections/internlm/internlm3-67875827c377690c01a9131d
internlm/internlm3-8b-instructhttps://huggingface.co/internlm/internlm3-8b-instruct
internlm/internlm3-8b-instructhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/internlm/internlm3-8b-instruct/bert4torch_config.json
InternVL 1.0-1.5https://github.com/OpenGVLab/InternVL
OpenGVLab/Mini-InternVL-Chat-4B-V1-5https://huggingface.co/OpenGVLab/Mini-InternVL-Chat-4B-V1-5
OpenGVLab/Mini-InternVL-Chat-2B-V1-5https://huggingface.co/OpenGVLab/Mini-InternVL-Chat-2B-V1-5
InternVL 2.0https://github.com/OpenGVLab/InternVL
OpenGVLab/InternVL2-1Bhttps://huggingface.co/OpenGVLab/InternVL2-1B
OpenGVLab/InternVL2-2Bhttps://huggingface.co/OpenGVLab/InternVL2-2B
OpenGVLab/InternVL2-4Bhttps://huggingface.co/OpenGVLab/InternVL2-4B
OpenGVLab/InternVL2-8Bhttps://huggingface.co/OpenGVLab/InternVL2-8B
InternVL 2.5https://github.com/OpenGVLab/InternVL
OpenGVLab/InternVL2_5-1Bhttps://huggingface.co/OpenGVLab/InternVL2_5-1B
OpenGVLab/InternVL2_5-2Bhttps://huggingface.co/OpenGVLab/InternVL2_5-2B
OpenGVLab/InternVL2_5-4Bhttps://huggingface.co/OpenGVLab/InternVL2_5-4B
OpenGVLab/InternVL2_5-8Bhttps://huggingface.co/OpenGVLab/InternVL2_5-8B
OpenGVLab/InternVL2_5-1Bhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/OpenGVLab/InternVL2_5-1B/bert4torch_config.json
Falconhttps://huggingface.co/tiiuae
tiiuae/falcon-rw-1bhttps://huggingface.co/tiiuae/falcon-rw-1b
tiiuae/falcon-7bhttps://huggingface.co/tiiuae/falcon-7b
tiiuae/falcon-7b-instructhttps://huggingface.co/tiiuae/falcon-7b-instruct
tiiuae/falcon-rw-1bhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/tiiuae/falcon-rw-1b/bert4torch_config.json
tiiuae/falcon-7bhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/tiiuae/falcon-7b/bert4torch_config.json
tiiuae/falcon-7b-instructhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/tiiuae/falcon-7b-instruct/bert4torch_config.json
DeepSeek-MoEhttps://github.com/deepseek-ai/DeepSeek-MoE
deepseek-ai/deepseek-moe-16b-basehttps://huggingface.co/deepseek-ai/deepseek-moe-16b-base
deepseek-ai/deepseek-moe-16b-chathttps://huggingface.co/deepseek-ai/deepseek-moe-16b-chat
deepseek-ai/deepseek-moe-16b-basehttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/deepseek-ai/deepseek-moe-16b-base/bert4torch_config.json
deepseek-ai/deepseek-moe-16b-chathttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/deepseek-ai/deepseek-moe-16b-chat/bert4torch_config.json
DeepSeek-LLMhttps://github.com/deepseek-ai/DeepSeek-LLM
deepseek-ai/deepseek-llm-7b-basehttps://huggingface.co/deepseek-ai/deepseek-llm-7b-base
deepseek-ai/deepseek-llm-7b-chathttps://huggingface.co/deepseek-ai/deepseek-llm-7b-chat
deepseek-ai/deepseek-llm-7b-basehttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/deepseek-ai/deepseek-llm-7b-base/bert4torch_config.json
deepseek-ai/deepseek-llm-7b-chathttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/deepseek-ai/deepseek-llm-7b-chat/bert4torch_config.json
DeepSeek-V2https://github.com/deepseek-ai/DeepSeek-V2
deepseek-ai/DeepSeek-V2-Litehttps://huggingface.co/deepseek-ai/DeepSeek-V2-Lite
deepseek-ai/DeepSeek-V2-Lite-Chathttps://huggingface.co/deepseek-ai/DeepSeek-V2-Lite-Chat
deepseek-ai/DeepSeek-V2-Litehttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/deepseek-ai/DeepSeek-V2-Lite/bert4torch_config.json
deepseek-ai/DeepSeek-V2-Lite-Chathttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/deepseek-ai/DeepSeek-V2-Lite-Chat/bert4torch_config.json
DeepSeek-Coderhttps://github.com/deepseek-ai/DeepSeek-Coder
deepseek-ai/deepseek-coder-1.3b-basehttps://huggingface.co/deepseek-ai/deepseek-coder-1.3b-base
deepseek-ai/deepseek-coder-1.3b-instructhttps://huggingface.co/deepseek-ai/deepseek-coder-1.3b-instruct
deepseek-ai/deepseek-coder-6.7b-basehttps://huggingface.co/deepseek-ai/deepseek-coder-6.7b-base
deepseek-ai/deepseek-coder-6.7b-instructhttps://huggingface.co/deepseek-ai/deepseek-coder-6.7b-instruct
deepseek-ai/deepseek-coder-7b-base-v1.5https://huggingface.co/deepseek-ai/deepseek-coder-7b-base-v1.5
deepseek-ai/deepseek-coder-7b-instruct-v1.5https://huggingface.co/deepseek-ai/deepseek-coder-7b-instruct-v1.5
deepseek-ai/deepseek-coder-1.3b-basehttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/deepseek-ai/deepseek-coder-1.3b-base/bert4torch_config.json
deepseek-ai/deepseek-coder-1.3b-instructhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/deepseek-ai/deepseek-coder-1.3b-instruct/bert4torch_config.json
deepseek-ai/deepseek-coder-6.7b-basehttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/deepseek-ai/deepseek-coder-6.7b-base/bert4torch_config.json
deepseek-ai/deepseek-coder-6.7b-instructhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/deepseek-ai/deepseek-coder-6.7b-instruct/bert4torch_config.json
deepseek-ai/deepseek-coder-7b-base-v1.5https://huggingface.co/Tongjilibo/bert4torch_config/blob/main/deepseek-ai/deepseek-coder-7b-base-v1.5/bert4torch_config.json
deepseek-ai/deepseek-coder-7b-instruct-v1.5https://huggingface.co/Tongjilibo/bert4torch_config/blob/main/deepseek-ai/deepseek-coder-7b-instruct-v1.5/bert4torch_config.json
DeepSeek-Coder-V2https://github.com/deepseek-ai/DeepSeek-Coder-V2
deepseek-ai/DeepSeek-Coder-V2-Lite-Basehttps://huggingface.co/deepseek-ai/DeepSeek-Coder-V2-Lite-Base
deepseek-ai/DeepSeek-Coder-V2-Lite-Instructhttps://huggingface.co/deepseek-ai/DeepSeek-Coder-V2-Lite-Instruct
deepseek-ai/DeepSeek-Coder-V2-Lite-Basehttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/deepseek-ai/DeepSeek-Coder-V2-Lite-Base/bert4torch_config.json
deepseek-ai/DeepSeek-Coder-V2-Lite-Instructhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/deepseek-ai/DeepSeek-Coder-V2-Lite-Instruct/bert4torch_config.json
DeepSeek-Mathhttps://github.com/deepseek-ai/DeepSeek-Math
deepseek-ai/deepseek-math-7b-basehttps://huggingface.co/deepseek-ai/deepseek-math-7b-base
deepseek-ai/deepseek-math-7b-instructhttps://huggingface.co/deepseek-ai/deepseek-math-7b-instruct
deepseek-ai/deepseek-math-7b-rlhttps://huggingface.co/deepseek-ai/deepseek-math-7b-rl
deepseek-ai/deepseek-math-7b-basehttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/deepseek-ai/deepseek-math-7b-base/bert4torch_config.json
deepseek-ai/deepseek-math-7b-instructhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/deepseek-ai/deepseek-math-7b-instruct/bert4torch_config.json
deepseek-ai/deepseek-math-7b-rlhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/deepseek-ai/deepseek-math-7b-rl/bert4torch_config.json
DeepSeek-R1https://huggingface.co/collections/deepseek-ai/deepseek-r1-678e1e131c0169c0bc89728d
deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5Bhttps://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B
deepseek-ai/DeepSeek-R1-Distill-Qwen-7Bhttps://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-7B
deepseek-ai/DeepSeek-R1-Distill-Llama-8Bhttps://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Llama-8B
deepseek-ai/DeepSeek-R1-Distill-Qwen-14Bhttps://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-14B
deepseek-ai/DeepSeek-R1-Distill-Qwen-32Bhttps://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-32B
deepseek-ai/DeepSeek-R1-0528-Qwen3-8Bhttps://huggingface.co/deepseek-ai/DeepSeek-R1-0528-Qwen3-8B
deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5Bhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B/bert4torch_config.json
deepseek-ai/DeepSeek-R1-Distill-Qwen-7Bhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/deepseek-ai/DeepSeek-R1-Distill-Qwen-7B/bert4torch_config.json
deepseek-ai/DeepSeek-R1-Distill-Llama-8Bhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/deepseek-ai/DeepSeek-R1-Distill-Llama-8B/bert4torch_config.json
deepseek-ai/DeepSeek-R1-Distill-Qwen-14Bhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/deepseek-ai/DeepSeek-R1-Distill-Qwen-14B/bert4torch_config.json
deepseek-ai/DeepSeek-R1-Distill-Qwen-32Bhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/deepseek-ai/DeepSeek-R1-Distill-Qwen-32B/bert4torch_config.json
deepseek-ai/DeepSeek-R1-0528-Qwen3-8Bhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/deepseek-ai/DeepSeek-R1-0528-Qwen3-8B/bert4torch_config.json
Seed-OSShttps://huggingface.co/collections/ByteDance-Seed/seed-oss-68a609f4201e788db05b5dcd
ByteDance-Seed/Seed-OSS-36B-Instructhttps://huggingface.co/ByteDance-Seed/Seed-OSS-36B-Instruct
ByteDance-Seed/Seed-OSS-36B-Basehttps://huggingface.co/ByteDance-Seed/Seed-OSS-36B-Base
ByteDance-Seed/Seed-OSS-36B-Base-woSynhttps://huggingface.co/ByteDance-Seed/Seed-OSS-36B-Base-woSyn
Ernie4_5https://huggingface.co/collections/baidu/ernie-45-6861cd4c9be84540645f35c9
baidu/ERNIE-4.5-0.3B-Base-PThttps://huggingface.co/baidu/ERNIE-4.5-0.3B-Base-PT
baidu/ERNIE-4.5-0.3B-PThttps://huggingface.co/baidu/ERNIE-4.5-0.3B-PT
baidu/ERNIE-4.5-21B-A3B-Base-PThttps://huggingface.co/baidu/ERNIE-4.5-21B-A3B-Base-PT
baidu/ERNIE-4.5-21B-A3B-PThttps://huggingface.co/baidu/ERNIE-4.5-21B-A3B-PT
baidu/ERNIE-4.5-VL-28B-A3B-Base-PThttps://huggingface.co/baidu/ERNIE-4.5-VL-28B-A3B-Base-PT
baidu/ERNIE-4.5-VL-28B-A3B-PThttps://huggingface.co/baidu/ERNIE-4.5-VL-28B-A3B-PT
baidu/ERNIE-4.5-0.3B-Base-PThttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/baidu/ERNIE-4.5-0.3B-Base-PT/bert4torch_config.json
baidu/ERNIE-4.5-0.3B-PThttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/baidu/ERNIE-4.5-0.3B-PT/bert4torch_config.json
PaddleOCR-VLhttps://huggingface.co/PaddlePaddle/PaddleOCR-VL
PaddlePaddle/PaddleOCR-VLhttps://huggingface.co/PaddlePaddle/PaddleOCR-VL
PaddlePaddle/PaddleOCR-VLhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/PaddlePaddle/PaddleOCR-VL/bert4torch_config.json
MiniCPMhttps://github.com/OpenBMB/MiniCPM
openbmb/MiniCPM-2B-sft-bf16https://huggingface.co/openbmb/MiniCPM-2B-sft-bf16
openbmb/MiniCPM-2B-dpo-bf16https://huggingface.co/openbmb/MiniCPM-2B-dpo-bf16
openbmb/MiniCPM-2B-128khttps://huggingface.co/openbmb/MiniCPM-2B-128k
openbmb/MiniCPM-1B-sft-bf16https://huggingface.co/openbmb/MiniCPM-1B-sft-bf16
openbmb/MiniCPM3-4Bhttps://huggingface.co/openbmb/MiniCPM3-4B
openbmb/MiniCPM4-0.5Bhttps://huggingface.co/openbmb/MiniCPM4-0.5B
openbmb/MiniCPM4-8Bhttps://huggingface.co/openbmb/MiniCPM4-8B
openbmb/MiniCPM-2B-sft-bf16https://huggingface.co/Tongjilibo/bert4torch_config/blob/main/openbmb/MiniCPM-2B-sft-bf16/bert4torch_config.json
openbmb/MiniCPM-2B-dpo-bf16https://huggingface.co/Tongjilibo/bert4torch_config/blob/main/openbmb/MiniCPM-2B-dpo-bf16/bert4torch_config.json
openbmb/MiniCPM-2B-128khttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/openbmb/MiniCPM-2B-128k/bert4torch_config.json
openbmb/MiniCPM-1B-sft-bf16https://huggingface.co/Tongjilibo/bert4torch_config/blob/main/openbmb/MiniCPM-1B-sft-bf16/bert4torch_config.json
MiniCPM-ohttps://github.com/OpenBMB/MiniCPM-o
openbmb/MiniCPM-Llama3-V-2_5https://huggingface.co/openbmb/MiniCPM-Llama3-V-2_5
openbmb/MiniCPM-V-2_6https://huggingface.co/openbmb/MiniCPM-V-2_6
openbmb/MiniCPM-o-2_6https://huggingface.co/openbmb/MiniCPM-o-2_6
openbmb/MiniCPM-V-4https://huggingface.co/openbmb/MiniCPM-V-4
openbmb/MiniCPM-Llama3-V-2_5https://huggingface.co/Tongjilibo/bert4torch_config/blob/main/openbmb/MiniCPM-Llama3-V-2_5/bert4torch_config.json
openbmb/MiniCPM-V-2_6https://huggingface.co/Tongjilibo/bert4torch_config/blob/main/openbmb/MiniCPM-V-2_6/bert4torch_config.json
text2vec-base-chinesehttps://github.com/shibing624/text2vec
shibing624/text2vec-base-chinesehttps://huggingface.co/shibing624/text2vec-base-chinese
shibing624/text2vec-base-chinesehttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/shibing624/text2vec-base-chinese/bert4torch_config.json
m3ehttps://github.com/wangyuxinwhy/uniem
moka-ai/m3e-basehttps://huggingface.co/moka-ai/m3e-base
moka-ai/m3e-basehttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/moka-ai/m3e-base/bert4torch_config.json
BAAI/bge-large-en-v1.5https://huggingface.co/BAAI/bge-large-en-v1.5
BAAI/bge-large-zh-v1.5https://huggingface.co/BAAI/bge-large-zh-v1.5
BAAI/bge-base-en-v1.5https://huggingface.co/BAAI/bge-base-en-v1.5
BAAI/bge-base-zh-v1.5https://huggingface.co/BAAI/bge-base-zh-v1.5
BAAI/bge-small-en-v1.5https://huggingface.co/BAAI/bge-small-en-v1.5
BAAI/bge-small-zh-v1.5https://huggingface.co/BAAI/bge-small-zh-v1.5
BAAI/bge-large-en-v1.5https://huggingface.co/Tongjilibo/bert4torch_config/blob/main/BAAI/bge-large-en-v1.5/bert4torch_config.json
BAAI/bge-large-zh-v1.5https://huggingface.co/Tongjilibo/bert4torch_config/blob/main/BAAI/bge-large-zh-v1.5/bert4torch_config.json
BAAI/bge-base-en-v1.5https://huggingface.co/Tongjilibo/bert4torch_config/blob/main/BAAI/bge-base-en-v1.5/bert4torch_config.json
BAAI/bge-base-zh-v1.5https://huggingface.co/Tongjilibo/bert4torch_config/blob/main/BAAI/bge-base-zh-v1.5/bert4torch_config.json
BAAI/bge-small-en-v1.5https://huggingface.co/Tongjilibo/bert4torch_config/blob/main/BAAI/bge-small-en-v1.5/bert4torch_config.json
BAAI/bge-small-zh-v1.5https://huggingface.co/Tongjilibo/bert4torch_config/blob/main/BAAI/bge-small-zh-v1.5/bert4torch_config.json
thenlper/gte-large-zhhttps://huggingface.co/thenlper/gte-large-zh
thenlper/gte-base-zhhttps://huggingface.co/thenlper/gte-base-zh
thenlper/gte-base-zhhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/thenlper/gte-base-zh/bert4torch_config.json
thenlper/gte-large-zhhttps://huggingface.co/Tongjilibo/bert4torch_config/blob/main/thenlper/gte-large-zh/bert4torch_config.json
https://patch-diff.githubusercontent.com/Tongjilibo/bert4torch#6-鸣谢
bert4kerashttps://github.com/bojone/bert4keras
bert4pytorchhttps://github.com/MuQiuJun-AI/bert4pytorch
https://patch-diff.githubusercontent.com/Tongjilibo/bert4torch#7-引用
https://patch-diff.githubusercontent.com/Tongjilibo/bert4torch#8-其他
WeChat IDhttps://github.com/Tongjilibo
WeChat grouphttps://github.com/Tongjilibo
Star History Charthttps://star-history.com/#Tongjilibo/bert4torch&Date
bert4torch.readthedocs.io/https://bert4torch.readthedocs.io/
nlp https://patch-diff.githubusercontent.com/topics/nlp
text-classification https://patch-diff.githubusercontent.com/topics/text-classification
transformers https://patch-diff.githubusercontent.com/topics/transformers
pytorch https://patch-diff.githubusercontent.com/topics/pytorch
named-entity-recognition https://patch-diff.githubusercontent.com/topics/named-entity-recognition
seq2seq https://patch-diff.githubusercontent.com/topics/seq2seq
llama https://patch-diff.githubusercontent.com/topics/llama
bert https://patch-diff.githubusercontent.com/topics/bert
relation-extraction https://patch-diff.githubusercontent.com/topics/relation-extraction
belle https://patch-diff.githubusercontent.com/topics/belle
bert4keras https://patch-diff.githubusercontent.com/topics/bert4keras
large-language-models https://patch-diff.githubusercontent.com/topics/large-language-models
llm https://patch-diff.githubusercontent.com/topics/llm
bert4torch https://patch-diff.githubusercontent.com/topics/bert4torch
chatglm https://patch-diff.githubusercontent.com/topics/chatglm
Readme https://patch-diff.githubusercontent.com/Tongjilibo/bert4torch#readme-ov-file
MIT license https://patch-diff.githubusercontent.com/Tongjilibo/bert4torch#MIT-1-ov-file
Activityhttps://patch-diff.githubusercontent.com/Tongjilibo/bert4torch/activity
1.3k starshttps://patch-diff.githubusercontent.com/Tongjilibo/bert4torch/stargazers
13 watchinghttps://patch-diff.githubusercontent.com/Tongjilibo/bert4torch/watchers
169 forkshttps://patch-diff.githubusercontent.com/Tongjilibo/bert4torch/forks
Releases 44https://patch-diff.githubusercontent.com/Tongjilibo/bert4torch/releases
Stable release v0.6.0 Latest Sep 25, 2025 https://patch-diff.githubusercontent.com/Tongjilibo/bert4torch/releases/tag/v0.6.0
+ 43 releaseshttps://patch-diff.githubusercontent.com/Tongjilibo/bert4torch/releases
Packages 0https://patch-diff.githubusercontent.com/users/Tongjilibo/packages?repo_name=bert4torch
Contributors 4https://patch-diff.githubusercontent.com/Tongjilibo/bert4torch/graphs/contributors
Python 100.0% https://patch-diff.githubusercontent.com/Tongjilibo/bert4torch/search?l=python
