René's URL Explorer Experiment


Title: GitHub - uclaml/SPIN: The official implementation of Self-Play Fine-Tuning (SPIN)

Open Graph Title: GitHub - uclaml/SPIN: The official implementation of Self-Play Fine-Tuning (SPIN)

X Title: GitHub - uclaml/SPIN: The official implementation of Self-Play Fine-Tuning (SPIN)

Description: The official implementation of Self-Play Fine-Tuning (SPIN) - uclaml/SPIN

Open Graph Description: The official implementation of Self-Play Fine-Tuning (SPIN) - uclaml/SPIN

X Description: The official implementation of Self-Play Fine-Tuning (SPIN) - uclaml/SPIN

Opengraph URL: https://github.com/uclaml/SPIN

X: @github

direct link

Domain: patch-diff.githubusercontent.com

route-pattern/:user_id/:repository
route-controllerfiles
route-actiondisambiguate
fetch-noncev2:7e8a1a01-810f-7110-ebd5-af37b12de1cc
current-catalog-service-hashf3abb0cc802f3d7b95fc8762b94bdcb13bf39634c40c357301c4aa1d67a256fb
request-idBE5E:3302AB:80FBC07:AD5185A:698CEE94
html-safe-nonced5e019270b91c9ca57faa04a1828c67ac6f482c0fdc9362c896f1545f852d7d4
visitor-payloadeyJyZWZlcnJlciI6IiIsInJlcXVlc3RfaWQiOiJCRTVFOjMzMDJBQjo4MEZCQzA3OkFENTE4NUE6Njk4Q0VFOTQiLCJ2aXNpdG9yX2lkIjoiNzcwMDA4ODI3OTk0OTcwMDc1NiIsInJlZ2lvbl9lZGdlIjoiaWFkIiwicmVnaW9uX3JlbmRlciI6ImlhZCJ9
visitor-hmac83665eecd6baecceefe46badf7bd0b9380c89b3d2e6107d4155dfc4ac83d3ca1
hovercard-subject-tagrepository:752815135
github-keyboard-shortcutsrepository,copilot
google-site-verificationApib7-x98H0j5cPqHWwSMm6dNU4GmODRoqxLiDzdx9I
octolytics-urlhttps://collector.github.com/github/collect
analytics-location//
fb:app_id1401488693436528
apple-itunes-appapp-id=1477376905, app-argument=https://github.com/uclaml/SPIN
twitter:imagehttps://opengraph.githubassets.com/9d81262988025682a0b9623a3d31f84b92eed2e8f5af7ad316c443fd81b65b30/uclaml/SPIN
twitter:cardsummary_large_image
og:imagehttps://opengraph.githubassets.com/9d81262988025682a0b9623a3d31f84b92eed2e8f5af7ad316c443fd81b65b30/uclaml/SPIN
og:image:altThe official implementation of Self-Play Fine-Tuning (SPIN) - uclaml/SPIN
og:image:width1200
og:image:height600
og:site_nameGitHub
og:typeobject
hostnamegithub.com
expected-hostnamegithub.com
None640eeb7b6ff4d8d106235d228c0c286e82592d4d2403227b5b2b4fc5832297a4
turbo-cache-controlno-preview
go-importgithub.com/uclaml/SPIN git https://github.com/uclaml/SPIN.git
octolytics-dimension-user_id22385378
octolytics-dimension-user_loginuclaml
octolytics-dimension-repository_id752815135
octolytics-dimension-repository_nwouclaml/SPIN
octolytics-dimension-repository_publictrue
octolytics-dimension-repository_is_forkfalse
octolytics-dimension-repository_network_root_id752815135
octolytics-dimension-repository_network_root_nwouclaml/SPIN
turbo-body-classeslogged-out env-production page-responsive
disable-turbofalse
browser-stats-urlhttps://api.github.com/_private/browser/stats
browser-errors-urlhttps://api.github.com/_private/browser/errors
release3d444f0a47beeeac94cddbb51c91ab408befe8d4
ui-targetfull
theme-color#1e2327
color-schemelight dark

Links:

Skip to contenthttps://patch-diff.githubusercontent.com/uclaml/SPIN#start-of-content
https://patch-diff.githubusercontent.com/
Sign in https://patch-diff.githubusercontent.com/login?return_to=https%3A%2F%2Fgithub.com%2Fuclaml%2FSPIN
GitHub CopilotWrite better code with AIhttps://github.com/features/copilot
GitHub SparkBuild and deploy intelligent appshttps://github.com/features/spark
GitHub ModelsManage and compare promptshttps://github.com/features/models
MCP RegistryNewIntegrate external toolshttps://github.com/mcp
ActionsAutomate any workflowhttps://github.com/features/actions
CodespacesInstant dev environmentshttps://github.com/features/codespaces
IssuesPlan and track workhttps://github.com/features/issues
Code ReviewManage code changeshttps://github.com/features/code-review
GitHub Advanced SecurityFind and fix vulnerabilitieshttps://github.com/security/advanced-security
Code securitySecure your code as you buildhttps://github.com/security/advanced-security/code-security
Secret protectionStop leaks before they starthttps://github.com/security/advanced-security/secret-protection
Why GitHubhttps://github.com/why-github
Documentationhttps://docs.github.com
Bloghttps://github.blog
Changeloghttps://github.blog/changelog
Marketplacehttps://github.com/marketplace
View all featureshttps://github.com/features
Enterpriseshttps://github.com/enterprise
Small and medium teamshttps://github.com/team
Startupshttps://github.com/enterprise/startups
Nonprofitshttps://github.com/solutions/industry/nonprofits
App Modernizationhttps://github.com/solutions/use-case/app-modernization
DevSecOpshttps://github.com/solutions/use-case/devsecops
DevOpshttps://github.com/solutions/use-case/devops
CI/CDhttps://github.com/solutions/use-case/ci-cd
View all use caseshttps://github.com/solutions/use-case
Healthcarehttps://github.com/solutions/industry/healthcare
Financial serviceshttps://github.com/solutions/industry/financial-services
Manufacturinghttps://github.com/solutions/industry/manufacturing
Governmenthttps://github.com/solutions/industry/government
View all industrieshttps://github.com/solutions/industry
View all solutionshttps://github.com/solutions
AIhttps://github.com/resources/articles?topic=ai
Software Developmenthttps://github.com/resources/articles?topic=software-development
DevOpshttps://github.com/resources/articles?topic=devops
Securityhttps://github.com/resources/articles?topic=security
View all topicshttps://github.com/resources/articles
Customer storieshttps://github.com/customer-stories
Events & webinarshttps://github.com/resources/events
Ebooks & reportshttps://github.com/resources/whitepapers
Business insightshttps://github.com/solutions/executive-insights
GitHub Skillshttps://skills.github.com
Documentationhttps://docs.github.com
Customer supporthttps://support.github.com
Community forumhttps://github.com/orgs/community/discussions
Trust centerhttps://github.com/trust-center
Partnershttps://github.com/partners
GitHub SponsorsFund open source developershttps://github.com/sponsors
Security Labhttps://securitylab.github.com
Maintainer Communityhttps://maintainers.github.com
Acceleratorhttps://github.com/accelerator
Archive Programhttps://archiveprogram.github.com
Topicshttps://github.com/topics
Trendinghttps://github.com/trending
Collectionshttps://github.com/collections
Enterprise platformAI-powered developer platformhttps://github.com/enterprise
GitHub Advanced SecurityEnterprise-grade security featureshttps://github.com/security/advanced-security
Copilot for BusinessEnterprise-grade AI featureshttps://github.com/features/copilot/copilot-business
Premium SupportEnterprise-grade 24/7 supporthttps://github.com/premium-support
Pricinghttps://github.com/pricing
Search syntax tipshttps://docs.github.com/search-github/github-code-search/understanding-github-code-search-syntax
documentationhttps://docs.github.com/search-github/github-code-search/understanding-github-code-search-syntax
Sign in https://patch-diff.githubusercontent.com/login?return_to=https%3A%2F%2Fgithub.com%2Fuclaml%2FSPIN
Sign up https://patch-diff.githubusercontent.com/signup?ref_cta=Sign+up&ref_loc=header+logged+out&ref_page=%2F%3Cuser-name%3E%2F%3Crepo-name%3E&source=header-repo&source_repo=uclaml%2FSPIN
Reloadhttps://patch-diff.githubusercontent.com/uclaml/SPIN
Reloadhttps://patch-diff.githubusercontent.com/uclaml/SPIN
Reloadhttps://patch-diff.githubusercontent.com/uclaml/SPIN
uclaml https://patch-diff.githubusercontent.com/uclaml
SPINhttps://patch-diff.githubusercontent.com/uclaml/SPIN
Notifications https://patch-diff.githubusercontent.com/login?return_to=%2Fuclaml%2FSPIN
Fork 104 https://patch-diff.githubusercontent.com/login?return_to=%2Fuclaml%2FSPIN
Star 1.2k https://patch-diff.githubusercontent.com/login?return_to=%2Fuclaml%2FSPIN
uclaml.github.io/SPIN/https://uclaml.github.io/SPIN/
Apache-2.0 license https://patch-diff.githubusercontent.com/uclaml/SPIN/blob/main/LICENSE
1.2k stars https://patch-diff.githubusercontent.com/uclaml/SPIN/stargazers
104 forks https://patch-diff.githubusercontent.com/uclaml/SPIN/forks
Branches https://patch-diff.githubusercontent.com/uclaml/SPIN/branches
Tags https://patch-diff.githubusercontent.com/uclaml/SPIN/tags
Activity https://patch-diff.githubusercontent.com/uclaml/SPIN/activity
Star https://patch-diff.githubusercontent.com/login?return_to=%2Fuclaml%2FSPIN
Notifications https://patch-diff.githubusercontent.com/login?return_to=%2Fuclaml%2FSPIN
Code https://patch-diff.githubusercontent.com/uclaml/SPIN
Issues 24 https://patch-diff.githubusercontent.com/uclaml/SPIN/issues
Pull requests 0 https://patch-diff.githubusercontent.com/uclaml/SPIN/pulls
Actions https://patch-diff.githubusercontent.com/uclaml/SPIN/actions
Projects 0 https://patch-diff.githubusercontent.com/uclaml/SPIN/projects
Security 0 https://patch-diff.githubusercontent.com/uclaml/SPIN/security
Insights https://patch-diff.githubusercontent.com/uclaml/SPIN/pulse
Code https://patch-diff.githubusercontent.com/uclaml/SPIN
Issues https://patch-diff.githubusercontent.com/uclaml/SPIN/issues
Pull requests https://patch-diff.githubusercontent.com/uclaml/SPIN/pulls
Actions https://patch-diff.githubusercontent.com/uclaml/SPIN/actions
Projects https://patch-diff.githubusercontent.com/uclaml/SPIN/projects
Security https://patch-diff.githubusercontent.com/uclaml/SPIN/security
Insights https://patch-diff.githubusercontent.com/uclaml/SPIN/pulse
Brancheshttps://patch-diff.githubusercontent.com/uclaml/SPIN/branches
Tagshttps://patch-diff.githubusercontent.com/uclaml/SPIN/tags
https://patch-diff.githubusercontent.com/uclaml/SPIN/branches
https://patch-diff.githubusercontent.com/uclaml/SPIN/tags
100 Commitshttps://patch-diff.githubusercontent.com/uclaml/SPIN/commits/main/
https://patch-diff.githubusercontent.com/uclaml/SPIN/commits/main/
configshttps://patch-diff.githubusercontent.com/uclaml/SPIN/tree/main/configs
configshttps://patch-diff.githubusercontent.com/uclaml/SPIN/tree/main/configs
imageshttps://patch-diff.githubusercontent.com/uclaml/SPIN/tree/main/images
imageshttps://patch-diff.githubusercontent.com/uclaml/SPIN/tree/main/images
scriptshttps://patch-diff.githubusercontent.com/uclaml/SPIN/tree/main/scripts
scriptshttps://patch-diff.githubusercontent.com/uclaml/SPIN/tree/main/scripts
spinhttps://patch-diff.githubusercontent.com/uclaml/SPIN/tree/main/spin
spinhttps://patch-diff.githubusercontent.com/uclaml/SPIN/tree/main/spin
.gitattributeshttps://patch-diff.githubusercontent.com/uclaml/SPIN/blob/main/.gitattributes
.gitattributeshttps://patch-diff.githubusercontent.com/uclaml/SPIN/blob/main/.gitattributes
.gitignorehttps://patch-diff.githubusercontent.com/uclaml/SPIN/blob/main/.gitignore
.gitignorehttps://patch-diff.githubusercontent.com/uclaml/SPIN/blob/main/.gitignore
LICENSEhttps://patch-diff.githubusercontent.com/uclaml/SPIN/blob/main/LICENSE
LICENSEhttps://patch-diff.githubusercontent.com/uclaml/SPIN/blob/main/LICENSE
README.mdhttps://patch-diff.githubusercontent.com/uclaml/SPIN/blob/main/README.md
README.mdhttps://patch-diff.githubusercontent.com/uclaml/SPIN/blob/main/README.md
setup.cfghttps://patch-diff.githubusercontent.com/uclaml/SPIN/blob/main/setup.cfg
setup.cfghttps://patch-diff.githubusercontent.com/uclaml/SPIN/blob/main/setup.cfg
setup.pyhttps://patch-diff.githubusercontent.com/uclaml/SPIN/blob/main/setup.py
setup.pyhttps://patch-diff.githubusercontent.com/uclaml/SPIN/blob/main/setup.py
READMEhttps://patch-diff.githubusercontent.com/uclaml/SPIN
Apache-2.0 licensehttps://patch-diff.githubusercontent.com/uclaml/SPIN
https://patch-diff.githubusercontent.com/uclaml/SPIN/blob/main/images/spin_dalle.png
Modelshttps://huggingface.co/collections/UCLA-AGI/zephyr-7b-sft-full-spin-65c361dfca65637272a02c40
Datasetshttps://huggingface.co/collections/UCLA-AGI/datasets-spin-65c3624e98d4b589bbc76f3a
https://patch-diff.githubusercontent.com/uclaml/SPIN#self-play-fine-tuning-spin
https://camo.githubusercontent.com/5a871a53a0a61d6b628396d930a65232f31bb63fd6676c1bcdae8f0f905fc70b/68747470733a2f2f696d672e736869656c64732e696f2f62616467652f4d6f64656c2d4d69737472616c2d2d37422d2d76302e312d677265656e
https://camo.githubusercontent.com/aef22c467ad559d6997a9f2a14012a833dc5f0323950e58e275491955617921b/68747470733a2f2f696d672e736869656c64732e696f2f62616467652f5461736b2d4f70656e5f4c4c4d5f4c6561646572626f6172642d726564
https://camo.githubusercontent.com/e6dd5037fe43763569c89bb0ab419bb1940acdc2fd7930f9115723a70409e92f/68747470733a2f2f696d672e736869656c64732e696f2f62616467652f5461736b2d4d542d2d42656e63682d726564
Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Modelshttps://arxiv.org/abs/2401.01335
Zixiang Chenhttps://sites.google.com/view/zxchen
Yihe Denghttps://sites.google.com/g.ucla.edu/yihedeng/
Huizhuo Yuanhttps://scholar.google.com/citations?user=8foZzX4AAAAJ
Kaixuan Jihttps://scholar.google.com/citations?user=FOoKDukAAAAJ
Quanquan Guhttps://web.cs.ucla.edu/~qgu/
Webpagehttps://uclaml.github.io/SPIN/
Huggingfacehttps://huggingface.co/papers/2401.01335
https://patch-diff.githubusercontent.com/uclaml/SPIN#-news
https://arxiv.org/abs/2401.01335https://arxiv.org/abs/2401.01335
https://arxiv.org/abs/2401.01335https://arxiv.org/abs/2401.01335
Alignment Handbookhttps://github.com/huggingface/alignment-handbook
Confighttps://github.com/huggingface/alignment-handbook/blob/61a11a5c7d66179ed0a930b0dd12e532fce701dd/recipes/zephyr-7b-beta/dpo/config_full.yaml
Modelhttps://huggingface.co/alignment-handbook/zephyr-7b-sft-full/tree/ac6e600eefcce74f5e8bae1035d4f66019e93190
datasetshttps://huggingface.co/collections/UCLA-AGI/datasets-spin-65c3624e98d4b589bbc76f3a
https://patch-diff.githubusercontent.com/uclaml/SPIN#table-of-contents
About SPINhttps://patch-diff.githubusercontent.com/uclaml/SPIN#%F0%9F%8C%80-about-spin
Setuphttps://patch-diff.githubusercontent.com/uclaml/SPIN#Setup
Datahttps://patch-diff.githubusercontent.com/uclaml/SPIN#Data
Modelhttps://patch-diff.githubusercontent.com/uclaml/SPIN#Model
Usagehttps://patch-diff.githubusercontent.com/uclaml/SPIN#Usage
Step 1: Generationhttps://patch-diff.githubusercontent.com/uclaml/SPIN#step-1-generation
Faster generation with vLLMhttps://patch-diff.githubusercontent.com/uclaml/SPIN#%F0%9F%9A%80-faster-generation-with-vllm
Step 1.5: Gather generations and convert data typehttps://patch-diff.githubusercontent.com/uclaml/SPIN#step-15-gather-generations-and-convert-data-type
Step 2: Fine-tuninghttps://patch-diff.githubusercontent.com/uclaml/SPIN#step-2-fine-tuning
Reproducing Our Resultshttps://patch-diff.githubusercontent.com/uclaml/SPIN#Reproducing-Our-Results
Evaluationhttps://patch-diff.githubusercontent.com/uclaml/SPIN#Evaluation
Citationhttps://patch-diff.githubusercontent.com/uclaml/SPIN#Citation
Acknowledgementhttps://patch-diff.githubusercontent.com/uclaml/SPIN#Acknowledgement
https://patch-diff.githubusercontent.com/uclaml/SPIN#-about-spin
https://patch-diff.githubusercontent.com/uclaml/SPIN/blob/main/images/iter_openllm.png
https://patch-diff.githubusercontent.com/uclaml/SPIN/blob/main/images/dpo_compare.png
herehttps://arxiv.org/abs/2401.01335
https://patch-diff.githubusercontent.com/uclaml/SPIN#setup
https://patch-diff.githubusercontent.com/uclaml/SPIN#data
HuggingFacehttps://huggingface.co/datasets/UCLA-AGI/SPIN_iter0
HuggingFacehttps://huggingface.co/datasets/UCLA-AGI/SPIN_iter1
HuggingFacehttps://huggingface.co/datasets/UCLA-AGI/SPIN_iter2
HuggingFacehttps://huggingface.co/datasets/UCLA-AGI/SPIN_iter3
HuggingFaceH4/ultrafeedback_binarizedhttps://huggingface.co/datasets/HuggingFaceH4/ultrafeedback_binarized
https://patch-diff.githubusercontent.com/uclaml/SPIN#model
HuggingFacehttps://huggingface.co/UCLA-AGI/zephyr-7b-sft-full-SPIN-iter0
HuggingFacehttps://huggingface.co/UCLA-AGI/zephyr-7b-sft-full-SPIN-iter1
HuggingFacehttps://huggingface.co/UCLA-AGI/zephyr-7b-sft-full-SPIN-iter2
HuggingFacehttps://huggingface.co/UCLA-AGI/zephyr-7b-sft-full-SPIN-iter3
Step 2: Fine-tuninghttps://patch-diff.githubusercontent.com/uclaml/SPIN#step-2-fine-tuning
https://patch-diff.githubusercontent.com/uclaml/SPIN#usage
https://patch-diff.githubusercontent.com/uclaml/SPIN#step-0-optional-reformatting-sft-dataset
https://patch-diff.githubusercontent.com/uclaml/SPIN#step-1-generation
https://patch-diff.githubusercontent.com/uclaml/SPIN#-faster-generation-with-vllm
https://patch-diff.githubusercontent.com/uclaml/SPIN#step-15-gather-generations-and-convert-data-type
https://patch-diff.githubusercontent.com/uclaml/SPIN#step-2-fine-tuning
https://patch-diff.githubusercontent.com/uclaml/SPIN#reproducing-our-results
HuggingFacehttps://huggingface.co/datasets/UCLA-AGI/SPIN_iter0
HuggingFacehttps://huggingface.co/datasets/UCLA-AGI/SPIN_iter1
HuggingFacehttps://huggingface.co/datasets/UCLA-AGI/SPIN_iter2
HuggingFacehttps://huggingface.co/datasets/UCLA-AGI/SPIN_iter3
https://patch-diff.githubusercontent.com/uclaml/SPIN#evaluation
lm-evaluation-harnesshttps://github.com/EleutherAI/lm-evaluation-harness/tree/46c796644913cd99da7eee868e64f9ed6af33407
Leaderboardhttps://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard
https://patch-diff.githubusercontent.com/uclaml/SPIN#star-history
https://star-history.com/#uclaml/SPIN&Date
https://patch-diff.githubusercontent.com/uclaml/SPIN#citation
https://patch-diff.githubusercontent.com/uclaml/SPIN#acknowledgement
The Alignment Handbookhttps://github.com/huggingface/alignment-handbook
uclaml.github.io/SPIN/https://uclaml.github.io/SPIN/
deep-learning https://patch-diff.githubusercontent.com/topics/deep-learning
fine-tuning https://patch-diff.githubusercontent.com/topics/fine-tuning
self-play https://patch-diff.githubusercontent.com/topics/self-play
large-language-models https://patch-diff.githubusercontent.com/topics/large-language-models
Readme https://patch-diff.githubusercontent.com/uclaml/SPIN#readme-ov-file
Apache-2.0 license https://patch-diff.githubusercontent.com/uclaml/SPIN#Apache-2.0-1-ov-file
Please reload this pagehttps://patch-diff.githubusercontent.com/uclaml/SPIN
Activityhttps://patch-diff.githubusercontent.com/uclaml/SPIN/activity
1.2k starshttps://patch-diff.githubusercontent.com/uclaml/SPIN/stargazers
11 watchinghttps://patch-diff.githubusercontent.com/uclaml/SPIN/watchers
104 forkshttps://patch-diff.githubusercontent.com/uclaml/SPIN/forks
Report repository https://patch-diff.githubusercontent.com/contact/report-content?content_url=https%3A%2F%2Fgithub.com%2Fuclaml%2FSPIN&report=uclaml+%28user%29
Releaseshttps://patch-diff.githubusercontent.com/uclaml/SPIN/releases
Packages 0https://patch-diff.githubusercontent.com/users/uclaml/packages?repo_name=SPIN
Please reload this pagehttps://patch-diff.githubusercontent.com/uclaml/SPIN
Contributors 10https://patch-diff.githubusercontent.com/uclaml/SPIN/graphs/contributors
Please reload this pagehttps://patch-diff.githubusercontent.com/uclaml/SPIN
Python 87.8% https://patch-diff.githubusercontent.com/uclaml/SPIN/search?l=python
Shell 12.2% https://patch-diff.githubusercontent.com/uclaml/SPIN/search?l=shell
https://github.com
Termshttps://docs.github.com/site-policy/github-terms/github-terms-of-service
Privacyhttps://docs.github.com/site-policy/privacy-policies/github-privacy-statement
Securityhttps://github.com/security
Statushttps://www.githubstatus.com/
Communityhttps://github.community/
Docshttps://docs.github.com/
Contacthttps://support.github.com?tags=dotcom-footer

Viewport: width=device-width


URLs of crawlers that visited me.