Title: GitHub - voidful/TextRL: Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) on any generation model in huggingface's transformer (blommz-176B/bloom/gpt/bart/T5/MetaICL)
Open Graph Title: GitHub - voidful/TextRL: Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) on any generation model in huggingface's transformer (blommz-176B/bloom/gpt/bart/T5/MetaICL)
X Title: GitHub - voidful/TextRL: Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) on any generation model in huggingface's transformer (blommz-176B/bloom/gpt/bart/T5/MetaICL)
Description: Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) on any generation model in huggingface's transformer (blommz-176B/bloom/gpt/bart/T5/MetaICL) - voidful/TextRL
Open Graph Description: Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) on any generation model in huggingface's transformer (blommz-176B/bloom/gpt/bart/T5/MetaICL) - voidful/TextRL
X Description: Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) on any generation model in huggingface's transformer (blommz-176B/bloom/gpt/bart/T5/MetaICL) - voidful/TextRL
Opengraph URL: https://github.com/voidful/TextRL
X: @github
Domain: patch-diff.githubusercontent.com
| route-pattern | /:user_id/:repository |
| route-controller | files |
| route-action | disambiguate |
| fetch-nonce | v2:bd99d16d-1f20-a6d0-9e54-7023276e7a5a |
| current-catalog-service-hash | f3abb0cc802f3d7b95fc8762b94bdcb13bf39634c40c357301c4aa1d67a256fb |
| request-id | 813A:30E8A:8170165:A97EA1D:697576EA |
| html-safe-nonce | 4747a4014183494c4e1d50df80434b01fa1df8114d611c10cb5c5f5e78464940 |
| visitor-payload | eyJyZWZlcnJlciI6IiIsInJlcXVlc3RfaWQiOiI4MTNBOjMwRThBOjgxNzAxNjU6QTk3RUExRDo2OTc1NzZFQSIsInZpc2l0b3JfaWQiOiI3MDEwNjc3NjYxNjUwNzQ1MDY2IiwicmVnaW9uX2VkZ2UiOiJpYWQiLCJyZWdpb25fcmVuZGVyIjoiaWFkIn0= |
| visitor-hmac | bb28dddfd961eb3003ce4b057c5122b36d556d6b3f420094b08875ff77910f0e |
| hovercard-subject-tag | repository:349008051 |
| github-keyboard-shortcuts | repository,copilot |
| google-site-verification | Apib7-x98H0j5cPqHWwSMm6dNU4GmODRoqxLiDzdx9I |
| octolytics-url | https://collector.github.com/github/collect |
| analytics-location | / |
| fb:app_id | 1401488693436528 |
| apple-itunes-app | app-id=1477376905, app-argument=https://github.com/voidful/TextRL |
| twitter:image | https://opengraph.githubassets.com/f46142b7aede391cc80fdff28a6bd17656ee9b157c8b727e25fd8225ef4e74fc/voidful/TextRL |
| twitter:card | summary_large_image |
| og:image | https://opengraph.githubassets.com/f46142b7aede391cc80fdff28a6bd17656ee9b157c8b727e25fd8225ef4e74fc/voidful/TextRL |
| og:image:alt | Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) on any generation model in huggingface's transformer (blommz-176B/bloom/gpt/bart/T5/MetaICL) - voidful/TextRL |
| og:image:width | 1200 |
| og:image:height | 600 |
| og:site_name | GitHub |
| og:type | object |
| hostname | github.com |
| expected-hostname | github.com |
| None | 4a4bf5f4e28041a9d2e5c107d7d20b78b4294ba261cab243b28167c16a623a1f |
| turbo-cache-control | no-preview |
| go-import | github.com/voidful/TextRL git https://github.com/voidful/TextRL.git |
| octolytics-dimension-user_id | 10904842 |
| octolytics-dimension-user_login | voidful |
| octolytics-dimension-repository_id | 349008051 |
| octolytics-dimension-repository_nwo | voidful/TextRL |
| octolytics-dimension-repository_public | true |
| octolytics-dimension-repository_is_fork | false |
| octolytics-dimension-repository_network_root_id | 349008051 |
| octolytics-dimension-repository_network_root_nwo | voidful/TextRL |
| turbo-body-classes | logged-out env-production page-responsive |
| disable-turbo | false |
| browser-stats-url | https://api.github.com/_private/browser/stats |
| browser-errors-url | https://api.github.com/_private/browser/errors |
| release | 488b30e96dfd057fbbe44c6665ccbc030b729dde |
| ui-target | full |
| theme-color | #1e2327 |
| color-scheme | light dark |
Links:
Viewport: width=device-width