Title: Prevent starchat from generating nonsense after ouptutting <|system|> <|end|> <|user|> · Issue #32 · bigcode-project/starcoder.cpp · GitHub
Open Graph Title: Prevent starchat from generating nonsense after ouptutting <|system|> <|end|> <|user|> · Issue #32 · bigcode-project/starcoder.cpp
X Title: Prevent starchat from generating nonsense after ouptutting <|system|> <|end|> <|user|> · Issue #32 · bigcode-project/starcoder.cpp
Description: When using starchat the model will likely start talking bullshit (in Spanish) after printing out the sequence: <|system|> <|end|> <|user|> I added a rudimentary fix to stop generating new tokens in case starchat is used after outputting ...
Open Graph Description: When using starchat the model will likely start talking bullshit (in Spanish) after printing out the sequence: <|system|> <|end|> <|user|> I added a rudimentary fix to stop generating new tokens in...
X Description: When using starchat the model will likely start talking bullshit (in Spanish) after printing out the sequence: <|system|> <|end|> <|user|> I added a rudimentary fix to stop genera...
Opengraph URL: https://github.com/bigcode-project/starcoder.cpp/issues/32
X: @github
Domain: patch-diff.githubusercontent.com
{"@context":"https://schema.org","@type":"DiscussionForumPosting","headline":"Prevent starchat from generating nonsense after ouptutting \u003c|system|\u003e \u003c|end|\u003e \u003c|user|\u003e","articleBody":"When using starchat the model will likely start talking bullshit (in Spanish) after printing out the sequence:\r\n\r\n```\r\n\u003c|system|\u003e \u003c|end|\u003e \u003c|user|\u003e\r\n```\r\n\r\nI added a rudimentary fix to stop generating new tokens in case starchat is used after outputting \u003c|system|\u003e \u003c|end|\u003e \u003c|user|\u003e:\r\n\r\n```\r\n // check if model is santacoder\r\n if (model.hparams.n_layer \u003c= 30 \u0026\u0026 embd.back() == 49152) {\r\n break;\r\n }\r\n // check if model is starcoder\r\n else if (embd.back() == 0) { //TODO: this is only for starcoder\r\n break;\r\n }\r\n // starchat since only these 3 models are supported atm\r\n else{\r\n // break to prevent starchat from talking gibberish\r\n if (output.find(\"\u003c|system|\u003e\\n\u003c|end|\u003e\\n\u003c|user|\u003e\") != std::string::npos) {\r\n break;\r\n }\r\n }\r\n```\r\n\r\nWould that be desirable pr? And also what is the correct way to detect if the model is indeed starcoder instead of using the default?","author":{"url":"https://github.com/KukumavMozolo","@type":"Person","name":"KukumavMozolo"},"datePublished":"2023-08-22T08:10:43.000Z","interactionStatistic":{"@type":"InteractionCounter","interactionType":"https://schema.org/CommentAction","userInteractionCount":0},"url":"https://github.com/32/starcoder.cpp/issues/32"}
| route-pattern | /_view_fragments/issues/show/:user_id/:repository/:id/issue_layout(.:format) |
| route-controller | voltron_issues_fragments |
| route-action | issue_layout |
| fetch-nonce | v2:fda21851-6400-5cfa-3dc6-24c2c492f892 |
| current-catalog-service-hash | 81bb79d38c15960b92d99bca9288a9108c7a47b18f2423d0f6438c5b7bcd2114 |
| request-id | 822C:29573C:38EA85:4F8BB3:6970A6CB |
| html-safe-nonce | eb40b413ba713e5fed9dbc31a2e7640ff4c3bedcf41a0d1795d6f7b7389c1877 |
| visitor-payload | eyJyZWZlcnJlciI6IiIsInJlcXVlc3RfaWQiOiI4MjJDOjI5NTczQzozOEVBODU6NEY4QkIzOjY5NzBBNkNCIiwidmlzaXRvcl9pZCI6IjE4MDM2NTQxMjMxMDY2NDE2MTEiLCJyZWdpb25fZWRnZSI6ImlhZCIsInJlZ2lvbl9yZW5kZXIiOiJpYWQifQ== |
| visitor-hmac | 8e6e5edd71beaec6e034d06874f954e1e02203247a7d22d7624ab84bdcfa8c77 |
| hovercard-subject-tag | issue:1860876877 |
| github-keyboard-shortcuts | repository,issues,copilot |
| google-site-verification | Apib7-x98H0j5cPqHWwSMm6dNU4GmODRoqxLiDzdx9I |
| octolytics-url | https://collector.github.com/github/collect |
| analytics-location | / |
| fb:app_id | 1401488693436528 |
| apple-itunes-app | app-id=1477376905, app-argument=https://github.com/_view_fragments/issues/show/bigcode-project/starcoder.cpp/32/issue_layout |
| twitter:image | https://opengraph.githubassets.com/db924d9f4a3ca0d1b2f41af08f044f29a8c4d7f10df4a75252c2301540ccfcdd/bigcode-project/starcoder.cpp/issues/32 |
| twitter:card | summary_large_image |
| og:image | https://opengraph.githubassets.com/db924d9f4a3ca0d1b2f41af08f044f29a8c4d7f10df4a75252c2301540ccfcdd/bigcode-project/starcoder.cpp/issues/32 |
| og:image:alt | When using starchat the model will likely start talking bullshit (in Spanish) after printing out the sequence: <|system|> <|end|> <|user|> I added a rudimentary fix to stop generating new tokens in... |
| og:image:width | 1200 |
| og:image:height | 600 |
| og:site_name | GitHub |
| og:type | object |
| og:author:username | KukumavMozolo |
| hostname | github.com |
| expected-hostname | github.com |
| None | b06a4c45c45fd0bb038b3759265ea6e38211f45d18130bc65261990be6b5972a |
| turbo-cache-control | no-preview |
| go-import | github.com/bigcode-project/starcoder.cpp git https://github.com/bigcode-project/starcoder.cpp.git |
| octolytics-dimension-user_id | 110470554 |
| octolytics-dimension-user_login | bigcode-project |
| octolytics-dimension-repository_id | 640843237 |
| octolytics-dimension-repository_nwo | bigcode-project/starcoder.cpp |
| octolytics-dimension-repository_public | true |
| octolytics-dimension-repository_is_fork | false |
| octolytics-dimension-repository_network_root_id | 640843237 |
| octolytics-dimension-repository_network_root_nwo | bigcode-project/starcoder.cpp |
| turbo-body-classes | logged-out env-production page-responsive |
| disable-turbo | false |
| browser-stats-url | https://api.github.com/_private/browser/stats |
| browser-errors-url | https://api.github.com/_private/browser/errors |
| release | 0e1c4964831785bd64cb22d82e7cf2391ae01f45 |
| ui-target | full |
| theme-color | #1e2327 |
| color-scheme | light dark |
Links:
Viewport: width=device-width