Title: GitHub · Where software is built
Open Graph Title: bigcode-project/starcoder.cpp
Description: C++ implementation for 💫StarCoder. Contribute to bigcode-project/starcoder.cpp development by creating an account on GitHub.
Opengraph URL: https://github.com/bigcode-project/starcoder.cpp
Headline: convert-hf-to-ggml.py: offload_weight() is called with too many arguments
Author: dmeziere (https://github.com/dmeziere)
Date published: 2025-01-06T16:01:27.000Z
Comments: 0
URL: https://github.com/bigcode-project/starcoder.cpp/issues/36

I tried to follow the _Quick start_.

```
$ git clone https://github.com/bigcode-project/starcoder.cpp
Cloning into 'starcoder.cpp'...
remote: Enumerating objects: 110, done.
remote: Counting objects: 100% (110/110), done.
remote: Compressing objects: 100% (78/78), done.
remote: Total 110 (delta 34), reused 93 (delta 25), pack-reused 0 (from 0)
Receiving objects: 100% (110/110), 7.28 MiB | 28.89 MiB/s, done.
Resolving deltas: 100% (34/34), done.
```
```
$ cd starcoder.cpp/
```
```
$ python convert-hf-to-ggml.py bigcode/gpt_bigcode-santacoder
Loading model: bigcode/gpt_bigcode-santacoder
tokenizer_config.json: 100%|██████████| 159/159 [00:00<00:00, 324kB/s]
tokenizer.json: 100%|██████████| 2.08M/2.08M [00:00<00:00, 6.11MB/s]
special_tokens_map.json: 100%|██████████| 138/138 [00:00<00:00, 387kB/s]
Special tokens have been added in the vocabulary, make sure the associated word embeddings are fine-tuned or trained.
config.json: 100%|██████████| 812/812 [00:00<00:00, 2.52MB/s]
model.safetensors: 100%|██████████| 2.25G/2.25G [00:43<00:00, 52.3MB/s]
Traceback (most recent call last):
  File "/mnt/hdd/Data/ia/starcoder.cpp/convert-hf-to-ggml.py", line 58, in <module>
    model = AutoModelForCausalLM.from_pretrained(model_name, config=config, torch_dtype=torch.float16 if use_f16 else torch.float32, low_cpu_mem_usage=True, trust_remote_code=True, offload_state_dict=True)
  File "/home/dmeziere/.local/lib/python3.10/site-packages/transformers/models/auto/auto_factory.py", line 563, in from_pretrained
    return model_class.from_pretrained(
  File "/home/dmeziere/.local/lib/python3.10/site-packages/transformers/modeling_utils.py", line 3507, in from_pretrained
    ) = cls._load_pretrained_model(
  File "/home/dmeziere/.local/lib/python3.10/site-packages/transformers/modeling_utils.py", line 3932, in _load_pretrained_model
    new_error_msgs, offload_index, state_dict_index = _load_state_dict_into_meta_model(
  File "/home/dmeziere/.local/lib/python3.10/site-packages/transformers/modeling_utils.py", line 798, in _load_state_dict_into_meta_model
    state_dict_index = offload_weight(param, param_name, model, state_dict_folder, state_dict_index)
TypeError: offload_weight() takes from 3 to 4 positional arguments but 5 were given
```
Then I tried `make`. It also printed a lot of warnings, but I think it compiled.

```
$ ./quantize models/bigcode/gpt_bigcode-santacoder-ggml.bin models/bigcode/gpt_bigcode-santacoder-ggml-q4_1.bin 3
starcoder_model_quantize: loading model from 'models/bigcode/gpt_bigcode-santacoder-ggml.bin'
starcoder_model_quantize: failed to open 'models/bigcode/gpt_bigcode-santacoder-ggml.bin' for reading
main: failed to quantize model from 'models/bigcode/gpt_bigcode-santacoder-ggml.bin'
```

The model file seems to be missing, probably because `convert-hf-to-ggml.py` failed before it could write it.
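
The `TypeError` in the traceback says the installed `offload_weight()` accepts at most 4 positional arguments, while `modeling_utils.py` passes 5. That pattern usually points to two installed packages whose versions do not match, although the traceback alone does not prove it. A minimal sketch of how to check a callable's arity with `inspect` follows. `offload_weight_stub` is a hypothetical stand-in whose parameter names are made up, and it only mirrors the arity reported in the error message.

```python
import inspect


def max_positional_args(func):
    """Return how many positional arguments func accepts, or None if unbounded (*args)."""
    params = list(inspect.signature(func).parameters.values())
    if any(p.kind is inspect.Parameter.VAR_POSITIONAL for p in params):
        return None
    positional_kinds = (
        inspect.Parameter.POSITIONAL_ONLY,
        inspect.Parameter.POSITIONAL_OR_KEYWORD,
    )
    return sum(p.kind in positional_kinds for p in params)


def accepts_at_most(func, n_args):
    """True if calling func with n_args positional arguments would not exceed its limit."""
    limit = max_positional_args(func)
    return limit is None or n_args <= limit


# Hypothetical stand-in: 3 required parameters plus 1 optional, matching
# "takes from 3 to 4 positional arguments" in the error message.
def offload_weight_stub(weight, weight_name, offload_folder, index=None):
    return index


if __name__ == "__main__":
    print("max positional args:", max_positional_args(offload_weight_stub))
    print("accepts 5 args:", accepts_at_most(offload_weight_stub, 5))
```

To run the same check against the real function, import it from wherever the local `modeling_utils.py` imports it (the traceback does not show that import) and pass it to `max_positional_args`. Comparing the result with the 5-argument call on line 798 shows whether the installed versions disagree.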