Title: GitHub - NVIDIA/Model-Optimizer: A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM, TensorRT, vLLM, etc. to optimize inference speed.
Open Graph Title: GitHub - NVIDIA/Model-Optimizer: A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM, TensorRT, vLLM, etc. to optimize inference speed.
X Title: GitHub - NVIDIA/Model-Optimizer: A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM, TensorRT, vLLM, etc. to optimize inference speed.
Description: A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM, TensorRT, vLLM, etc. to optimize inference speed. - NVIDIA/Model-Optimizer
Open Graph Description: A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks ...
X Description: A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks ...
Opengraph URL: https://github.com/NVIDIA/Model-Optimizer
X: @github
Domain: patch-diff.githubusercontent.com
| route-pattern | /:user_id/:repository |
| route-controller | files |
| route-action | disambiguate |
| fetch-nonce | v2:cf014cc6-9f3e-ddc6-f4b4-8907cee83666 |
| current-catalog-service-hash | f3abb0cc802f3d7b95fc8762b94bdcb13bf39634c40c357301c4aa1d67a256fb |
| request-id | C408:1BFE61:127C26B:17E4D25:6992568C |
| html-safe-nonce | 522a95c33a0a19f55717fdfe12c13d4c2fd7ff29a59138713491bf911f408356 |
| visitor-payload | eyJyZWZlcnJlciI6IiIsInJlcXVlc3RfaWQiOiJDNDA4OjFCRkU2MToxMjdDMjZCOjE3RTREMjU6Njk5MjU2OEMiLCJ2aXNpdG9yX2lkIjoiNDA4MzE0MTIwMjk0OTU5MjcxNyIsInJlZ2lvbl9lZGdlIjoiaWFkIiwicmVnaW9uX3JlbmRlciI6ImlhZCJ9 |
| visitor-hmac | 2f8ad7e1204e6259f86fb9b37d8fa43b0ae3f7849686331c1d7025acfca3c50f |
| hovercard-subject-tag | repository:790916393 |
| github-keyboard-shortcuts | repository,copilot |
| google-site-verification | Apib7-x98H0j5cPqHWwSMm6dNU4GmODRoqxLiDzdx9I |
| octolytics-url | https://collector.github.com/github/collect |
| analytics-location | / |
| fb:app_id | 1401488693436528 |
| apple-itunes-app | app-id=1477376905, app-argument=https://github.com/NVIDIA/Model-Optimizer |
| twitter:image | https://opengraph.githubassets.com/6d2e6a182c803c2c222b54f6b25fcb6d91cdf5cbeedcc03b220204962bfa0340/NVIDIA/Model-Optimizer |
| twitter:card | summary_large_image |
| og:image | https://opengraph.githubassets.com/6d2e6a182c803c2c222b54f6b25fcb6d91cdf5cbeedcc03b220204962bfa0340/NVIDIA/Model-Optimizer |
| og:image:alt | A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks ... |
| og:image:width | 1200 |
| og:image:height | 600 |
| og:site_name | GitHub |
| og:type | object |
| hostname | github.com |
| expected-hostname | github.com |
| None | 42c603b9d642c4a9065a51770f75e5e27132fef0e858607f5c9cb7e422831a7b |
| turbo-cache-control | no-preview |
| go-import | github.com/NVIDIA/Model-Optimizer git https://github.com/NVIDIA/Model-Optimizer.git |
| octolytics-dimension-user_id | 1728152 |
| octolytics-dimension-user_login | NVIDIA |
| octolytics-dimension-repository_id | 790916393 |
| octolytics-dimension-repository_nwo | NVIDIA/Model-Optimizer |
| octolytics-dimension-repository_public | true |
| octolytics-dimension-repository_is_fork | false |
| octolytics-dimension-repository_network_root_id | 790916393 |
| octolytics-dimension-repository_network_root_nwo | NVIDIA/Model-Optimizer |
| turbo-body-classes | logged-out env-production page-responsive |
| disable-turbo | false |
| browser-stats-url | https://api.github.com/_private/browser/stats |
| browser-errors-url | https://api.github.com/_private/browser/errors |
| release | 848bc6032dcc93a9a7301dcc3f379a72ba13b96e |
| ui-target | full |
| theme-color | #1e2327 |
| color-scheme | light dark |
Links:
Viewport: width=device-width