Title: GitHub - cmd2001/KVTuner: [ICML2025] KVTuner: Sensitivity-Aware Layer-wise Mixed Precision KV Cache Quantization for Efficient and Nearly Lossless LLM Inference
Open Graph Title: GitHub - cmd2001/KVTuner: [ICML2025] KVTuner: Sensitivity-Aware Layer-wise Mixed Precision KV Cache Quantization for Efficient and Nearly Lossless LLM Inference
X Title: GitHub - cmd2001/KVTuner: [ICML2025] KVTuner: Sensitivity-Aware Layer-wise Mixed Precision KV Cache Quantization for Efficient and Nearly Lossless LLM Inference
Description: [ICML2025] KVTuner: Sensitivity-Aware Layer-wise Mixed Precision KV Cache Quantization for Efficient and Nearly Lossless LLM Inference - cmd2001/KVTuner
Open Graph Description: [ICML2025] KVTuner: Sensitivity-Aware Layer-wise Mixed Precision KV Cache Quantization for Efficient and Nearly Lossless LLM Inference - cmd2001/KVTuner
X Description: [ICML2025] KVTuner: Sensitivity-Aware Layer-wise Mixed Precision KV Cache Quantization for Efficient and Nearly Lossless LLM Inference - cmd2001/KVTuner
Opengraph URL: https://github.com/cmd2001/KVTuner
X: @github
Domain: patch-diff.githubusercontent.com
| route-pattern | /:user_id/:repository |
| route-controller | files |
| route-action | disambiguate |
| fetch-nonce | v2:1526a090-c254-b617-1e91-24cd9af98c8e |
| current-catalog-service-hash | f3abb0cc802f3d7b95fc8762b94bdcb13bf39634c40c357301c4aa1d67a256fb |
| request-id | ED40:1D1BED:2584AD4:353F354:6978F4EB |
| html-safe-nonce | c8345cb780837cd6c9224f8d6e9a98ff7b18e99a065960bde8582bdf01ec501d |
| visitor-payload | eyJyZWZlcnJlciI6IiIsInJlcXVlc3RfaWQiOiJFRDQwOjFEMUJFRDoyNTg0QUQ0OjM1M0YzNTQ6Njk3OEY0RUIiLCJ2aXNpdG9yX2lkIjoiNjM0MjM4MDc5MzM0MzA0Njg5MSIsInJlZ2lvbl9lZGdlIjoiaWFkIiwicmVnaW9uX3JlbmRlciI6ImlhZCJ9 |
| visitor-hmac | 13428a47ffa0eb9222db709903481d0a1261aefbafcba7ab4ad090688cdead90 |
| hovercard-subject-tag | repository:881250732 |
| github-keyboard-shortcuts | repository,copilot |
| google-site-verification | Apib7-x98H0j5cPqHWwSMm6dNU4GmODRoqxLiDzdx9I |
| octolytics-url | https://collector.github.com/github/collect |
| analytics-location | / |
| fb:app_id | 1401488693436528 |
| apple-itunes-app | app-id=1477376905, app-argument=https://github.com/cmd2001/KVTuner |
| twitter:image | https://opengraph.githubassets.com/8c9ffdc6451b82c3ebaf67158f4c2499de249600f7e45a5db49a6401ecf52b5a/cmd2001/KVTuner |
| twitter:card | summary_large_image |
| og:image | https://opengraph.githubassets.com/8c9ffdc6451b82c3ebaf67158f4c2499de249600f7e45a5db49a6401ecf52b5a/cmd2001/KVTuner |
| og:image:alt | [ICML2025] KVTuner: Sensitivity-Aware Layer-wise Mixed Precision KV Cache Quantization for Efficient and Nearly Lossless LLM Inference - cmd2001/KVTuner |
| og:image:width | 1200 |
| og:image:height | 600 |
| og:site_name | GitHub |
| og:type | object |
| hostname | github.com |
| expected-hostname | github.com |
| None | 6d9b384ce19e0ac9756580c70eae9a3359939e77ff05c731561090d13085b2b7 |
| turbo-cache-control | no-preview |
| go-import | github.com/cmd2001/KVTuner git https://github.com/cmd2001/KVTuner.git |
| octolytics-dimension-user_id | 25078724 |
| octolytics-dimension-user_login | cmd2001 |
| octolytics-dimension-repository_id | 881250732 |
| octolytics-dimension-repository_nwo | cmd2001/KVTuner |
| octolytics-dimension-repository_public | true |
| octolytics-dimension-repository_is_fork | false |
| octolytics-dimension-repository_network_root_id | 881250732 |
| octolytics-dimension-repository_network_root_nwo | cmd2001/KVTuner |
| turbo-body-classes | logged-out env-production page-responsive |
| disable-turbo | false |
| browser-stats-url | https://api.github.com/_private/browser/stats |
| browser-errors-url | https://api.github.com/_private/browser/errors |
| release | 56ad33f6fdc0d0fb49c96b3c46ed74c55d926471 |
| ui-target | canary-1 |
| theme-color | #1e2327 |
| color-scheme | light dark |
Links:
Viewport: width=device-width