Title: GitHub - ModelTC/SageAttention: Quantized Attention achieves speedup of 2-5x and 3-11x compared to FlashAttention and xformers, without lossing end-to-end metrics across language, image, and video models.
Open Graph Title: GitHub - ModelTC/SageAttention: Quantized Attention achieves speedup of 2-5x and 3-11x compared to FlashAttention and xformers, without lossing end-to-end metrics across language, image, and video models.
X Title: GitHub - ModelTC/SageAttention: Quantized Attention achieves speedup of 2-5x and 3-11x compared to FlashAttention and xformers, without lossing end-to-end metrics across language, image, and video models.
Description: Quantized Attention achieves speedup of 2-5x and 3-11x compared to FlashAttention and xformers, without lossing end-to-end metrics across language, image, and video models. - ModelTC/SageAttention
Open Graph Description: Quantized Attention achieves speedup of 2-5x and 3-11x compared to FlashAttention and xformers, without lossing end-to-end metrics across language, image, and video models. - ModelTC/SageAttention
X Description: Quantized Attention achieves speedup of 2-5x and 3-11x compared to FlashAttention and xformers, without lossing end-to-end metrics across language, image, and video models. - ModelTC/SageAttention
Opengraph URL: https://github.com/ModelTC/SageAttention
X: @github
Domain: patch-diff.githubusercontent.com
| route-pattern | /:user_id/:repository |
| route-controller | files |
| route-action | disambiguate |
| fetch-nonce | v2:562a79d0-eec9-1aa3-937f-9e9fa3448d66 |
| current-catalog-service-hash | f3abb0cc802f3d7b95fc8762b94bdcb13bf39634c40c357301c4aa1d67a256fb |
| request-id | B6AA:3F96B:688C131:8E34F4D:698D06BB |
| html-safe-nonce | fecd8c390c2b72c4c29121376154a839d1c1d7623709b8c55d02d96ffa1949ae |
| visitor-payload | eyJyZWZlcnJlciI6IiIsInJlcXVlc3RfaWQiOiJCNkFBOjNGOTZCOjY4OEMxMzE6OEUzNEY0RDo2OThEMDZCQiIsInZpc2l0b3JfaWQiOiI2ODA1NTQ1NjkzMDI2MDc1NDciLCJyZWdpb25fZWRnZSI6ImlhZCIsInJlZ2lvbl9yZW5kZXIiOiJpYWQifQ== |
| visitor-hmac | 74b9f88cd4719e10ca058bd89e564000e04a7fc1bc2fb0b9aff95967b23440ec |
| hovercard-subject-tag | repository:1038472084 |
| github-keyboard-shortcuts | repository,copilot |
| google-site-verification | Apib7-x98H0j5cPqHWwSMm6dNU4GmODRoqxLiDzdx9I |
| octolytics-url | https://collector.github.com/github/collect |
| analytics-location | / |
| fb:app_id | 1401488693436528 |
| apple-itunes-app | app-id=1477376905, app-argument=https://github.com/ModelTC/SageAttention |
| twitter:image | https://opengraph.githubassets.com/5032db304777a89d0947c3f12a4519571948e4d37d359e09e2e5c0f602e08c8e/ModelTC/SageAttention |
| twitter:card | summary_large_image |
| og:image | https://opengraph.githubassets.com/5032db304777a89d0947c3f12a4519571948e4d37d359e09e2e5c0f602e08c8e/ModelTC/SageAttention |
| og:image:alt | Quantized Attention achieves speedup of 2-5x and 3-11x compared to FlashAttention and xformers, without lossing end-to-end metrics across language, image, and video models. - ModelTC/SageAttention |
| og:image:width | 1200 |
| og:image:height | 600 |
| og:site_name | GitHub |
| og:type | object |
| hostname | github.com |
| expected-hostname | github.com |
| None | f2da95634bce8a94cfa4123788169bfabdf845fd1d790fbaaaaab09dcfebdf28 |
| turbo-cache-control | no-preview |
| go-import | github.com/ModelTC/SageAttention git https://github.com/ModelTC/SageAttention.git |
| octolytics-dimension-user_id | 69665675 |
| octolytics-dimension-user_login | ModelTC |
| octolytics-dimension-repository_id | 1038472084 |
| octolytics-dimension-repository_nwo | ModelTC/SageAttention |
| octolytics-dimension-repository_public | true |
| octolytics-dimension-repository_is_fork | true |
| octolytics-dimension-repository_parent_id | 867007699 |
| octolytics-dimension-repository_parent_nwo | thu-ml/SageAttention |
| octolytics-dimension-repository_network_root_id | 867007699 |
| octolytics-dimension-repository_network_root_nwo | thu-ml/SageAttention |
| turbo-body-classes | logged-out env-production page-responsive |
| disable-turbo | false |
| browser-stats-url | https://api.github.com/_private/browser/stats |
| browser-errors-url | https://api.github.com/_private/browser/errors |
| release | c21843b18feba17d11efb1895a7db61e8672f2cf |
| ui-target | full |
| theme-color | #1e2327 |
| color-scheme | light dark |
Links:
Viewport: width=device-width