Title: Roadmap · Issue #3 · FasterDecoding/Medusa · GitHub
Open Graph Title: Roadmap · Issue #3 · FasterDecoding/Medusa
X Title: Roadmap · Issue #3 · FasterDecoding/Medusa
Description: Roadmap Functionality #36 #39 Distill from any model without access to the original training data Batched inference Fine-grained KV cache management Integration Local Deployment #33 #32 #35 Serving vllm TGI lightllm Research #34 Optimize...
Open Graph Description: Roadmap Functionality #36 #39 Distill from any model without access to the original training data Batched inference Fine-grained KV cache management Integration Local Deployment #33 #32 #35 Serving...
X Description: Roadmap Functionality #36 #39 Distill from any model without access to the original training data Batched inference Fine-grained KV cache management Integration Local Deployment #33 #32 #35 Serving...
Opengraph URL: https://github.com/FasterDecoding/Medusa/issues/3
X: @github
Domain: github.com
{"@context":"https://schema.org","@type":"DiscussionForumPosting","headline":"Roadmap","articleBody":"# Roadmap\r\n\r\n## Functionality\r\n- [x] #36\r\n- [x] #39 \r\n- [ ] Distill from any model without access to the original training data\r\n- [ ] Batched inference\r\n- [ ] Fine-grained KV cache management\r\n\r\n## Integration\r\n### Local Deployment\r\n- [ ] #33\r\n- [ ] #32\r\n- [ ] #35\r\n### Serving\r\n- [ ] [vllm](https://github.com/vllm-project/vllm)\r\n- [ ] [TGI](https://github.com/huggingface/text-generation-inference)\r\n- [ ] [lightllm](https://github.com/ModelTC/lightllm)\r\n\r\n## Research\r\n- [x] #34\r\n- [ ] Optimize the tree-based attention to reduce additional computation\r\n- [ ] Improve the acceptance scheme to generate more diverse sequences","author":{"url":"https://github.com/ctlllll","@type":"Person","name":"ctlllll"},"datePublished":"2023-09-12T01:49:06.000Z","interactionStatistic":{"@type":"InteractionCounter","interactionType":"https://schema.org/CommentAction","userInteractionCount":15},"url":"https://github.com/3/Medusa/issues/3"}
| route-pattern | /_view_fragments/issues/show/:user_id/:repository/:id/issue_layout(.:format) |
| route-controller | voltron_issues_fragments |
| route-action | issue_layout |
| fetch-nonce | v2:51a383c4-c0c8-6d1f-895b-eb711a49bf97 |
| current-catalog-service-hash | 81bb79d38c15960b92d99bca9288a9108c7a47b18f2423d0f6438c5b7bcd2114 |
| request-id | B172:38069C:E4520C:122FE28:697F0D91 |
| html-safe-nonce | c3d7cff74c2a8098364159b383d4120d323008bce433188038e5442daa1cecdd |
| visitor-payload | eyJyZWZlcnJlciI6IiIsInJlcXVlc3RfaWQiOiJCMTcyOjM4MDY5QzpFNDUyMEM6MTIyRkUyODo2OTdGMEQ5MSIsInZpc2l0b3JfaWQiOiI4ODMyMTQ5NzMzOTg5MzU0ODk3IiwicmVnaW9uX2VkZ2UiOiJpYWQiLCJyZWdpb25fcmVuZGVyIjoiaWFkIn0= |
| visitor-hmac | 886e2e88962117501100a26149c313f51d90519109bb6dfad59284654c737a26 |
| hovercard-subject-tag | issue:1891531114 |
| github-keyboard-shortcuts | repository,issues,copilot |
| google-site-verification | Apib7-x98H0j5cPqHWwSMm6dNU4GmODRoqxLiDzdx9I |
| octolytics-url | https://collector.github.com/github/collect |
| analytics-location | / |
| fb:app_id | 1401488693436528 |
| apple-itunes-app | app-id=1477376905, app-argument=https://github.com/_view_fragments/issues/show/FasterDecoding/Medusa/3/issue_layout |
| twitter:image | https://opengraph.githubassets.com/b5669fff975636d2cf76fe8c3994a067cd831749c352b0bdea5e8a9b1fa5ae58/FasterDecoding/Medusa/issues/3 |
| twitter:card | summary_large_image |
| og:image | https://opengraph.githubassets.com/b5669fff975636d2cf76fe8c3994a067cd831749c352b0bdea5e8a9b1fa5ae58/FasterDecoding/Medusa/issues/3 |
| og:image:alt | Roadmap Functionality #36 #39 Distill from any model without access to the original training data Batched inference Fine-grained KV cache management Integration Local Deployment #33 #32 #35 Serving... |
| og:image:width | 1200 |
| og:image:height | 600 |
| og:site_name | GitHub |
| og:type | object |
| og:author:username | ctlllll |
| hostname | github.com |
| expected-hostname | github.com |
| None | 60279d4097367e16897439d16d6bbe4180663db828c666eeed2656988ffe59f6 |
| turbo-cache-control | no-preview |
| go-import | github.com/FasterDecoding/Medusa git https://github.com/FasterDecoding/Medusa.git |
| octolytics-dimension-user_id | 144572371 |
| octolytics-dimension-user_login | FasterDecoding |
| octolytics-dimension-repository_id | 689541424 |
| octolytics-dimension-repository_nwo | FasterDecoding/Medusa |
| octolytics-dimension-repository_public | true |
| octolytics-dimension-repository_is_fork | false |
| octolytics-dimension-repository_network_root_id | 689541424 |
| octolytics-dimension-repository_network_root_nwo | FasterDecoding/Medusa |
| turbo-body-classes | logged-out env-production page-responsive |
| disable-turbo | false |
| browser-stats-url | https://api.github.com/_private/browser/stats |
| browser-errors-url | https://api.github.com/_private/browser/errors |
| release | 7c85641c598ad130c74f7bcc27f58575cac69551 |
| ui-target | full |
| theme-color | #1e2327 |
| color-scheme | light dark |
Links:
Viewport: width=device-width