| route-pattern | /_view_fragments/voltron/pull_requests/show/:user_id/:repository/:id/pull_request_layout(.:format) |
| route-controller | voltron_pull_requests_fragments |
| route-action | pull_request_layout |
| fetch-nonce | v2:8317cd4c-c88e-46e6-56b6-04ac793765ea |
| current-catalog-service-hash | ae870bc5e265a340912cde392f23dad3671a0a881730ffdadd82f2f57d81641b |
| request-id | AF30:E0371:2D70DA:3B620C:698D0876 |
| html-safe-nonce | 51c5648f57921c66781f941f70394f1cd137871b14f51729c0f950aa14741ca4 |
| visitor-payload | eyJyZWZlcnJlciI6IiIsInJlcXVlc3RfaWQiOiJBRjMwOkUwMzcxOjJENzBEQTozQjYyMEM6Njk4RDA4NzYiLCJ2aXNpdG9yX2lkIjoiNTEyODgxNDI0NzQzODc4MDUzNCIsInJlZ2lvbl9lZGdlIjoiaWFkIiwicmVnaW9uX3JlbmRlciI6ImlhZCJ9 |
| visitor-hmac | 9f35d8ff48ca590d7bfcac82fe76bb5e0ad084d57c0d369b875d44c67c36cf23 |
| hovercard-subject-tag | pull_request:1990194340 |
| github-keyboard-shortcuts | repository,pull-request-list,pull-request-conversation,pull-request-files-changed,copilot |
| google-site-verification | Apib7-x98H0j5cPqHWwSMm6dNU4GmODRoqxLiDzdx9I |
| octolytics-url | https://collector.github.com/github/collect |
| analytics-location | ///voltron/pull_requests_fragments/pull_request_layout |
| fb:app_id | 1401488693436528 |
| apple-itunes-app | app-id=1477376905, app-argument=https://github.com/_view_fragments/voltron/pull_requests/show/AnswerDotAI/gpu.cpp/22/pull_request_layout |
| twitter:image | https://opengraph.githubassets.com/c0ebcdfa1fbb14f0fcf8b1ae73478e5ad81ee09d1d8cec8dcd547a4b4003b93d/AnswerDotAI/gpu.cpp/pull/22 |
| twitter:card | summary_large_image |
| og:image | https://opengraph.githubassets.com/c0ebcdfa1fbb14f0fcf8b1ae73478e5ad81ee09d1d8cec8dcd547a4b4003b93d/AnswerDotAI/gpu.cpp/pull/22 |
| og:image:alt | This implements transpose kernels in https://developer.nvidia.com/blog/efficient-matrix-transpose-cuda-cc/ .
The result of M2 pro is as follows:
Version
GB/s
Naive Matrix Transpose (version ... |
| og:image:width | 1200 |
| og:image:height | 600 |
| og:site_name | GitHub |
| og:type | object |
| og:author:username | junjihashimoto |
| hostname | github.com |
| expected-hostname | github.com |
| None | f2da95634bce8a94cfa4123788169bfabdf845fd1d790fbaaaaab09dcfebdf28 |
| turbo-cache-control | no-preview |
| go-import | github.com/AnswerDotAI/gpu.cpp git https://github.com/AnswerDotAI/gpu.cpp.git |
| octolytics-dimension-user_id | 156509747 |
| octolytics-dimension-user_login | AnswerDotAI |
| octolytics-dimension-repository_id | 808280286 |
| octolytics-dimension-repository_nwo | AnswerDotAI/gpu.cpp |
| octolytics-dimension-repository_public | true |
| octolytics-dimension-repository_is_fork | false |
| octolytics-dimension-repository_network_root_id | 808280286 |
| octolytics-dimension-repository_network_root_nwo | AnswerDotAI/gpu.cpp |
| turbo-body-classes | logged-out env-production page-responsive |
| disable-turbo | false |
| browser-stats-url | https://api.github.com/_private/browser/stats |
| browser-errors-url | https://api.github.com/_private/browser/errors |
| release | c21843b18feba17d11efb1895a7db61e8672f2cf |
| ui-target | full |
| theme-color | #1e2327 |
| color-scheme | light dark |
| Skip to content | https://github.com/AnswerDotAI/gpu.cpp/pull/22#start-of-content |
|
| https://github.com/ |
|
Sign in
| https://github.com/login?return_to=https%3A%2F%2Fgithub.com%2FAnswerDotAI%2Fgpu.cpp%2Fpull%2F22 |
| GitHub CopilotWrite better code with AI | https://github.com/features/copilot |
| GitHub SparkBuild and deploy intelligent apps | https://github.com/features/spark |
| GitHub ModelsManage and compare prompts | https://github.com/features/models |
| MCP RegistryNewIntegrate external tools | https://github.com/mcp |
| ActionsAutomate any workflow | https://github.com/features/actions |
| CodespacesInstant dev environments | https://github.com/features/codespaces |
| IssuesPlan and track work | https://github.com/features/issues |
| Code ReviewManage code changes | https://github.com/features/code-review |
| GitHub Advanced SecurityFind and fix vulnerabilities | https://github.com/security/advanced-security |
| Code securitySecure your code as you build | https://github.com/security/advanced-security/code-security |
| Secret protectionStop leaks before they start | https://github.com/security/advanced-security/secret-protection |
| Why GitHub | https://github.com/why-github |
| Documentation | https://docs.github.com |
| Blog | https://github.blog |
| Changelog | https://github.blog/changelog |
| Marketplace | https://github.com/marketplace |
| View all features | https://github.com/features |
| Enterprises | https://github.com/enterprise |
| Small and medium teams | https://github.com/team |
| Startups | https://github.com/enterprise/startups |
| Nonprofits | https://github.com/solutions/industry/nonprofits |
| App Modernization | https://github.com/solutions/use-case/app-modernization |
| DevSecOps | https://github.com/solutions/use-case/devsecops |
| DevOps | https://github.com/solutions/use-case/devops |
| CI/CD | https://github.com/solutions/use-case/ci-cd |
| View all use cases | https://github.com/solutions/use-case |
| Healthcare | https://github.com/solutions/industry/healthcare |
| Financial services | https://github.com/solutions/industry/financial-services |
| Manufacturing | https://github.com/solutions/industry/manufacturing |
| Government | https://github.com/solutions/industry/government |
| View all industries | https://github.com/solutions/industry |
| View all solutions | https://github.com/solutions |
| AI | https://github.com/resources/articles?topic=ai |
| Software Development | https://github.com/resources/articles?topic=software-development |
| DevOps | https://github.com/resources/articles?topic=devops |
| Security | https://github.com/resources/articles?topic=security |
| View all topics | https://github.com/resources/articles |
| Customer stories | https://github.com/customer-stories |
| Events & webinars | https://github.com/resources/events |
| Ebooks & reports | https://github.com/resources/whitepapers |
| Business insights | https://github.com/solutions/executive-insights |
| GitHub Skills | https://skills.github.com |
| Documentation | https://docs.github.com |
| Customer support | https://support.github.com |
| Community forum | https://github.com/orgs/community/discussions |
| Trust center | https://github.com/trust-center |
| Partners | https://github.com/partners |
| GitHub SponsorsFund open source developers | https://github.com/sponsors |
| Security Lab | https://securitylab.github.com |
| Maintainer Community | https://maintainers.github.com |
| Accelerator | https://github.com/accelerator |
| Archive Program | https://archiveprogram.github.com |
| Topics | https://github.com/topics |
| Trending | https://github.com/trending |
| Collections | https://github.com/collections |
| Enterprise platformAI-powered developer platform | https://github.com/enterprise |
| GitHub Advanced SecurityEnterprise-grade security features | https://github.com/security/advanced-security |
| Copilot for BusinessEnterprise-grade AI features | https://github.com/features/copilot/copilot-business |
| Premium SupportEnterprise-grade 24/7 support | https://github.com/premium-support |
| Pricing | https://github.com/pricing |
| Search syntax tips | https://docs.github.com/search-github/github-code-search/understanding-github-code-search-syntax |
| documentation | https://docs.github.com/search-github/github-code-search/understanding-github-code-search-syntax |
|
Sign in
| https://github.com/login?return_to=https%3A%2F%2Fgithub.com%2FAnswerDotAI%2Fgpu.cpp%2Fpull%2F22 |
|
Sign up
| https://github.com/signup?ref_cta=Sign+up&ref_loc=header+logged+out&ref_page=%2F%3Cuser-name%3E%2F%3Crepo-name%3E%2Fvoltron%2Fpull_requests_fragments%2Fpull_request_layout&source=header-repo&source_repo=AnswerDotAI%2Fgpu.cpp |
| Reload | https://github.com/AnswerDotAI/gpu.cpp/pull/22 |
| Reload | https://github.com/AnswerDotAI/gpu.cpp/pull/22 |
| Reload | https://github.com/AnswerDotAI/gpu.cpp/pull/22 |
|
AnswerDotAI
| https://github.com/AnswerDotAI |
| gpu.cpp | https://github.com/AnswerDotAI/gpu.cpp |
|
Notifications
| https://github.com/login?return_to=%2FAnswerDotAI%2Fgpu.cpp |
|
Fork
189
| https://github.com/login?return_to=%2FAnswerDotAI%2Fgpu.cpp |
|
Star
3.9k
| https://github.com/login?return_to=%2FAnswerDotAI%2Fgpu.cpp |
|
Code
| https://github.com/AnswerDotAI/gpu.cpp |
|
Issues
8
| https://github.com/AnswerDotAI/gpu.cpp/issues |
|
Pull requests
1
| https://github.com/AnswerDotAI/gpu.cpp/pulls |
|
Actions
| https://github.com/AnswerDotAI/gpu.cpp/actions |
|
Projects
1
| https://github.com/AnswerDotAI/gpu.cpp/projects |
|
Wiki
| https://github.com/AnswerDotAI/gpu.cpp/wiki |
|
Security
0
| https://github.com/AnswerDotAI/gpu.cpp/security |
|
Insights
| https://github.com/AnswerDotAI/gpu.cpp/pulse |
|
Code
| https://github.com/AnswerDotAI/gpu.cpp |
|
Issues
| https://github.com/AnswerDotAI/gpu.cpp/issues |
|
Pull requests
| https://github.com/AnswerDotAI/gpu.cpp/pulls |
|
Actions
| https://github.com/AnswerDotAI/gpu.cpp/actions |
|
Projects
| https://github.com/AnswerDotAI/gpu.cpp/projects |
|
Wiki
| https://github.com/AnswerDotAI/gpu.cpp/wiki |
|
Security
| https://github.com/AnswerDotAI/gpu.cpp/security |
|
Insights
| https://github.com/AnswerDotAI/gpu.cpp/pulse |
| Sign up for GitHub
| https://github.com/signup?return_to=%2FAnswerDotAI%2Fgpu.cpp%2Fissues%2Fnew%2Fchoose |
| terms of service | https://docs.github.com/terms |
| privacy statement | https://docs.github.com/privacy |
| Sign in | https://github.com/login?return_to=%2FAnswerDotAI%2Fgpu.cpp%2Fissues%2Fnew%2Fchoose |
| Jump to bottom | https://github.com/AnswerDotAI/gpu.cpp/pull/22#issue-comment-box |
| austinvhuang | https://github.com/austinvhuang |
| AnswerDotAI:main | https://github.com/AnswerDotAI/gpu.cpp/tree/main |
| junjihashimoto:feature/transpose | https://github.com/junjihashimoto/gpu.cpp/tree/feature/transpose |
|
Add transpose kernels
| https://github.com/AnswerDotAI/gpu.cpp/pull/22#top |
| austinvhuang | https://github.com/austinvhuang |
| AnswerDotAI:main | https://github.com/AnswerDotAI/gpu.cpp/tree/main |
| junjihashimoto:feature/transpose | https://github.com/junjihashimoto/gpu.cpp/tree/feature/transpose |
|
Conversation
4
| https://github.com/AnswerDotAI/gpu.cpp/pull/22 |
|
Commits
1
| https://github.com/AnswerDotAI/gpu.cpp/pull/22/commits |
|
Checks
0
| https://github.com/AnswerDotAI/gpu.cpp/pull/22/checks |
|
Files changed
| https://github.com/AnswerDotAI/gpu.cpp/pull/22/files |
| Please reload this page | https://github.com/AnswerDotAI/gpu.cpp/pull/22 |
| https://github.co/hiddenchars |
| https://github.com/AnswerDotAI/gpu.cpp/pull/{{ revealButtonHref }} |
|
| https://github.com/junjihashimoto |
| junjihashimoto | https://github.com/junjihashimoto |
| Jul 26, 2024 | https://github.com/AnswerDotAI/gpu.cpp/pull/22#issue-2433068796 |
| https://developer.nvidia.com/blog/efficient-matrix-transpose-cuda-cc/ | https://developer.nvidia.com/blog/efficient-matrix-transpose-cuda-cc/ |
| Please reload this page | https://github.com/AnswerDotAI/gpu.cpp/pull/22 |
| https://github.com/junjihashimoto |
| junjihashimoto | https://github.com/junjihashimoto |
| force-pushed | https://github.com/AnswerDotAI/gpu.cpp/compare/f6ddfbc7300b5310b2e4ce818e09124235146f2b..e66716f05099a0f518b845b4b5dc20238832e3c5 |
| f6ddfbc | https://github.com/AnswerDotAI/gpu.cpp/commit/f6ddfbc7300b5310b2e4ce818e09124235146f2b |
| e66716f | https://github.com/AnswerDotAI/gpu.cpp/commit/e66716f05099a0f518b845b4b5dc20238832e3c5 |
|
Compare
| https://github.com/AnswerDotAI/gpu.cpp/compare/f6ddfbc7300b5310b2e4ce818e09124235146f2b..e66716f05099a0f518b845b4b5dc20238832e3c5 |
| July 26, 2024 22:46 | https://github.com/AnswerDotAI/gpu.cpp/pull/22#event-13666623117 |
| https://github.com/austinvhuang |
| austinvhuang | https://github.com/austinvhuang |
| Jul 27, 2024 | https://github.com/AnswerDotAI/gpu.cpp/pull/22#issuecomment-2253729037 |
| Please reload this page | https://github.com/AnswerDotAI/gpu.cpp/pull/22 |
| https://github.com/junjihashimoto |
| junjihashimoto | https://github.com/junjihashimoto |
| force-pushed | https://github.com/AnswerDotAI/gpu.cpp/compare/e66716f05099a0f518b845b4b5dc20238832e3c5..1ce34a67ea6b6f0e550b6e324465e9ab367f8e87 |
| e66716f | https://github.com/AnswerDotAI/gpu.cpp/commit/e66716f05099a0f518b845b4b5dc20238832e3c5 |
| 1ce34a6 | https://github.com/AnswerDotAI/gpu.cpp/commit/1ce34a67ea6b6f0e550b6e324465e9ab367f8e87 |
|
Compare
| https://github.com/AnswerDotAI/gpu.cpp/compare/e66716f05099a0f518b845b4b5dc20238832e3c5..1ce34a67ea6b6f0e550b6e324465e9ab367f8e87 |
| July 28, 2024 01:58 | https://github.com/AnswerDotAI/gpu.cpp/pull/22#event-13669868191 |
|
| https://github.com/junjihashimoto |
| Add transpose | https://github.com/AnswerDotAI/gpu.cpp/pull/22/commits/b1da604a46fafc4557ac1a4d44d1cf415dcb0d3b |
| b1da604 | https://github.com/AnswerDotAI/gpu.cpp/pull/22/commits/b1da604a46fafc4557ac1a4d44d1cf415dcb0d3b |
| https://github.com/junjihashimoto |
| junjihashimoto | https://github.com/junjihashimoto |
| force-pushed | https://github.com/AnswerDotAI/gpu.cpp/compare/1ce34a67ea6b6f0e550b6e324465e9ab367f8e87..b1da604a46fafc4557ac1a4d44d1cf415dcb0d3b |
| 1ce34a6 | https://github.com/AnswerDotAI/gpu.cpp/commit/1ce34a67ea6b6f0e550b6e324465e9ab367f8e87 |
| b1da604 | https://github.com/AnswerDotAI/gpu.cpp/commit/b1da604a46fafc4557ac1a4d44d1cf415dcb0d3b |
|
Compare
| https://github.com/AnswerDotAI/gpu.cpp/compare/1ce34a67ea6b6f0e550b6e324465e9ab367f8e87..b1da604a46fafc4557ac1a4d44d1cf415dcb0d3b |
| July 28, 2024 02:28 | https://github.com/AnswerDotAI/gpu.cpp/pull/22#event-13669902582 |
| https://github.com/junjihashimoto |
| junjihashimoto | https://github.com/junjihashimoto |
|
Jul 28, 2024
| https://github.com/AnswerDotAI/gpu.cpp/pull/22#pullrequestreview-2203257603 |
|
View reviewed changes
| https://github.com/AnswerDotAI/gpu.cpp/pull/22/files/b1da604a46fafc4557ac1a4d44d1cf415dcb0d3b |
| Makefile | https://github.com/AnswerDotAI/gpu.cpp/pull/22/files/b1da604a46fafc4557ac1a4d44d1cf415dcb0d3b#diff-76ed074a9305c04054cdebb9e9aad2d818052b07091de1f20cad0bbac34ffb52 |
| junjihashimoto | https://github.com/junjihashimoto |
| Jul 28, 2024 | https://github.com/AnswerDotAI/gpu.cpp/pull/22#discussion_r1694049717 |
| Please reload this page | https://github.com/AnswerDotAI/gpu.cpp/pull/22 |
| Learn more | https://docs.github.com/articles/managing-disruptive-comments/#hiding-a-comment |
| Please reload this page | https://github.com/AnswerDotAI/gpu.cpp/pull/22 |
| https://github.com/junjihashimoto |
| junjihashimoto | https://github.com/junjihashimoto |
| Jul 28, 2024 | https://github.com/AnswerDotAI/gpu.cpp/pull/22#issuecomment-2254325508 |
| Please reload this page | https://github.com/AnswerDotAI/gpu.cpp/pull/22 |
| https://github.com/austinvhuang |
| austinvhuang | https://github.com/austinvhuang |
| Jul 29, 2024 | https://github.com/AnswerDotAI/gpu.cpp/pull/22#issuecomment-2256076932 |
| #27 | https://github.com/AnswerDotAI/gpu.cpp/pull/27 |
| Please reload this page | https://github.com/AnswerDotAI/gpu.cpp/pull/22 |
| https://github.com/austinvhuang |
| austinvhuang | https://github.com/austinvhuang |
| 47a85b7 | https://github.com/AnswerDotAI/gpu.cpp/commit/47a85b7441a2caa1ba8d3dcdc54ec3ed8b59ab95 |
| Jul 29, 2024 | https://github.com/AnswerDotAI/gpu.cpp/pull/22#event-13681446948 |
| https://github.com/junjihashimoto |
| junjihashimoto | https://github.com/junjihashimoto |
| July 31, 2024 08:03 | https://github.com/AnswerDotAI/gpu.cpp/pull/22#event-13705932360 |
| Sign up for free | https://github.com/join?source=comment-repo |
| Sign in to comment | https://github.com/login?return_to=https%3A%2F%2Fgithub.com%2FAnswerDotAI%2Fgpu.cpp%2Fpull%2F22 |
| Please reload this page | https://github.com/AnswerDotAI/gpu.cpp/pull/22 |
|
| https://github.com/junjihashimoto |
|
| https://github.com/austinvhuang |
|
| https://github.com |
| Terms | https://docs.github.com/site-policy/github-terms/github-terms-of-service |
| Privacy | https://docs.github.com/site-policy/privacy-policies/github-privacy-statement |
| Security | https://github.com/security |
| Status | https://www.githubstatus.com/ |
| Community | https://github.community/ |
| Docs | https://docs.github.com/ |
| Contact | https://support.github.com?tags=dotcom-footer |