René's URL Explorer Experiment


Title: Add matmul with float16 by junjihashimoto · Pull Request #39 · AnswerDotAI/gpu.cpp · GitHub

Open Graph Title: Add matmul with float16 by junjihashimoto · Pull Request #39 · AnswerDotAI/gpu.cpp

X Title: Add matmul with float16 by junjihashimoto · Pull Request #39 · AnswerDotAI/gpu.cpp

Description: This PR implements matrix multiplication with float16. Mac's memory bandwidth is not enough for maximum floating point performance of the GPU. To improve performance, we need to reduce memory traff...

Open Graph Description: This PR implements matrix multiplication with float16. Mac's memory bandwidth is not enough for maximum floating point performance of the GPU. To improve performance, we need to reduce memory t...

X Description: This PR implements matrix multiplication with float16. Mac's memory bandwidth is not enough for maximum floating point performance of the GPU. To improve performance, we need to reduce memo...

Opengraph URL: https://github.com/AnswerDotAI/gpu.cpp/pull/39

X: @github

direct link

Domain: github.com

route-pattern/_view_fragments/voltron/pull_requests/show/:user_id/:repository/:id/pull_request_layout(.:format)
route-controllervoltron_pull_requests_fragments
route-actionpull_request_layout
fetch-noncev2:1c3e8e97-d85a-f147-1c9c-d63582e864a9
current-catalog-service-hashae870bc5e265a340912cde392f23dad3671a0a881730ffdadd82f2f57d81641b
request-id9620:134E5:27335C:3307B1:698D0C47
html-safe-nonce635271bb5a343808f43fc3ab9a17248a30422162a97be933c63d010895d97c0a
visitor-payloadeyJyZWZlcnJlciI6IiIsInJlcXVlc3RfaWQiOiI5NjIwOjEzNEU1OjI3MzM1QzozMzA3QjE6Njk4RDBDNDciLCJ2aXNpdG9yX2lkIjoiMzA2MjEwOTc3NzQwNjI2NjQzOSIsInJlZ2lvbl9lZGdlIjoiaWFkIiwicmVnaW9uX3JlbmRlciI6ImlhZCJ9
visitor-hmac73f94c82efdf79f21822cccae397b5bdc99bb55dade3f0e871a6fbe975f89861
hovercard-subject-tagpull_request:2003380961
github-keyboard-shortcutsrepository,pull-request-list,pull-request-conversation,pull-request-files-changed,copilot
google-site-verificationApib7-x98H0j5cPqHWwSMm6dNU4GmODRoqxLiDzdx9I
octolytics-urlhttps://collector.github.com/github/collect
analytics-location///voltron/pull_requests_fragments/pull_request_layout
fb:app_id1401488693436528
apple-itunes-appapp-id=1477376905, app-argument=https://github.com/_view_fragments/voltron/pull_requests/show/AnswerDotAI/gpu.cpp/39/pull_request_layout
twitter:imagehttps://opengraph.githubassets.com/713d4384eae07725874524136359a714ab0206ee787c9c6a9b1bf3bee1bc02f0/AnswerDotAI/gpu.cpp/pull/39
twitter:cardsummary_large_image
og:imagehttps://opengraph.githubassets.com/713d4384eae07725874524136359a714ab0206ee787c9c6a9b1bf3bee1bc02f0/AnswerDotAI/gpu.cpp/pull/39
og:image:altThis PR implements matrix multiplication with float16. Mac's memory bandwidth is not enough for maximum floating point performance of the GPU. To improve performance, we need to reduce memory t...
og:image:width1200
og:image:height600
og:site_nameGitHub
og:typeobject
og:author:usernamejunjihashimoto
hostnamegithub.com
expected-hostnamegithub.com
Nonef2da95634bce8a94cfa4123788169bfabdf845fd1d790fbaaaaab09dcfebdf28
turbo-cache-controlno-preview
go-importgithub.com/AnswerDotAI/gpu.cpp git https://github.com/AnswerDotAI/gpu.cpp.git
octolytics-dimension-user_id156509747
octolytics-dimension-user_loginAnswerDotAI
octolytics-dimension-repository_id808280286
octolytics-dimension-repository_nwoAnswerDotAI/gpu.cpp
octolytics-dimension-repository_publictrue
octolytics-dimension-repository_is_forkfalse
octolytics-dimension-repository_network_root_id808280286
octolytics-dimension-repository_network_root_nwoAnswerDotAI/gpu.cpp
turbo-body-classeslogged-out env-production page-responsive
disable-turbofalse
browser-stats-urlhttps://api.github.com/_private/browser/stats
browser-errors-urlhttps://api.github.com/_private/browser/errors
release311e6780106f89302807c2d4e7790575cc939515
ui-targetfull
theme-color#1e2327
color-schemelight dark

Links:

Skip to contenthttps://github.com/AnswerDotAI/gpu.cpp/pull/39#start-of-content
https://github.com/
Sign in https://github.com/login?return_to=https%3A%2F%2Fgithub.com%2FAnswerDotAI%2Fgpu.cpp%2Fpull%2F39
GitHub CopilotWrite better code with AIhttps://github.com/features/copilot
GitHub SparkBuild and deploy intelligent appshttps://github.com/features/spark
GitHub ModelsManage and compare promptshttps://github.com/features/models
MCP RegistryNewIntegrate external toolshttps://github.com/mcp
ActionsAutomate any workflowhttps://github.com/features/actions
CodespacesInstant dev environmentshttps://github.com/features/codespaces
IssuesPlan and track workhttps://github.com/features/issues
Code ReviewManage code changeshttps://github.com/features/code-review
GitHub Advanced SecurityFind and fix vulnerabilitieshttps://github.com/security/advanced-security
Code securitySecure your code as you buildhttps://github.com/security/advanced-security/code-security
Secret protectionStop leaks before they starthttps://github.com/security/advanced-security/secret-protection
Why GitHubhttps://github.com/why-github
Documentationhttps://docs.github.com
Bloghttps://github.blog
Changeloghttps://github.blog/changelog
Marketplacehttps://github.com/marketplace
View all featureshttps://github.com/features
Enterpriseshttps://github.com/enterprise
Small and medium teamshttps://github.com/team
Startupshttps://github.com/enterprise/startups
Nonprofitshttps://github.com/solutions/industry/nonprofits
App Modernizationhttps://github.com/solutions/use-case/app-modernization
DevSecOpshttps://github.com/solutions/use-case/devsecops
DevOpshttps://github.com/solutions/use-case/devops
CI/CDhttps://github.com/solutions/use-case/ci-cd
View all use caseshttps://github.com/solutions/use-case
Healthcarehttps://github.com/solutions/industry/healthcare
Financial serviceshttps://github.com/solutions/industry/financial-services
Manufacturinghttps://github.com/solutions/industry/manufacturing
Governmenthttps://github.com/solutions/industry/government
View all industrieshttps://github.com/solutions/industry
View all solutionshttps://github.com/solutions
AIhttps://github.com/resources/articles?topic=ai
Software Developmenthttps://github.com/resources/articles?topic=software-development
DevOpshttps://github.com/resources/articles?topic=devops
Securityhttps://github.com/resources/articles?topic=security
View all topicshttps://github.com/resources/articles
Customer storieshttps://github.com/customer-stories
Events & webinarshttps://github.com/resources/events
Ebooks & reportshttps://github.com/resources/whitepapers
Business insightshttps://github.com/solutions/executive-insights
GitHub Skillshttps://skills.github.com
Documentationhttps://docs.github.com
Customer supporthttps://support.github.com
Community forumhttps://github.com/orgs/community/discussions
Trust centerhttps://github.com/trust-center
Partnershttps://github.com/partners
GitHub SponsorsFund open source developershttps://github.com/sponsors
Security Labhttps://securitylab.github.com
Maintainer Communityhttps://maintainers.github.com
Acceleratorhttps://github.com/accelerator
Archive Programhttps://archiveprogram.github.com
Topicshttps://github.com/topics
Trendinghttps://github.com/trending
Collectionshttps://github.com/collections
Enterprise platformAI-powered developer platformhttps://github.com/enterprise
GitHub Advanced SecurityEnterprise-grade security featureshttps://github.com/security/advanced-security
Copilot for BusinessEnterprise-grade AI featureshttps://github.com/features/copilot/copilot-business
Premium SupportEnterprise-grade 24/7 supporthttps://github.com/premium-support
Pricinghttps://github.com/pricing
Search syntax tipshttps://docs.github.com/search-github/github-code-search/understanding-github-code-search-syntax
documentationhttps://docs.github.com/search-github/github-code-search/understanding-github-code-search-syntax
Sign in https://github.com/login?return_to=https%3A%2F%2Fgithub.com%2FAnswerDotAI%2Fgpu.cpp%2Fpull%2F39
Sign up https://github.com/signup?ref_cta=Sign+up&ref_loc=header+logged+out&ref_page=%2F%3Cuser-name%3E%2F%3Crepo-name%3E%2Fvoltron%2Fpull_requests_fragments%2Fpull_request_layout&source=header-repo&source_repo=AnswerDotAI%2Fgpu.cpp
Reloadhttps://github.com/AnswerDotAI/gpu.cpp/pull/39
Reloadhttps://github.com/AnswerDotAI/gpu.cpp/pull/39
Reloadhttps://github.com/AnswerDotAI/gpu.cpp/pull/39
AnswerDotAI https://github.com/AnswerDotAI
gpu.cpphttps://github.com/AnswerDotAI/gpu.cpp
Notifications https://github.com/login?return_to=%2FAnswerDotAI%2Fgpu.cpp
Fork 189 https://github.com/login?return_to=%2FAnswerDotAI%2Fgpu.cpp
Star 3.9k https://github.com/login?return_to=%2FAnswerDotAI%2Fgpu.cpp
Code https://github.com/AnswerDotAI/gpu.cpp
Issues 8 https://github.com/AnswerDotAI/gpu.cpp/issues
Pull requests 1 https://github.com/AnswerDotAI/gpu.cpp/pulls
Actions https://github.com/AnswerDotAI/gpu.cpp/actions
Projects 1 https://github.com/AnswerDotAI/gpu.cpp/projects
Wiki https://github.com/AnswerDotAI/gpu.cpp/wiki
Security 0 https://github.com/AnswerDotAI/gpu.cpp/security
Insights https://github.com/AnswerDotAI/gpu.cpp/pulse
Code https://github.com/AnswerDotAI/gpu.cpp
Issues https://github.com/AnswerDotAI/gpu.cpp/issues
Pull requests https://github.com/AnswerDotAI/gpu.cpp/pulls
Actions https://github.com/AnswerDotAI/gpu.cpp/actions
Projects https://github.com/AnswerDotAI/gpu.cpp/projects
Wiki https://github.com/AnswerDotAI/gpu.cpp/wiki
Security https://github.com/AnswerDotAI/gpu.cpp/security
Insights https://github.com/AnswerDotAI/gpu.cpp/pulse
Sign up for GitHub https://github.com/signup?return_to=%2FAnswerDotAI%2Fgpu.cpp%2Fissues%2Fnew%2Fchoose
terms of servicehttps://docs.github.com/terms
privacy statementhttps://docs.github.com/privacy
Sign inhttps://github.com/login?return_to=%2FAnswerDotAI%2Fgpu.cpp%2Fissues%2Fnew%2Fchoose
Jump to bottomhttps://github.com/AnswerDotAI/gpu.cpp/pull/39#issue-comment-box
austinvhuanghttps://github.com/austinvhuang
AnswerDotAI:mainhttps://github.com/AnswerDotAI/gpu.cpp/tree/main
junjihashimoto:feature/matmul-f16https://github.com/junjihashimoto/gpu.cpp/tree/feature/matmul-f16
Add matmul with float16 https://github.com/AnswerDotAI/gpu.cpp/pull/39#top
austinvhuanghttps://github.com/austinvhuang
AnswerDotAI:mainhttps://github.com/AnswerDotAI/gpu.cpp/tree/main
junjihashimoto:feature/matmul-f16https://github.com/junjihashimoto/gpu.cpp/tree/feature/matmul-f16
Conversation 13 https://github.com/AnswerDotAI/gpu.cpp/pull/39
Commits 1 https://github.com/AnswerDotAI/gpu.cpp/pull/39/commits
Checks 0 https://github.com/AnswerDotAI/gpu.cpp/pull/39/checks
Files changed https://github.com/AnswerDotAI/gpu.cpp/pull/39/files
Please reload this pagehttps://github.com/AnswerDotAI/gpu.cpp/pull/39
https://github.co/hiddenchars
https://github.com/AnswerDotAI/gpu.cpp/pull/{{ revealButtonHref }}
https://github.com/junjihashimoto
junjihashimotohttps://github.com/junjihashimoto
Aug 5, 2024https://github.com/AnswerDotAI/gpu.cpp/pull/39#issue-2447891146
Please reload this pagehttps://github.com/AnswerDotAI/gpu.cpp/pull/39
#40 (comment)https://github.com/AnswerDotAI/gpu.cpp/issues/40#issuecomment-2270142354
Please reload this pagehttps://github.com/AnswerDotAI/gpu.cpp/pull/39
https://github.com/junjihashimoto
junjihashimotohttps://github.com/junjihashimoto
Aug 5, 2024 https://github.com/AnswerDotAI/gpu.cpp/pull/39#pullrequestreview-2219843081
View reviewed changes https://github.com/AnswerDotAI/gpu.cpp/pull/39/files
gpu.hhttps://github.com/AnswerDotAI/gpu.cpp/pull/39/files#diff-aef109183b0d7e22bf3e9f8b79d8195405068bb008fa1cb0ba8f36fc3d663b43
junjihashimotohttps://github.com/junjihashimoto
Aug 5, 2024https://github.com/AnswerDotAI/gpu.cpp/pull/39#discussion_r1704620030
Learn morehttps://docs.github.com/articles/managing-disruptive-comments/#hiding-a-comment
Please reload this pagehttps://github.com/AnswerDotAI/gpu.cpp/pull/39
https://github.com/austinvhuang
austinvhuanghttps://github.com/austinvhuang
Aug 6, 2024 https://github.com/AnswerDotAI/gpu.cpp/pull/39#pullrequestreview-2220111672
View reviewed changes https://github.com/AnswerDotAI/gpu.cpp/pull/39/files
numeric_types/half.hhttps://github.com/AnswerDotAI/gpu.cpp/pull/39/files#diff-67ba853ef485de4e1b6ab9dff7bc3aad50caf61829e515cc1f6277112d090479
austinvhuanghttps://github.com/austinvhuang
Aug 6, 2024https://github.com/AnswerDotAI/gpu.cpp/pull/39#discussion_r1704801192
Learn morehttps://docs.github.com/articles/managing-disruptive-comments/#hiding-a-comment
https://github.com/macton/ps3-archive/blob/master/src/half-precision/Half.v2/int_insn.hhttps://github.com/macton/ps3-archive/blob/master/src/half-precision/Half.v2/int_insn.h
Please reload this pagehttps://github.com/AnswerDotAI/gpu.cpp/pull/39
junjihashimotohttps://github.com/junjihashimoto
Aug 6, 2024https://github.com/AnswerDotAI/gpu.cpp/pull/39#discussion_r1705285468
Learn morehttps://docs.github.com/articles/managing-disruptive-comments/#hiding-a-comment
Please reload this pagehttps://github.com/AnswerDotAI/gpu.cpp/pull/39
https://github.com/austinvhuang
austinvhuanghttps://github.com/austinvhuang
Aug 6, 2024 https://github.com/AnswerDotAI/gpu.cpp/pull/39#pullrequestreview-2220124182
View reviewed changes https://github.com/AnswerDotAI/gpu.cpp/pull/39/files
utils/array_utils.hhttps://github.com/AnswerDotAI/gpu.cpp/pull/39/files#diff-8ffc59a2e9d4066391b6c17ea778a18dc9a9fbca07e336898e239605a79b7072
austinvhuanghttps://github.com/austinvhuang
Aug 6, 2024https://github.com/AnswerDotAI/gpu.cpp/pull/39#discussion_r1704809321
Learn morehttps://docs.github.com/articles/managing-disruptive-comments/#hiding-a-comment
Please reload this pagehttps://github.com/AnswerDotAI/gpu.cpp/pull/39
https://github.com/austinvhuang
austinvhuanghttps://github.com/austinvhuang
Aug 6, 2024 https://github.com/AnswerDotAI/gpu.cpp/pull/39#pullrequestreview-2220130092
View reviewed changes https://github.com/AnswerDotAI/gpu.cpp/pull/39/files
utils/array_utils.hhttps://github.com/AnswerDotAI/gpu.cpp/pull/39/files#diff-8ffc59a2e9d4066391b6c17ea778a18dc9a9fbca07e336898e239605a79b7072
austinvhuanghttps://github.com/austinvhuang
Aug 6, 2024https://github.com/AnswerDotAI/gpu.cpp/pull/39#discussion_r1704812762
Learn morehttps://docs.github.com/articles/managing-disruptive-comments/#hiding-a-comment
Please reload this pagehttps://github.com/AnswerDotAI/gpu.cpp/pull/39
https://github.com/austinvhuang
austinvhuanghttps://github.com/austinvhuang
Aug 6, 2024 https://github.com/AnswerDotAI/gpu.cpp/pull/39#pullrequestreview-2220131122
View reviewed changes https://github.com/AnswerDotAI/gpu.cpp/pull/39/files
gpu.hhttps://github.com/AnswerDotAI/gpu.cpp/pull/39/files#diff-aef109183b0d7e22bf3e9f8b79d8195405068bb008fa1cb0ba8f36fc3d663b43
austinvhuanghttps://github.com/austinvhuang
Aug 6, 2024https://github.com/AnswerDotAI/gpu.cpp/pull/39#discussion_r1704813312
Learn morehttps://docs.github.com/articles/managing-disruptive-comments/#hiding-a-comment
Please reload this pagehttps://github.com/AnswerDotAI/gpu.cpp/pull/39
https://github.com/austinvhuang
austinvhuanghttps://github.com/austinvhuang
Aug 6, 2024 https://github.com/AnswerDotAI/gpu.cpp/pull/39#pullrequestreview-2220132899
View reviewed changes https://github.com/AnswerDotAI/gpu.cpp/pull/39/files
gpu.hhttps://github.com/AnswerDotAI/gpu.cpp/pull/39/files#diff-aef109183b0d7e22bf3e9f8b79d8195405068bb008fa1cb0ba8f36fc3d663b43
austinvhuanghttps://github.com/austinvhuang
Aug 6, 2024https://github.com/AnswerDotAI/gpu.cpp/pull/39#discussion_r1704814455
Learn morehttps://docs.github.com/articles/managing-disruptive-comments/#hiding-a-comment
Please reload this pagehttps://github.com/AnswerDotAI/gpu.cpp/pull/39
https://github.com/austinvhuang
austinvhuanghttps://github.com/austinvhuang
Aug 6, 2024 https://github.com/AnswerDotAI/gpu.cpp/pull/39#pullrequestreview-2220133030
View reviewed changes https://github.com/AnswerDotAI/gpu.cpp/pull/39/files
examples/matmul/run.cpphttps://github.com/AnswerDotAI/gpu.cpp/pull/39/files#diff-42423322ea918928e472613dbcc54b714658d4cbed4a83f47b82b971a7bf01c3
austinvhuanghttps://github.com/austinvhuang
Aug 6, 2024https://github.com/AnswerDotAI/gpu.cpp/pull/39#discussion_r1704814557
Learn morehttps://docs.github.com/articles/managing-disruptive-comments/#hiding-a-comment
Please reload this pagehttps://github.com/AnswerDotAI/gpu.cpp/pull/39
https://github.com/austinvhuang
austinvhuanghttps://github.com/austinvhuang
Aug 6, 2024https://github.com/AnswerDotAI/gpu.cpp/pull/39#issuecomment-2270238686
@ghostplanthttps://github.com/ghostplant
#40 (comment)https://github.com/AnswerDotAI/gpu.cpp/issues/40#issuecomment-2270142354
Please reload this pagehttps://github.com/AnswerDotAI/gpu.cpp/pull/39
https://github.com/austinvhuang
austinvhuanghttps://github.com/austinvhuang
Aug 6, 2024https://github.com/AnswerDotAI/gpu.cpp/pull/39#issuecomment-2270252803
Please reload this pagehttps://github.com/AnswerDotAI/gpu.cpp/pull/39
Please reload this pagehttps://github.com/AnswerDotAI/gpu.cpp/pull/39
https://github.com/junjihashimoto
junjihashimotohttps://github.com/junjihashimoto
Aug 6, 2024https://github.com/AnswerDotAI/gpu.cpp/pull/39#issuecomment-2270869204
@austinvhuanghttps://github.com/austinvhuang
Please reload this pagehttps://github.com/AnswerDotAI/gpu.cpp/pull/39
https://github.com/junjihashimoto
junjihashimotohttps://github.com/junjihashimoto
force-pushedhttps://github.com/AnswerDotAI/gpu.cpp/compare/89e890dddede1f2fdfa0cd0b6be265a0a5f83504..b5ed517d3042a450f56493fee3aee6900fbf8f40
89e890dhttps://github.com/AnswerDotAI/gpu.cpp/commit/89e890dddede1f2fdfa0cd0b6be265a0a5f83504
b5ed517https://github.com/AnswerDotAI/gpu.cpp/commit/b5ed517d3042a450f56493fee3aee6900fbf8f40
Compare https://github.com/AnswerDotAI/gpu.cpp/compare/89e890dddede1f2fdfa0cd0b6be265a0a5f83504..b5ed517d3042a450f56493fee3aee6900fbf8f40
August 8, 2024 20:34https://github.com/AnswerDotAI/gpu.cpp/pull/39#event-13811973180
https://github.com/junjihashimoto
Add matmul with float16https://github.com/AnswerDotAI/gpu.cpp/pull/39/commits/23dd96ea226a0ab653229d4d11477339ad7cdeb7
23dd96ehttps://github.com/AnswerDotAI/gpu.cpp/pull/39/commits/23dd96ea226a0ab653229d4d11477339ad7cdeb7
https://github.com/junjihashimoto
junjihashimotohttps://github.com/junjihashimoto
force-pushedhttps://github.com/AnswerDotAI/gpu.cpp/compare/b5ed517d3042a450f56493fee3aee6900fbf8f40..23dd96ea226a0ab653229d4d11477339ad7cdeb7
b5ed517https://github.com/AnswerDotAI/gpu.cpp/commit/b5ed517d3042a450f56493fee3aee6900fbf8f40
23dd96ehttps://github.com/AnswerDotAI/gpu.cpp/commit/23dd96ea226a0ab653229d4d11477339ad7cdeb7
Compare https://github.com/AnswerDotAI/gpu.cpp/compare/b5ed517d3042a450f56493fee3aee6900fbf8f40..23dd96ea226a0ab653229d4d11477339ad7cdeb7
August 8, 2024 20:46https://github.com/AnswerDotAI/gpu.cpp/pull/39#event-13812074956
https://github.com/austinvhuang
austinvhuanghttps://github.com/austinvhuang
Aug 8, 2024https://github.com/AnswerDotAI/gpu.cpp/pull/39#issuecomment-2276623962
Please reload this pagehttps://github.com/AnswerDotAI/gpu.cpp/pull/39
https://github.com/austinvhuang
austinvhuanghttps://github.com/austinvhuang
228084fhttps://github.com/AnswerDotAI/gpu.cpp/commit/228084f18b2b11f2ccd553a204bb5aea4134efbd
Aug 8, 2024https://github.com/AnswerDotAI/gpu.cpp/pull/39#event-13812076568
https://github.com/junjihashimoto
junjihashimotohttps://github.com/junjihashimoto
Aug 8, 2024https://github.com/AnswerDotAI/gpu.cpp/pull/39#issuecomment-2276659353
Please reload this pagehttps://github.com/AnswerDotAI/gpu.cpp/pull/39
Sign up for freehttps://github.com/join?source=comment-repo
Sign in to commenthttps://github.com/login?return_to=https%3A%2F%2Fgithub.com%2FAnswerDotAI%2Fgpu.cpp%2Fpull%2F39
https://github.com/austinvhuang
austinvhuang https://github.com/austinvhuang
https://github.com/AnswerDotAI/gpu.cpp/pull/39/files/89e890dddede1f2fdfa0cd0b6be265a0a5f83504
Please reload this pagehttps://github.com/AnswerDotAI/gpu.cpp/pull/39
https://github.com/junjihashimoto
https://github.com/austinvhuang
https://github.com
Termshttps://docs.github.com/site-policy/github-terms/github-terms-of-service
Privacyhttps://docs.github.com/site-policy/privacy-policies/github-privacy-statement
Securityhttps://github.com/security
Statushttps://www.githubstatus.com/
Communityhttps://github.community/
Docshttps://docs.github.com/
Contacthttps://support.github.com?tags=dotcom-footer

Viewport: width=device-width


URLs of crawlers that visited me.