René's URL Explorer Experiment


Title: GitHub - Scientific-Computing-Lab/ParallelCodeEstimation: Can Large Language Models Predict Parallel Code Performance?

Description: Can Large Language Models Predict Parallel Code Performance? - Scientific-Computing-Lab/ParallelCodeEstimation

Opengraph URL: https://github.com/Scientific-Computing-Lab/ParallelCodeEstimation

Domain: patch-diff.githubusercontent.com

Links:

Scientific-Computing-Lab https://patch-diff.githubusercontent.com/Scientific-Computing-Lab
ParallelCodeEstimation https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation
BSD-3-Clause license https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation/blob/main/LICENSE
6 stars https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation/stargazers
1 fork https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation/forks
Branches https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation/branches
Tags https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation/tags
Activity https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation/activity
Code https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation
Issues 1 https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation/issues
Pull requests 0 https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation/pulls
Actions https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation/actions
Projects 0 https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation/projects
Security 0 https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation/security
Insights https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation/pulse
4,456 Commits https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation/commits/main/
analysis https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation/tree/main/analysis
dataset-gen https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation/tree/main/dataset-gen
few-shot-examples https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation/tree/main/few-shot-examples
results https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation/tree/main/results
simple-roofline-survey https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation/tree/main/simple-roofline-survey
src https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation/tree/main/src
.gitignore https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation/blob/main/.gitignore
CMakeLists.txt https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation/blob/main/CMakeLists.txt
LICENSE https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation/blob/main/LICENSE
README.md https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation/blob/main/README.md
gatherData.py https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation/blob/main/gatherData.py
requirements.txt https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation/blob/main/requirements.txt
runBuild.sh https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation/blob/main/runBuild.sh
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#modified-hecbench-for-roofline-analysis
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#building
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#common-build-issues
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#gathering-roofline-data
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#scraping-the-cuda-kernels
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#building-the-llm-dataset
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#dataset-visualization-todo
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#limitations
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#future-less-important-features-todo
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#hecbench
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#background-use-cases-and-future-work
https://ieeexplore.ieee.org/document/10158214
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#software-installation
AMD ROCm https://rocmdocs.amd.com/en/latest/Installation_Guide/Installation-Guide.html
Intel DPC++ compiler https://github.com/intel/llvm/blob/sycl/sycl/doc/GetStartedGuide.md
Intel oneAPI toolkit https://software.intel.com/content/www/us/en/develop/articles/installation-guide-for-intel-oneapi-toolkits.html
NVIDIA HPC SDK https://developer.nvidia.com/hpc-sdk
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#dependencies
oneDPL https://github.com/oneapi-src/oneDPL
oneTBB https://github.com/oneapi-src/oneTBB
Syclomatic https://github.com/oneapi-src/SYCLomatic
oneMKL interfaces https://github.com/oneapi-src/oneMKL
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#benchmark-categories
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#automotive
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#bandwidth
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#bioinformatics
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#computer-vision-and-image-processing
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#cryptography
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#data-compression-and-reduction
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#data-encoding-decoding-or-verification
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#finance
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#geoscience
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#graph-and-tree
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#language-and-kernel-features
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#machine-learning
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#math
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#random-number-generation
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#search
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#signal-processing
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#simulation
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#sorting
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#robotics
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#run-a-benchmark
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#dataset
http://lava.cs.virginia.edu/Rodinia/download.htm
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#known-issues
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#emulation
FP64 emulation https://github.com/intel/compute-runtime/blob/master/opencl/doc/FAQ.md#feature-double-precision-emulation-fp64
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#feedback-from-the-papers
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#experimental-results
here https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation/blob/main/results/README.md
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#reference
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#accuracy-cuda
https://pytorch.org/
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#ace-cuda
https://github.com/myousefi2016/Allen-Cahn-CUDA
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#adam-cuda
https://github.com/hpcaitech/ColossalAI
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#adamw-cuda
https://github.com/lessw2020/QuantFour_AdamW_Cuda
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#addbiasqkv-cuda
https://github.com/NVIDIA/FasterTransformer
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#addbiasresiduallayernorm-cuda
https://github.com/NVIDIA/FasterTransformer
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#adv-cuda
https://github.com/Nek5000/nekBench/tree/master/adv
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#aes-opencl
https://github.com/Multi2Sim/m2s-bench-amdsdk-2.5-src
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#affine-opencl
https://github.com/Xilinx/SDAccel_Examples/tree/master/vision/affine
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#aidw-cuda
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#aligned-types-cuda
http://docs.nvidia.com/cuda/cuda-samples/index.html
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#allreduce-cuda
https://github.com/baidu-research/baidu-allreduce
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#all-pairs-distance-cuda
https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2910913/
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#amgmk-openmp
https://asc.llnl.gov/CORAL-benchmarks/Micro/amgmk-v1.0.tar.gz
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#ans-cuda
https://github.com/weissenberger/multians
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#aobench-openmp
https://code.google.com/archive/p/aobench
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#aop-cuda
https://github.com/NVIDIA-developer-blog
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#asmooth-cuda
http://www.hcs.harvard.edu/admiralty/
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#asta-cuda
https://github.com/chai-benchmarks/chai
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#atan2-cpp
https://github.com/cms-patatrack/pixeltrack-standalone
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#atomicaggreate-cuda
https://developer.nvidia.com/blog/using-cuda-warp-level-primitives/
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#atomicintrinsics-cuda
http://docs.nvidia.com/cuda/cuda-samples/index.html
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#atomiccas-cuda
https://github.com/treecode/Bonsai/blob/master/runtime/profiling/derived_atomic_functions.h
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#atomiccost
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#atomicperf-cuda
https://stackoverflow.com/questions/22367238/cuda-atomic-operation-performance-in-different-scenarios
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#atomicreduction-hip
https://github.com/ROCm-Developer-Tools/HIP-Examples/tree/master/reduction
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#atomicsystemwide-cuda
http://docs.nvidia.com/cuda/cuda-samples/index.html
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#attention-pseudocodes
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#attentionmultihead-cuda
https://github.com/IrishCoffee/cudnnMultiHeadAttention
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#axhelm-cuda
https://github.com/Nek5000/nekBench/tree/master/axhelm
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#babelstream-cuda
https://github.com/UoB-HPC/BabelStream
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#background-subtract-cuda
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#backprop-opencl
http://lava.cs.virginia.edu/Rodinia/download_links.htm
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#bezier-surface-opencl
https://github.com/chai-benchmarks/chai
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#bfs-opencl
http://lava.cs.virginia.edu/Rodinia/download_links.htm
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#bh-cuda
https://userweb.cs.txstate.edu/~burtscher/research/ECL-BH/
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#bilateral-cuda
https://github.com/jstraub/cudaPcl
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#bincount-cuda
https://pytorch.org/
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#binomial-cuda
https://docs.nvidia.com/cuda/cuda-samples/index.html
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#bitcracker-cuda
https://github.com/e-ago/bitcracker.git
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#bitonic-sort-sycl
https://github.com/oneapi-src/oneAPI-samples/tree/master/DirectProgramming/
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#bitpacking-cuda
https://github.com/NVIDIA/nvcomp
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#bitpermute-cuda
https://github.com/supranational/sppark
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#black-scholes-cuda
https://github.com/cavazos-lab/FinanceBench
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#blas-dot-cuda
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#blas-fp8gemm-cuda
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#blas-gemm-sycl
https://www.intel.com/content/www/us/en/docs/onemkl/developer-reference-dpcpp/2025-0/gemm.html
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#blas-gemmbatched-cuda
https://github.com/pyrovski/cublasSgemmBatched-example
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#blas-gemmstridedbatched-cuda
https://github.com/pyrovski/cublasSgemmBatched-example
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#blas-gemmex-cuda
https://godweiyang.com/2021/08/24/gemm/
https://github.com/UoB-HPC/abc-pvc-deepdive
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#blas-gemmex2-cuda
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#blockaccess-cuda
https://github.com/NVIDIA/cubhttps://github.com/NVIDIA/cub
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#blockexchange-cuda
https://github.com/NVIDIA/cubhttps://github.com/NVIDIA/cub
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#bm3d-cuda
https://github.com/DawyD/bm3d-gpuhttps://github.com/DawyD/bm3d-gpu
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#bn-cuda
https://github.com/OSU-STARLAB/UVM_benchmark/blob/master/non_UVM_benchmarkshttps://github.com/OSU-STARLAB/UVM_benchmark/blob/master/non_UVM_benchmarks
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#bonds-cuda
https://github.com/cavazos-lab/FinanceBenchhttps://github.com/cavazos-lab/FinanceBench
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#boxfilter-cuda
http://developer.download.nvidia.com/compute/cuda/3_0/sdk/website/OpenCL/website/samples.htmlhttp://developer.download.nvidia.com/compute/cuda/3_0/sdk/website/OpenCL/website/samples.html
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#bscan-cuda
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#bsearch-cuda
https://www.sciencedirect.com/science/article/abs/pii/S0743731517302836https://www.sciencedirect.com/science/article/abs/pii/S0743731517302836
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#bspline-vgh-openmp
https://github.com/QMCPACK/miniqmc/blob/OMP_offload/src/OpenMP/main.cpp
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#bsw-cuda
https://github.com/mgawan/ADEPT
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#burger-openmp
https://github.com/soumyasen1809/OpenMP_C_12_steps_to_Navier_Stokes
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#bwt-cuda
https://github.com/jedbrooke/cuda_bwt
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#btree-opencl
http://lava.cs.virginia.edu/Rodinia/download_links.htm
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#car-cuda
https://github.com/sunwj/CAR
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#cbsfil-cuda
https://github.com/DannyRuijters/CubicInterpolationCUDA
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#cc-cuda
https://userweb.cs.txstate.edu/~burtscher/research/ECL-CC/
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#ccl-cuda
https://github.com/NVIDIA/nccl
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#ccs-cuda
https://github.com/abhatta3/Condition-dependent-Correlation-Subgroups-CCS
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#ccsd-trpdrv-c
https://github.com/jeffhammond/nwchem-ccsd-trpdrv
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#ced-opencl
https://github.com/chai-benchmarks/chai
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#cfd-opencl
http://lava.cs.virginia.edu/Rodinia/download_links.htm
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#chacha20-c
https://github.com/983/ChaCha20
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#channelshuffle-cuda
https://pytorch.org/
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#channelsum-cuda
https://pytorch.org/
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#che-cuda
https://github.com/myousefi2016/Cahn-Hilliard-CUDA
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#chemv-cuda
https://repo.or.cz/ppcg.git
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#chi2-cuda
https://web.njit.edu/~usman/courses/cs677_spring19/
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#clenergy-opencl
http://www.ks.uiuc.edu/Training/Workshop/GPU_Aug2010/resources/clenergy.tar.gz
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#clink-c
http://github.com/UCLA-VAST/CLINK
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#cm-cuda
https://pubmed.ncbi.nlm.nih.gov/24112435/
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#cmembench-cuda
https://github.com/ekondis/gpumembench
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#cmp-cuda
https://github.com/hpg-cepetro/IPDPS-CRS-CMP-code
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#cobahh-opencl
https://dl.acm.org/doi/10.1145/3307339.3343460
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#collision-cuda
https://github.com/facebookarchive/fbcuda
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#colorwheel-cc
https://vision.middlebury.edu/flow/code/flow-code/colorcode.cpp
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#columnarsolver-cuda
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#complex-cuda
https://github.com/tpn/cuda-samples/blob/master/v8.0/include/cuComplex.h
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#compute-score-opencl
https://www.intel.com/content/www/us/en/programmable/support/support-resources/design-examples/design-software/opencl/compute-score.html
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#concat-cuda
https://github.com/bytedance/lightseq
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#concurrentkernels-cuda
https://github.com/NVIDIA/cuda-samples/tree/master/Samples/0_Introduction/concurrentKernels
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#contract-cuda
https://github.com/HyTruongSon/GraphFlow
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#conversion-sycl
intel/llvm#7195 https://github.com/intel/llvm/issues/7195
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#convolution1d-cuda
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#convolution3d-cuda
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#convolutiondeformable-cuda
https://github.com/lucasjinreal/DCNv2_latest
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#convolutionseperable-opencl
http://developer.download.nvidia.com/compute/cuda/3_0/sdk/website/OpenCL/website/samples.html
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#cooling-cuda
https://github.com/cholla-hydro/cholla
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#coordinates-cuda
https://github.com/rapidsai/cuspatial
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#copy-cuda
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#crc64-openmp
https://trac.alcf.anl.gov/projects/hpcrc64/
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#cross-cuda
https://pytorch.org/
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#crossentropy-sycl
intel/llvm#5969 https://github.com/intel/llvm/issues/5969
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#crs-cuda
https://www.comp.hkbu.edu.hk/~chxw/gcrs.html
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#d2q9_bgk-sycl
https://github.com/WSJHawkins/ExploringSycl
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#d3q19_bgk-cuda
https://gitlab.com/unigehpfs/stlbm
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#daphne-cuda
https://github.com/esa-tu-darmstadt/daphne-benchmark
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#damage-opencl
https://github.com/alan-turing-institute/PeriPy
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#dct8x8-opencl
http://developer.download.nvidia.com/compute/cuda/3_0/sdk/website/OpenCL/website/samples.html
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#ddbp-cuda
https://github.com/LAVI-USP/DBT-Reconstruction
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#debayer-opencl
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#degrid-cuda
https://github.com/NVIDIA/SKA-gpu-degrid
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#dense-embedding-cuda
https://pytorch.org/
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#depixel-cuda
https://github.com/yzhwang/depixelization
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#deredundancy-sycl
https://github.com/JuZhenCS/gene-sequences-de-redundancy
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#determinant-cuda
https://github.com/OrangeOwlSolutions/Linear-Algebra
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#diamond-opencl
https://github.com/bbuchfink/diamond
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#dispatch-hip
https://github.com/ROCm-Developer-Tools/HIP-CPU
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#distort-cuda
https://github.com/Cuda-Chen/barrel-distortion-cuda
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#divergence-cuda
https://github.com/E3SM-Project/divergence_cmdvse
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#doh-cuda
https://github.com/rapidsai/cucim
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#dp-opencl
http://developer.download.nvidia.com/compute/cuda/3_0/sdk/website/OpenCL/website/samples.html
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#dpid-cuda
https://github.com/mergian/dpid
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#dropout-cuda
https://github.com/pytorch/
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#dslash-sycl
https://gitlab.com/NERSC/nersc-proxies/milc-dslash
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#dxtc2-opencl
http://developer.download.nvidia.com/compute/cuda/3_0/sdk/website/OpenCL/website/samples.html
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#dwconv-cuda
https://pytorch.org/
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#dwconv1d-cuda
https://github.com/BlinkDL/RWKV-CUDA/tree/main
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#easywave-cuda
https://git.gfz-potsdam.de/id2/geoperil/easyWave
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#ecdh-c
https://github.com/jaw566/ECDH
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#egs-cuda
https://jonaslippuner.com/research/cuda-egs/
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#eigenvalue-opencl
https://github.com/OpenCL/AMD_APP_samples
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#eikonal-cuda
https://github.com/SCIInstitute/StructuredEikonal
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#entropy-cuda
https://lan-jing.github.io/parallel%20computing/system/entropy/
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#epistasis-sycl
https://github.com/rafatcampos/bio-epistasis-detection
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#ert-cuda
https://bitbucket.org/berkeleylab/cs-roofline-toolkit/src/master/
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#expdist-cuda
https://github.com/benvanwerkhoven/kernel_tuner
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#extend2-c
https://github.com/lh3/bwa
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#extrema-cuda
https://github.com/rapidsai/cusignal/
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#f16max-c
https://x.momo86.net/en?p=113
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#f16sp-cuda
https://docs.nvidia.com/cuda/cuda-samples/index.html
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#face-cuda
https://sites.google.com/site/5kk73gpu2012/assignment/viola-jones-face-detection
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#fdtd3d-opencl
http://developer.download.nvidia.com/compute/cuda/3_0/sdk/website/OpenCL/website/samples.html
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#feynman-kac-c
https://people.sc.fsu.edu/~jburkardt/c_src/feynman_kac_2d/feynman_kac_2d.html
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#fhd-cuda
https://ict.senecacollege.ca/~gpu610/pages/content/cudas.html
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#filter-cuda
https://developer.nvidia.com/blog/cuda-pro-tip-optimized-filtering-warp-aggregated-atomics/
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#fft-opencl
https://github.com/vetter/shoc/
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#flame-cuda
http://gpugems.hwu-server2.crhc.illinois.edu/
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#flip-cuda
https://pytorch.org/
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#floydwarshall-hip
https://github.com/ROCm-Developer-Tools/HIP-Examples/blob/master/HIP-Examples-Applications/FloydWarshall/
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#floydwarshall2-cuda
https://userweb.cs.txstate.edu/~burtscher/research/ECL-APSP/
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#fluidsim-opencl
https://github.com/OpenCL/AMD_APP_samples
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#fpc-opencl
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#frechet-matlab
https://github.com/mp4096/discrete-frechet-distance
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#fresnel-c
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#frna-cuda
http://rna.urmc.rochester.edu/Text/Fold-cuda.html
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#fsm-cuda
https://userweb.cs.txstate.edu/~burtscher/research/FSM_GA/
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#fwt-cuda
http://docs.nvidia.com/cuda/cuda-samples/index.html
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#ga-cuda
https://github.com/NUCAR-DEV/Hetero-Mark
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#gabor-cc
https://github.com/fercer/gaborfilter
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#gamma-correction-sycl
https://github.com/intel/BaseKit-code-samples
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#gaussian-opencl
http://lava.cs.virginia.edu/Rodinia/download_links.htm
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#gc-cuda
https://userweb.cs.txstate.edu/~burtscher/research/ECL-GC/
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#gd-c
https://github.com/CGudapati/BinaryClassification
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#geam-cuda
https://docs.nvidia.com/cuda/cublas/index.html#cublas-lt-t-gt-geam
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#gels-sycl
https://www.intel.com/content/www/us/en/docs/onemkl/developer-reference-dpcpp
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#gelu-cuda
https://github.com/NVIDIA/FasterTransformer
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#gemv-cuda
https://github.com/wangsiping97/FastGEMV
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#geodesic-opencl
https://www.osti.gov/servlets/purl/1576565
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#ge-spmm-cuda
https://github.com/hgyhungry/ge-spmm
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#gibbs-cuda
https://github.com/arendsee/cuda-gibbs-example
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#glu-cuda
https://pytorch.org/docs/stable/generated/torch.nn.GLU.html
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#gmm-cuda
https://github.com/Corv/CUDA-GMM-MultiGPU
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#goulash-cuda
https://github.com/LLNL/goulash
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#gpp-cuda-omp
https://github.com/NERSC/gpu-for-science-day-july-2019
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#graphexecution-cuda
https://github.com/NVIDIA/cuda-samples/tree/master/Samples/3_CUDA_Features/simpleCudaGraphs
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#grep-cuda
https://github.com/bkase/CUDA-grep
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#grrt-cuda
https://github.com/hungyipu/Odyssey
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#gru-cuda
https://pytorch.org/
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#haccmk-c
https://asc.llnl.gov/CORAL-benchmarks/#haccmk
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#halo-finder-cuda
https://gem5.googlesource.com/public/gem5-resources
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#hausdorff-cc
https://github.com/arohamirai/Hausdorff-Distance-Match
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#haversine-cuda
https://github.com/rapidsai/cuspatial
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#hbc-cuda
https://github.com/Adam27X/hybrid_BC
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#heartwall-opencl
http://lava.cs.virginia.edu/Rodinia/download_links.htm
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#heat-sycl
https://github.com/UoB-HPC/heat_sycl
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#heat2d-cuda
https://github.com/gpucw/cuda-lapl
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#hellinger-cuda
https://github.com/rapidsai/raft
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#henry-cuda
https://github.com/CorySimon/HenryCoefficient
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#hexicton-opencl
https://github.com/noma/hexciton_benchmark
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#histogram-cuda
http://github.com/NVlabs/cub/tree/master
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#hmm-opencl
http://developer.download.nvidia.com/compute/DevZone/OpenCL/Projects/oclHiddenMarkovModel.tar.gz
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#hogbom-cuda
https://github.com/ATNF/askap-benchmarks/
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#hotspot-opencl
http://lava.cs.virginia.edu/Rodinia/download_links.htm
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#hotspot3d-opencl
http://lava.cs.virginia.edu/Rodinia/download_links.htm
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#hpl-cuda
https://github.com/oneapi-src/Velocity-Bench
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#hungarian-cuda
https://github.com/paclopes/HungarianGPU
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#hwt1d-opencl
https://github.com/OpenCL/AMD_APP_samples
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#hybridsort-opencl
http://lava.cs.virginia.edu/Rodinia/download_links.htm
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#hypterm-cuda
https://github.com/pssrawat/ppopp-artifact
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#idivide-cuda
https://github.com/milakov/int_fastdiv
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#interleave-cuda
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#interval-cuda
https://docs.nvidia.com/cuda/cuda-samples/index.html
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#intrinsics-cast-cuda
https://docs.nvidia.com/cuda/cuda-math-api/group__CUDA__MATH__INTRINSIC__CAST.html
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#inversek2j-cuda
http://axbench.org/
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#is-cuda
https://github.com/GMAP/NPB-GPU
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#ising-cuda
https://github.com/NVIDIA/ising-gpu/
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#iso2dfd-sycl
https://github.com/intel/HPCKit-code-samples/
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#jaccard-cuda
https://github.com/rapidsai/nvgraph/blob/main/cpp/src/jaccard_gpu.cu
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#jacobi-cuda
https://github.com/NVIDIA/multi-gpu-programming-models/blob/master/single_gpu/jacobi.cu
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#jenkins-hash-c
https://android.googlesource.com/platform/external/jenkins-hash/+/75dbeadebd95869dd623a29b720678c5c5c55630/lookup3.c
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#kalman-cuda
https://github.com/rapidsai/cuml/
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#keccaktreehash-cuda
http://sites.google.com/site/keccaktreegpu/
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#keogh-cuda
https://github.com/gravitino/cudadtw
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#kernellaunch-hip
https://github.com/ROCm/hip-tests
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#kmc-cuda
https://github.com/MKLab-ITI/CUDA
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#kmeans-opencl
http://lava.cs.virginia.edu/Rodinia/download_links.htm
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#knn-cuda
https://github.com/OSU-STARLAB/UVM_benchmark/blob/master/non_UVM_benchmarks
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#kurtosis-cuda
https://github.com/d-d-j/ddj_store
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#lanczos-cuda
https://github.com/linhr/15618
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#langford-cuda
https://github.com/boris-dimitrov/z4_planar_langford_multigpu
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#laplace-cuda
https://github.com/kyleniemeyer/laplace_gpu
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#laplace3d-cuda
https://github.com/gpgpu-sim/ispass2009-benchmarks
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#lavamd-opencl
http://lava.cs.virginia.edu/Rodinia/download_links.htm
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#layernorm-cuda
https://github.com/karpathy/llm.c
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#layout-opencl
https://github.com/OpenCL/AMD_APP_samples
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#lci-c
https://github.com/vskokov/Landau_Collisional_Integral
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#lda-cuda
https://github.com/js1010/cusim
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#ldpc-cuda
https://github.com/robertwgh/cuLDPC
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#lebesgue-c
https://people.math.sc.edu/Burkardt/c_src/lebesgue/lebesgue.html
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#leukocyte--opencl
http://lava.cs.virginia.edu/Rodinia/download_links.htm
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#lfib4-cuda
https://bitbucket.org/przemstp/gpu-marsa-lfib4/src/master/
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#libor-cuda
https://people.maths.ox.ac.uk/~gilesm/cuda_old.html
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#lid-driven-cavity--cuda
https://github.com/kyleniemeyer/lid-driven-cavity_gpu
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#lif-cuda
https://github.com/e2crawfo/hrr-scaling
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#linearprobing-cuda
https://github.com/nosferalatu/SimpleGPUHashTable
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#log2-c
https://adacenter.org/sites/default/files/milspec/Transcendentals.zip
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#logan-cuda
https://github.com/albertozeni/LOGAN
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#logic-resim-cuda
https://github.com/ceKyleLee/cad20-tgls
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#logic-rewrite-cuda
https://github.com/cuhk-eda/CULS
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#logprob-cuda
https://github.com/NVIDIA/FasterTransformer
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#lombscargle-cuda
https://github.com/rapidsai/cusignal/
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#loopback-cuda
https://developer.nvidia.com/gpugems/gpugems3/part-vi-gpu-computing/chapter-37-efficient-random-number-generation-and-application
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#lr-opencl
https://github.com/ChenyangZhang-cs/iMLBench
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#lrn-sycl
intel/llvm#8292 https://github.com/intel/llvm/issues/8292
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#lsqt-cuda
https://github.com/brucefan1983/gpuqt
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#lud-opencl
http://lava.cs.virginia.edu/Rodinia/download_links.htm
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#ludb-cuda
https://docs.nvidia.com/cuda/cuda-samples/index.html
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#lulesh-cuda
https://github.com/LLNL/LULESH
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#lzss-cuda
https://github.com/hipdac-lab/ICS23-GPULZ
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#mallocfree-hip
https://github.com/ROCm-Developer-Tools/HIP/
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#mandelbrot-sycl
https://github.com/oneapi-src/oneAPI-samples/tree/master/DirectProgramming
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#marchingcubes-cuda
https://github.com/LRLVEC/MarchingCubes
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#mask-cuda
https://pytorch.org/
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#match-cuda
https://github.com/Celebrandil/CudaSift
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#matern-cuda
https://tbetcke.github.io/hpc_lecture_notes/rbf_evaluation.html
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#matrix-rotate-openmp
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#matrixt-cuda
https://docs.nvidia.com/cuda/cuda-samples/index.html
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#maxpool3d-opencl
https://github.com/nachiket/papaa-opencl
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#maxflops-opencl
https://github.com/vetter/shoc/
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#mcmd-cuda
https://github.com/khavernathy/mcmd
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#mcpr-cuda
https://github.com/berkeley-scf/gpu-workshop-2016
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#md-opencl
https://github.com/vetter/shoc/
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#mdh-opencl
http://gpugems.hwu-server2.crhc.illinois.edu/
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#md5hash-opencl
https://github.com/vetter/shoc/
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#meanshift-cuda
https://github.com/w00zie/mean_shift
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#medianfilter-opencl
http://developer.download.nvidia.com/compute/cuda/3_0/sdk/website/OpenCL/website/samples.html
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#merkle-sycl
https://github.com/itzmeanjan/ff-gpu
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#memcpy-cuda
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#memtest-cuda
https://github.com/ComputationalRadiationPhysics/cuda_memtest
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#merge-cuda
https://github.com/ogreen/MergePathGPU
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#metropolis-cuda
https://github.com/crinavar/trueke
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#mf-sgd-cuda
https://github.com/cuMF/cumf_sgd
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#michalewicz-c
https://www.sfu.ca/~ssurjano/michal.html
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#minife-omp
https://github.com/Mantevo/miniFE
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#minibude-sycl
https://github.com/UoB-HPC/miniBUDE
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#minimap2-cuda
https://github.com/UCLA-VAST/minimap2-acceleration
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#minimod-cuda
https://github.com/rsrice/gpa-minimod-artifacts
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#minisweep-openmp
https://github.com/wdj/minisweep
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#miniweather-openmp
https://github.com/mrnorman/miniWeather
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#minkowski-cuda
https://github.com/rapidsai/raft
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#minmax-cuda
https://github.com/rapidsai/cuspatial
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#mis-cuda
http://www.cs.txstate.edu/~burtscher/research/ECL-MIS/
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#mixbench-cuda
https://github.com/ekondis/mixbench
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#mmcsf-cuda
https://github.com/isratnisa/MM-CSF
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#mnist-cuda
https://developer.amd.com/wp-content/resources
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#morphology-cuda
https://github.com/yszheda/CUDA-Morphology
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#mr-c
https://github.com/wizykowski/miller-rabin
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#mriq-cuda
https://github.com/abduld/Parboil/blob/master/benchmarks/mri-q/DESCRIPTION
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#mrc-cuda
https://pytorch.org
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#mrg32k3a-cuda
https://github.com/NVIDIA/CUDALibrarySamples/
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#mt-opencl
http://developer.download.nvidia.com/compute/cuda/3_0/sdk/website/OpenCL/website/samples.html
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#mtf-thrust
https://github.com/bzip2-cuda/bzip2-cuda
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#multimaterial-sycl
https://github.com/reguly/multimaterial
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#multinomial-cuda
https://pytorch.org
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#murmurhash3-c
https://github.com/aappleby/smhasher/wiki/MurmurHash3
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#myocte-opencl
http://lava.cs.virginia.edu/Rodinia/download_links.htm
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#nbnxm-sycl
https://manual.gromacs.org/current/doxygen/html-full/page_nbnxm.xhtml
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#nbody-opencl
https://github.com/oneapi-src/oneAPI-samples/tree/master/DirectProgramming/C%2B%2BSYCL/N-BodyMethods/Nbody
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#ne-cuda
https://github.com/PointCloudLibrary/pcl
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#nlll-cuda
https://pytorch.org/
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#nms-cuda
https://github.com/hertasecurity/gpu-nms
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#nn-opencl
http://lava.cs.virginia.edu/Rodinia/download_links.htm
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#nonzero-cuda
https://pytorch.org/
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#norm2-cuda
https://docs.nvidia.com/cuda/cublas
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#nosync-cuda
https://github.com/NVIDIA/thrust/tree/main/examples
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#nqueen-cuda
https://github.com/tcarneirop/ChOp
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#ntt-cuda
https://github.com/vernamlab/cuHE
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#nw-opencl
http://lava.cs.virginia.edu/Rodinia/download_links.htm
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#openmp-cuda
https://docs.nvidia.com/cuda/cuda-samples/index.html
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#opticalflow-cuda
https://github.com/NVIDIA/cuda-samples/blob/master/Samples/5_Domain_Specific/HSOpticalFlow
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#overlap-cuda
https://docs.nvidia.com/cuda/cuda-samples/index.html
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#overlay-cuda
https://github.com/dusty-nv/jetson-inference
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#p2p-cuda
https://docs.nvidia.com/cuda/cuda-samples/index.html
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#p4-cuda
https://github.com/NVIDIA-AI-IOT/CUDA-PointPillars
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#pad-cuda
https://github.com/chai-benchmarks/chai
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#page-rank-opencl
https://github.com/Sable/Ostrich/tree/master/map-reduce/page-rank
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#particle-diffusion-sycl
https://www.intel.com/content/www/us/en/developer/articles/code-sample/oneapi-dpcpp-compiler-example-particle-diffusion.html
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#particlefilter-opencl
http://lava.cs.virginia.edu/Rodinia/download_links.htm
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#particles-opencl
http://developer.download.nvidia.com/compute/cuda/3_0/sdk/website/OpenCL/website/samples.html
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#pathfinder-opencl
http://lava.cs.virginia.edu/Rodinia/download_links.htm
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#pcc-cuda
https://github.com/pcdslab/Fast-GPU-PCC
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#perlin-cuda
https://github.com/silverweed/perlin_cuda
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#permutate-cuda
https://github.com/yeah1kim/yeah_GPU_SP800_90B_IID
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#permute-cuda
https://github.com/karpathy/llm.c
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#perplexity-cuda
https://github.com/rapidsai/cuml/
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#phmm-cuda
https://github.com/lienliang/Pair_HMM_forward_GPU
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#pingpong-cuda
https://github.com/olcf-tutorials/MPI_ping_pong
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#pitch-cuda
https://docs.nvidia.com/cuda/cuda-c-programming-guide
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#pnpoly-cuda
https://github.com/benvanwerkhoven/kernel_tuner
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#pns-cuda
https://github.com/abduld/Parboil/
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#pointwise-cuda
https://developer.nvidia.com/blog/optimizing-recurrent-neural-networks-cudnn-5/
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#pool-hip
https://github.com/PaddlePaddle/Paddle
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#popcount-opencl
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#prefetch-hip
https://github.com/ROCm-Developer-Tools/HIP/
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#present-c
https://github.com/bozhu/PRESENT-C/blob/master/present.h
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#prna-cuda
http://rna.urmc.rochester.edu/Text/partition-cuda.html
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#projectile-sycl
https://github.com/intel/BaseKit-code-samples
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#pso-cuda
https://github.com/wiseodd/cuda-pso
https://github.com/chensohg/GPU_CUDA_PSO
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#qem-cuda
https://github.com/qureshizawar/CUDA-quartic-solver
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#qkv-cuda
https://github.com/karpathy/llm.c
https://github.com/mspronesti/llm.sycl
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#qrg-cuda
http://developer.download.nvidia.com/compute/cuda/3_0/sdk/website/OpenCL/website/samples.html
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#qtclustering-opencl
https://github.com/vetter/shoc/
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#quant-cuda
https://github.com/bitsandbytes-foundation/bitsandbytes/tree/main
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#quicksort-sycl
https://software.intel.com/content/www/us/en/develop/download/code-for-the-parallel-universe-article-gpu-quicksort-from-opencl-to-data-parallel-c.html
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#radixsort-opencl
http://developer.download.nvidia.com/compute/cuda/3_0/sdk/website/OpenCL/website/samples.html
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#radixsort2-cuda
https://github.com/NVIDIA/cuda-samples
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#rainflow-c
https://github.com/carlos-souto/rainflow-cycle-counting
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#randomaccess-openmp
https://icl.cs.utk.edu/projectsfiles/hpcc/RandomAccess/
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#rayleighbenardconvection-cuda
https://github.com/sanchda/2D_RB_convec/tree/main
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#reaction-cuda
https://github.com/ifilot/wavefuse
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#recursivegaussian-opencl
http://developer.download.nvidia.com/compute/cuda/3_0/sdk/website/OpenCL/website/samples.html
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#remap-cuda
https://pytorch.org/
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#relu-cuda
https://github.com/tensorflow
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#resize-cuda
https://github.com/opencv/
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#resnet-kernels-cuda
https://github.com/xuqiantong/CUDA-Winograd
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#reverse-cuda
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#reverse2d-cuda
https://github.com/rapidsai/cuml
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#rfs-cuda
https://github.com/facebookarchive/fbcuda
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#ring-sycl
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#rle-cuda
https://github.com/NVIDIA/cub
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#rng-wallace-cuda
https://developer.nvidia.com/gpugems/gpugems3/part-vi-gpu-computing/chapter-37-efficient-random-number-generation-and-application
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#rodrigues-cuda
https://github.com/DIDSR/VICTRE_MCGPU
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#romberg-cuda
https://github.com/SwayambhuNathRay/Parallel-Romberg-Integration
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#rowwisemoments-cuda
https://pytorch.org/
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#rotary-cuda
https://github.com/Dao-AILab/flash-attention/tree/main/csrc/rotary
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#rsbench-opencl
https://github.com/ANL-CESAR/RSBench/
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#rsc-cuda
https://github.com/chai-benchmarks/chai
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#rsmt-cuda
https://userweb.cs.txstate.edu/~burtscher/research/SFP/
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#rtm8-hip
https://github.com/ROCm-Developer-Tools/HIP-Examples/tree/master/rtm8
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#rushlarsen-cuda
https://bitbucket.org/finsberg/gotran/src/master
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#s3d-opencl
https://github.com/vetter/shoc/
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#s8n-cuda
https://github.com/MVIG-SJTU/pointSIFT/tree/master
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#sa-cuda
https://github.com/gmzang/Parallel-Suffix-Array-on-GPU
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#sad-cuda
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#sampling-cuda
https://github.com/rapidsai/cuml
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#saxpy-ompt-openmp
https://github.com/pc2/OMP-Offloading
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#sc-cuda
https://github.com/chai-benchmarks/chai
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#scan-cuda
https://developer.nvidia.com/gpugems/gpugems3/part-vi-gpu-computing/chapter-39-parallel-prefix-sum-scan-cuda
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#scan2-opencl
https://github.com/OpenCL/AMD_APP_samples
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#scan3-cuda
https://github.com/NVIDIA/cub
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#scel-cuda
https://pytorch.org/
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#score-cuda
https://github.com/opencv/
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#sddmm-batch-cuda
https://docs.nvidia.com/cuda/cusparse/index.html#cusparsesddmm
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#secp256k1-cuda
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#seam-carving-cuda
https://github.com/pauty/CUDA_seam_carvinghttps://github.com/pauty/CUDA_seam_carving
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#segment-reduce-cuda
https://github.com/c3sr/tcu_scopehttps://github.com/c3sr/tcu_scope
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#segsort-cuda
https://github.com/Funatiq/bb_segsorthttps://github.com/Funatiq/bb_segsort
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#sheath-cuda
https://www.particleincell.com/2016/cuda-pic/https://www.particleincell.com/2016/cuda-pic/
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#shmembench-cuda
https://github.com/ekondis/gpumembenchhttps://github.com/ekondis/gpumembench
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#shuffle-hip
https://github.com/cpc/hipcl/tree/master/samples/4_shflhttps://github.com/cpc/hipcl/tree/master/samples/4_shfl
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#si-cuda
https://github.com/chribell/set_intersectionhttps://github.com/chribell/set_intersection
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#simplemoc-opencl
https://github.com/ANL-CESAR/SimpleMOC-kernelhttps://github.com/ANL-CESAR/SimpleMOC-kernel
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#simplemultidevice-cuda
https://docs.nvidia.com/cuda/cuda-samples/index.htmlhttps://docs.nvidia.com/cuda/cuda-samples/index.html
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#simplespmv-cuda
https://github.com/passlab/CUDAMicroBenchhttps://github.com/passlab/CUDAMicroBench
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#slit-cuda
https://github.com/bamaratunga/cuda_fft.githttps://github.com/bamaratunga/cuda_fft.git
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#slu-cuda
https://github.com/sheldonucr/GLU_publichttps://github.com/sheldonucr/GLU_public
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#snake-cuda
https://github.com/CMU-SAFARI/SneakySnakehttps://github.com/CMU-SAFARI/SneakySnake
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#snicit-cuda
https://github.com/IDEA-CUHK/SNICIThttps://github.com/IDEA-CUHK/SNICIT
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#sobel-opencl
https://github.com/OpenCL/AMD_APP_sampleshttps://github.com/OpenCL/AMD_APP_samples
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#sobol-cuda
https://docs.nvidia.com/cuda/cuda-samples/index.htmlhttps://docs.nvidia.com/cuda/cuda-samples/index.html
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#softmax-opencl
https://github.com/pytorch/glow/tree/master/lib/Backends/OpenCLhttps://github.com/pytorch/glow/tree/master/lib/Backends/OpenCL
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#softmax-fused-cuda
https://github.com/Dao-AILab/flash-attention/tree/main/csrc/fused_softmaxhttps://github.com/Dao-AILab/flash-attention/tree/main/csrc/fused_softmax
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#softmax-online-cuda
https://github.com/karpathy/llm.chttps://github.com/karpathy/llm.c
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#sort-opencl
https://github.com/vetter/shoc/https://github.com/vetter/shoc/
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#sortkv-cuda
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#sosfil-cuda
https://github.com/rapidsai/cusignal/https://github.com/rapidsai/cusignal/
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#sparkler-cuda
https://github.com/wdj/sparklerhttps://github.com/wdj/sparkler
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#sph-openmp
https://github.com/olcf/SPH_Simplehttps://github.com/olcf/SPH_Simple
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#split-cuda
http://developer.download.nvidia.com/compute/cuda/3_0/sdk/website/OpenCL/website/samples.htmlhttp://developer.download.nvidia.com/compute/cuda/3_0/sdk/website/OpenCL/website/samples.html
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#spm-cuda
http://mri.ee.ntust.edu.tw/cuda/http://mri.ee.ntust.edu.tw/cuda/
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#spd2s-cuda
https://docs.nvidia.com/cuda/cusparse/index.html#cusparsedensetosparsehttps://docs.nvidia.com/cuda/cusparse/index.html#cusparsedensetosparse
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#spgeam-cuda-sycl
https://docs.nvidia.com/cuda/cusparse/index.htmlhttps://docs.nvidia.com/cuda/cusparse/index.html
https://www.intel.com/content/www/us/en/docs/onemkl/developer-reference-dpcpp/2023-1/oneapi-mkl-sparse-omatcopy.htmlhttps://www.intel.com/content/www/us/en/docs/onemkl/developer-reference-dpcpp/2023-1/oneapi-mkl-sparse-omatcopy.html
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#spgemm-cuda
https://docs.nvidia.com/cuda/cusparse/index.htmlhttps://docs.nvidia.com/cuda/cusparse/index.html
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#spmm-cuda
https://docs.nvidia.com/cuda/cusparse/index.htmlhttps://docs.nvidia.com/cuda/cusparse/index.html
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#spmv-cuda
https://docs.nvidia.com/cuda/cusparse/index.htmlhttps://docs.nvidia.com/cuda/cusparse/index.html
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#spnnz-cuda
https://docs.nvidia.com/cuda/cusparse/index.html#cusparse-t-nnzhttps://docs.nvidia.com/cuda/cusparse/index.html#cusparse-t-nnz
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#sps2d-cuda
https://docs.nvidia.com/cuda/cusparse/index.html#cusparsesparsetodensehttps://docs.nvidia.com/cuda/cusparse/index.html#cusparsesparsetodense
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#spsort-cuda-and-sycl
https://docs.nvidia.com/cuda/cusparse/index.html#cusparsexcsrsorthttps://docs.nvidia.com/cuda/cusparse/index.html#cusparsexcsrsort
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#sptrsv-cuda
https://github.com/JiyaSu/CapelliniSpTRSVhttps://github.com/JiyaSu/CapelliniSpTRSV
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#srad-opencl
http://lava.cs.virginia.edu/Rodinia/download_links.htmhttp://lava.cs.virginia.edu/Rodinia/download_links.htm
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#ss-opencl
https://github.com/OpenCL/AMD_APP_sampleshttps://github.com/OpenCL/AMD_APP_samples
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#ssim-cuda
https://github.com/VIDILabs/instantvnrhttps://github.com/VIDILabs/instantvnr
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#sss-cuda
https://github.com/mukherjeec/DPmixGGM/https://github.com/mukherjeec/DPmixGGM/
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#sssp-opencl
https://github.com/chai-benchmarks/chaihttps://github.com/chai-benchmarks/chai
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#stddev-cuda
https://github.com/rapidsai/rafthttps://github.com/rapidsai/raft
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#stencil1d-cuda
https://www.olcf.ornl.gov/wp-content/uploads/2019/12/02-CUDA-Shared-Memory.pdfhttps://www.olcf.ornl.gov/wp-content/uploads/2019/12/02-CUDA-Shared-Memory.pdf
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#stencil3d-cuda
https://github.com/LLNL/cardioidhttps://github.com/LLNL/cardioid
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#streamcluster-opencl
http://lava.cs.virginia.edu/Rodinia/download_links.htmhttp://lava.cs.virginia.edu/Rodinia/download_links.htm
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#streamcreatecopydestroy-hip
https://github.com/ROCm-Developer-Tools/HIP/https://github.com/ROCm-Developer-Tools/HIP/
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#streamorderedallocation-cuda
https://github.com/NVIDIA/cuda-samples/https://github.com/NVIDIA/cuda-samples/
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#streampriority-cuda
https://github.com/NVIDIA/cuda-samples/https://github.com/NVIDIA/cuda-samples/
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#streamum-cuda
https://github.com/NVIDIA/cuda-samples/https://github.com/NVIDIA/cuda-samples/
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#stsg-cuda
https://github.com/HPSCIL/cuSTSGhttps://github.com/HPSCIL/cuSTSG
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#su3-sycl
https://gitlab.com/NERSC/nersc-proxies/su3_benchhttps://gitlab.com/NERSC/nersc-proxies/su3_bench
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#surfel-cuda
https://github.com/jstraub/cudaPclhttps://github.com/jstraub/cudaPcl
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#svd3x3-cuda
https://github.com/kuiwuchn/3x3_SVD_CUDAhttps://github.com/kuiwuchn/3x3_SVD_CUDA
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#sw4ck-cuda
https://github.com/LLNL/SW4CKhttps://github.com/LLNL/SW4CK
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#swish-cuda
https://pytorch.org/https://pytorch.org/
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#tensoraccessor-cuda
https://pytorch.org/https://pytorch.org/
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#tensort-cuda
https://github.com/Jokeren/GPA-Benchmark/tree/master/ExaTENSORhttps://github.com/Jokeren/GPA-Benchmark/tree/master/ExaTENSOR
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#testsnap-openmp
https://github.com/FitSNAP/TestSNAPhttps://github.com/FitSNAP/TestSNAP
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#thomas-cuda
https://pm.bsc.es/gitlab/run-math/cuThomasBatch/tree/masterhttps://pm.bsc.es/gitlab/run-math/cuThomasBatch/tree/master
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#threadfence-cuda
https://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html#memory-fence-functionshttps://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html#memory-fence-functions
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#tissue-cuda
https://github.com/secomb/GreensTD19_GPUhttps://github.com/secomb/GreensTD19_GPU
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#tonemapping-opencl
https://github.com/OpenCL/AMD_APP_sampleshttps://github.com/OpenCL/AMD_APP_samples
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#tpacf-cuda
https://users.ncsa.illinois.edu/kindr/projects/hpca/index.htmlhttps://users.ncsa.illinois.edu/kindr/projects/hpca/index.html
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#tqs-cuda
https://github.com/chai-benchmarks/chaihttps://github.com/chai-benchmarks/chai
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#triad-opencl
https://github.com/vetter/shoc/https://github.com/vetter/shoc/
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#tridiagonal-opencl
http://developer.download.nvidia.com/compute/cuda/3_0/sdk/website/OpenCL/website/samples.htmlhttp://developer.download.nvidia.com/compute/cuda/3_0/sdk/website/OpenCL/website/samples.html
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#tsa-cuda
https://bitbucket.org/zzzoom/trottersuzuki/src/master/https://bitbucket.org/zzzoom/trottersuzuki/src/master/
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#tsp-cuda
https://userweb.cs.txstate.edu/~burtscher/research/TSP_GPU/https://userweb.cs.txstate.edu/~burtscher/research/TSP_GPU/
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#unfold-cuda
https://pytorch.org/https://pytorch.org/
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#urng-opencl
https://github.com/OpenCL/AMD_APP_sampleshttps://github.com/OpenCL/AMD_APP_samples
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#vangenuchten-cuda
https://github.com/HydroComplexity/Dharahttps://github.com/HydroComplexity/Dhara
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#vmc-cuda
https://github.com/wadejong/Summer-School-Materials/tree/master/Examples/vmchttps://github.com/wadejong/Summer-School-Materials/tree/master/Examples/vmc
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#vol2col-cuda
https://pytorch.org/https://pytorch.org/
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#vote-cuda
https://github.com/NVIDIA/cuda-samples/https://github.com/NVIDIA/cuda-samples/
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#voxelization-cuda
https://github.com/NVIDIA-AI-IOT/Lidar_AI_Solutionhttps://github.com/NVIDIA-AI-IOT/Lidar_AI_Solution
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#warpexchange-cuda
https://github.com/NVIDIA/cubhttps://github.com/NVIDIA/cub
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#warpsort-cuda
https://github.com/facebookarchive/fbcudahttps://github.com/facebookarchive/fbcuda
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#wedford-cuda
https://github.com/hpcaitech/ColossalAIhttps://github.com/hpcaitech/ColossalAI
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#winograd-opencl
https://github.com/ChenyangZhang-cs/iMLBenchhttps://github.com/ChenyangZhang-cs/iMLBench
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#wlcpow-cuda
https://github.com/AnselGitAccount/USERMESO-2.0https://github.com/AnselGitAccount/USERMESO-2.0
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#wmma-hip
https://github.com/ROCm/rocWMMAhttps://github.com/ROCm/rocWMMA
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#word2vec-cuda
https://github.com/cudabigdata/word2vec_cudahttps://github.com/cudabigdata/word2vec_cuda
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#wordcount-cuda
https://github.com/NVIDIA/thrust/blob/main/examples/https://github.com/NVIDIA/thrust/blob/main/examples/
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#wsm5-cuda
https://github.com/gpgpu-sim/ispass2009-benchmarks/tree/master/wphttps://github.com/gpgpu-sim/ispass2009-benchmarks/tree/master/wp
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#wyllie-cuda
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#xlqc-cuda
https://github.com/recoli/XLQChttps://github.com/recoli/XLQC
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#xsbench-opencl
https://github.com/ANL-CESAR/XSBench/https://github.com/ANL-CESAR/XSBench/
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#zerocopy-cuda
https://github.com/NVIDIA/cuda-samples/tree/master/Samples/0_Introduction/simpleZeroCopyhttps://github.com/NVIDIA/cuda-samples/tree/master/Samples/0_Introduction/simpleZeroCopy
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#zeropoint-cuda
https://pytorch.org/https://pytorch.org/
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#zmddft-cuda
https://github.com/spiral-software/fftxhttps://github.com/spiral-software/fftx
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#zoom-cuda
https://github.com/rapidsai/cucimhttps://github.com/rapidsai/cucim
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#developer
https://github.com/zjin-lcfhttps://github.com/zjin-lcf
https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#acknowledgement
Readme https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#readme-ov-file
BSD-3-Clause license https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation#BSD-3-Clause-1-ov-file
Activityhttps://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation/activity
Custom propertieshttps://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation/custom-properties
6 starshttps://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation/stargazers
1 watchinghttps://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation/watchers
1 forkhttps://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation/forks
Releaseshttps://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation/releases
Packages 0https://patch-diff.githubusercontent.com/orgs/Scientific-Computing-Lab/packages?repo_name=ParallelCodeEstimation
Contributors 10https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation/graphs/contributors
C++ 38.3% https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation/search?l=c%2B%2B
Cuda 22.4% https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation/search?l=cuda
C 21.5% https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation/search?l=c
Jupyter Notebook 10.9% https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation/search?l=jupyter-notebook
Makefile 4.9% https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation/search?l=makefile
Roff 0.9% https://patch-diff.githubusercontent.com/Scientific-Computing-Lab/ParallelCodeEstimation/search?l=roff

Viewport: width=device-width


URLs of crawlers that visited me.