| og:image:alt | This pull request introduces deduplication functionality to the export.py script and updates the documentation to include testing instructions. The key changes include integrating a deduplication m... |
| PaliC | https://patch-diff.githubusercontent.com/PaliC |
| gpu-mode:main | https://patch-diff.githubusercontent.com/gpu-mode/kernelbot-data/tree/main |
| PaliC:dedup | https://patch-diff.githubusercontent.com/PaliC/kernelbot-data/tree/dedup |
|
Add deduplication logic
| https://patch-diff.githubusercontent.com/gpu-mode/kernelbot-data/pull/1#top |
|
Conversation
6
| https://patch-diff.githubusercontent.com/gpu-mode/kernelbot-data/pull/1 |
|
Commits
11
| https://patch-diff.githubusercontent.com/gpu-mode/kernelbot-data/pull/1/commits |
|
Checks
0
| https://patch-diff.githubusercontent.com/gpu-mode/kernelbot-data/pull/1/checks |
|
Files changed
| https://patch-diff.githubusercontent.com/gpu-mode/kernelbot-data/pull/1/files |
|
| https://patch-diff.githubusercontent.com/PaliC |
| PaliC | https://patch-diff.githubusercontent.com/PaliC |
| Jun 23, 2025 | https://patch-diff.githubusercontent.com/gpu-mode/kernelbot-data/pull/1#issue-3169438963 |
| https://medium.com/@omkarsoak/from-min-hashing-to-locality-sensitive-hashing-the-complete-process-b88b298d71a1 | https://medium.com/@omkarsoak/from-min-hashing-to-locality-sensitive-hashing-the-complete-process-b88b298d71a1 |
| PaliC | https://patch-diff.githubusercontent.com/PaliC |
| June 23, 2025 16:56 | https://patch-diff.githubusercontent.com/gpu-mode/kernelbot-data/pull/1#commits-pushed-15c53eb |
|
| https://patch-diff.githubusercontent.com/PaliC |
| Add deduplication logic | https://patch-diff.githubusercontent.com/gpu-mode/kernelbot-data/pull/1/commits/15c53ebe903bedec04b3fe4971188e0535a00c96 |
| 15c53eb | https://patch-diff.githubusercontent.com/gpu-mode/kernelbot-data/pull/1/commits/15c53ebe903bedec04b3fe4971188e0535a00c96 |
|
| https://patch-diff.githubusercontent.com/PaliC |
| magic number removal | https://patch-diff.githubusercontent.com/gpu-mode/kernelbot-data/pull/1/commits/2ad5e808d56243da49aa25a44dbac861628263d7 |
| 2ad5e80 | https://patch-diff.githubusercontent.com/gpu-mode/kernelbot-data/pull/1/commits/2ad5e808d56243da49aa25a44dbac861628263d7 |
| https://patch-diff.githubusercontent.com/PaliC |
| PaliC | https://patch-diff.githubusercontent.com/PaliC |
| June 24, 2025 00:01 | https://patch-diff.githubusercontent.com/gpu-mode/kernelbot-data/pull/1#event-18283014541 |
| https://patch-diff.githubusercontent.com/msaroufim |
| msaroufim | https://patch-diff.githubusercontent.com/msaroufim |
| Jun 24, 2025 | https://patch-diff.githubusercontent.com/gpu-mode/kernelbot-data/pull/1#issuecomment-2998335714 |
|
| https://patch-diff.githubusercontent.com/PaliC |
| Add deduplicated datasets | https://patch-diff.githubusercontent.com/gpu-mode/kernelbot-data/pull/1/commits/a30f85d0353b6347aef785e827edc7f25f97bae9 |
| a30f85d | https://patch-diff.githubusercontent.com/gpu-mode/kernelbot-data/pull/1/commits/a30f85d0353b6347aef785e827edc7f25f97bae9 |
| https://patch-diff.githubusercontent.com/ngc92 |
| ngc92 | https://patch-diff.githubusercontent.com/ngc92 |
|
Jun 24, 2025
| https://patch-diff.githubusercontent.com/gpu-mode/kernelbot-data/pull/1#pullrequestreview-2953662215 |
|
View reviewed changes
| https://patch-diff.githubusercontent.com/gpu-mode/kernelbot-data/pull/1/files/2ad5e808d56243da49aa25a44dbac861628263d7 |
| dedup.py | https://patch-diff.githubusercontent.com/gpu-mode/kernelbot-data/pull/1/files/2ad5e808d56243da49aa25a44dbac861628263d7#diff-317f5a4bb7d577d3a0dff96c929bbaa86aa1783bd0e4a3a4de0c21fb0309026f |
| ngc92 | https://patch-diff.githubusercontent.com/ngc92 |
| Jun 24, 2025 | https://patch-diff.githubusercontent.com/gpu-mode/kernelbot-data/pull/1#discussion_r2163892198 |
| PaliC | https://patch-diff.githubusercontent.com/PaliC |
| Jun 24, 2025 | https://patch-diff.githubusercontent.com/gpu-mode/kernelbot-data/pull/1#discussion_r2163896346 |
| https://patch-diff.githubusercontent.com/PaliC |
| PaliC | https://patch-diff.githubusercontent.com/PaliC |
| Jun 24, 2025 | https://patch-diff.githubusercontent.com/gpu-mode/kernelbot-data/pull/1#issuecomment-3000312179 |
| @msaroufim | https://github.com/msaroufim |
| https://patch-diff.githubusercontent.com/PaliC |
| PaliC | https://patch-diff.githubusercontent.com/PaliC |
| Jun 24, 2025 | https://patch-diff.githubusercontent.com/gpu-mode/kernelbot-data/pull/1#issuecomment-3000961074 |
| @msaroufim | https://github.com/msaroufim |
| https://www.diffchecker.com/KamzTAeT/ | https://www.diffchecker.com/KamzTAeT/ |
| https://www.diffchecker.com/MtK5pbWL/ | https://www.diffchecker.com/MtK5pbWL/ |
| https://patch-diff.githubusercontent.com/PaliC |
| PaliC | https://patch-diff.githubusercontent.com/PaliC |
| ngc92 | https://patch-diff.githubusercontent.com/ngc92 |
| June 24, 2025 15:33 | https://patch-diff.githubusercontent.com/gpu-mode/kernelbot-data/pull/1#event-18297678410 |
| November 24, 2025 15:58 | https://patch-diff.githubusercontent.com/gpu-mode/kernelbot-data/pull/1#commits-pushed-16567f9 |
| Updates for 2nd comptetition | https://patch-diff.githubusercontent.com/gpu-mode/kernelbot-data/pull/1/commits/16567f949cecc27b946ff352d90f2323681e97a4 |
| 16567f9 | https://patch-diff.githubusercontent.com/gpu-mode/kernelbot-data/pull/1/commits/16567f949cecc27b946ff352d90f2323681e97a4 |
|
| https://patch-diff.githubusercontent.com/b9r5 |
| Updated extraction scripts for 2nd competition | https://patch-diff.githubusercontent.com/gpu-mode/kernelbot-data/pull/1/commits/5da7a0fa9b6de2153958d7ccebcd9da8c402419a |
| 5da7a0f | https://patch-diff.githubusercontent.com/gpu-mode/kernelbot-data/pull/1/commits/5da7a0fa9b6de2153958d7ccebcd9da8c402419a |
|
| https://patch-diff.githubusercontent.com/PaliC |
| Add deduplication logic | https://patch-diff.githubusercontent.com/gpu-mode/kernelbot-data/pull/1/commits/b39b2928ccedb2bf765e4f18c0ae154c0653ed66 |
| b39b292 | https://patch-diff.githubusercontent.com/gpu-mode/kernelbot-data/pull/1/commits/b39b2928ccedb2bf765e4f18c0ae154c0653ed66 |
|
| https://patch-diff.githubusercontent.com/PaliC |
| magic number removal | https://patch-diff.githubusercontent.com/gpu-mode/kernelbot-data/pull/1/commits/216d47a05f1a23eec88dfd8394d617dd0d838818 |
| 216d47a | https://patch-diff.githubusercontent.com/gpu-mode/kernelbot-data/pull/1/commits/216d47a05f1a23eec88dfd8394d617dd0d838818 |
|
| https://patch-diff.githubusercontent.com/PaliC |
| Add deduplicated datasets | https://patch-diff.githubusercontent.com/gpu-mode/kernelbot-data/pull/1/commits/89573b4b80107071d752612242951b02d8e571f2 |
| 89573b4 | https://patch-diff.githubusercontent.com/gpu-mode/kernelbot-data/pull/1/commits/89573b4b80107071d752612242951b02d8e571f2 |
| remove test | https://patch-diff.githubusercontent.com/gpu-mode/kernelbot-data/pull/1/commits/d32c79ca2855ea2d27af77e80f326db33a2d69e6 |
| d32c79c | https://patch-diff.githubusercontent.com/gpu-mode/kernelbot-data/pull/1/commits/d32c79ca2855ea2d27af77e80f326db33a2d69e6 |
| Merge branch 'dedup' of github.com:PaliC/kernelbot-data into dedup | https://patch-diff.githubusercontent.com/gpu-mode/kernelbot-data/pull/1/commits/0c1429b1630708ad51af478f40c9f3c0451d4cb2 |
| 0c1429b | https://patch-diff.githubusercontent.com/gpu-mode/kernelbot-data/pull/1/commits/0c1429b1630708ad51af478f40c9f3c0451d4cb2 |
| https://patch-diff.githubusercontent.com/PaliC |
| PaliC | https://patch-diff.githubusercontent.com/PaliC |
| Nov 30, 2025 | https://patch-diff.githubusercontent.com/gpu-mode/kernelbot-data/pull/1#issuecomment-3592203901 |
| update | https://patch-diff.githubusercontent.com/gpu-mode/kernelbot-data/pull/1/commits/9867a05da031f169ff0063b2d26706579e05ea6e |
| 9867a05 | https://patch-diff.githubusercontent.com/gpu-mode/kernelbot-data/pull/1/commits/9867a05da031f169ff0063b2d26706579e05ea6e |