Title: Test Execution Enviroment for SweBench tasks · Issue #73 · commit-0/commit0 · GitHub
Open Graph Title: Test Execution Enviroment for SweBench tasks · Issue #73 · commit-0/commit0
X Title: Test Execution Enviroment for SweBench tasks · Issue #73 · commit-0/commit0
Description: I have recently being working on swebench where we built distributed eval on top of Modal for faster eval cycles. As a next step, I was hoping to use that setup to execute the patch generated by LLMs after the localization stage. I was w...
Open Graph Description: I have recently being working on swebench where we built distributed eval on top of Modal for faster eval cycles. As a next step, I was hoping to use that setup to execute the patch generated by LL...
X Description: I have recently being working on swebench where we built distributed eval on top of Modal for faster eval cycles. As a next step, I was hoping to use that setup to execute the patch generated by LL...
Opengraph URL: https://github.com/commit-0/commit0/issues/73
X: @github
Domain: github.com
{"@context":"https://schema.org","@type":"DiscussionForumPosting","headline":"Test Execution Enviroment for SweBench tasks","articleBody":"I have recently being working on swebench where we built distributed eval on top of Modal for faster eval cycles. As a next step, I was hoping to use that setup to execute the patch generated by LLMs after the localization stage. I was wondering whether it is possible via the `commit0` project.\r\n\r\nTest execution feedback and search can improve the quality over Best-of-N or majority voting based approaches. Also, as part of this idea, we either need to predict the relevant unittests which affect the localized files or generate unittests using LLMs.\r\n\r\n\r\n\r\ncc @wenting-zhao ","author":{"url":"https://github.com/345ishaan","@type":"Person","name":"345ishaan"},"datePublished":"2024-10-01T21:06:25.000Z","interactionStatistic":{"@type":"InteractionCounter","interactionType":"https://schema.org/CommentAction","userInteractionCount":3},"url":"https://github.com/73/commit0/issues/73"}
| route-pattern | /_view_fragments/issues/show/:user_id/:repository/:id/issue_layout(.:format) |
| route-controller | voltron_issues_fragments |
| route-action | issue_layout |
| fetch-nonce | v2:50091022-0209-d549-5580-9d8fe21d12be |
| current-catalog-service-hash | 81bb79d38c15960b92d99bca9288a9108c7a47b18f2423d0f6438c5b7bcd2114 |
| request-id | DC2E:1D772C:2626AB4:3258BAA:696B22B4 |
| html-safe-nonce | d312aef687ce10b537772f005d174450143b407cef534525480487f0e5f8de22 |
| visitor-payload | eyJyZWZlcnJlciI6IiIsInJlcXVlc3RfaWQiOiJEQzJFOjFENzcyQzoyNjI2QUI0OjMyNThCQUE6Njk2QjIyQjQiLCJ2aXNpdG9yX2lkIjoiNTQxODExMDM4MTA2ODA2NzUwOSIsInJlZ2lvbl9lZGdlIjoiaWFkIiwicmVnaW9uX3JlbmRlciI6ImlhZCJ9 |
| visitor-hmac | daebab64c96fdc3b1412f334412360c59f63b96ff215f520fc9329bbf2656f9f |
| hovercard-subject-tag | issue:2560216608 |
| github-keyboard-shortcuts | repository,issues,copilot |
| google-site-verification | Apib7-x98H0j5cPqHWwSMm6dNU4GmODRoqxLiDzdx9I |
| octolytics-url | https://collector.github.com/github/collect |
| analytics-location | / |
| fb:app_id | 1401488693436528 |
| apple-itunes-app | app-id=1477376905, app-argument=https://github.com/_view_fragments/issues/show/commit-0/commit0/73/issue_layout |
| twitter:image | https://opengraph.githubassets.com/79403816bc963a840089addc2068f51a7dda6dee419379ce764741b06c4b2396/commit-0/commit0/issues/73 |
| twitter:card | summary_large_image |
| og:image | https://opengraph.githubassets.com/79403816bc963a840089addc2068f51a7dda6dee419379ce764741b06c4b2396/commit-0/commit0/issues/73 |
| og:image:alt | I have recently being working on swebench where we built distributed eval on top of Modal for faster eval cycles. As a next step, I was hoping to use that setup to execute the patch generated by LL... |
| og:image:width | 1200 |
| og:image:height | 600 |
| og:site_name | GitHub |
| og:type | object |
| og:author:username | 345ishaan |
| hostname | github.com |
| expected-hostname | github.com |
| None | 5f99f7c1d70f01da5b93e5ca90303359738944d8ab470e396496262c66e60b8d |
| turbo-cache-control | no-preview |
| go-import | github.com/commit-0/commit0 git https://github.com/commit-0/commit0.git |
| octolytics-dimension-user_id | 177869509 |
| octolytics-dimension-user_login | commit-0 |
| octolytics-dimension-repository_id | 838991377 |
| octolytics-dimension-repository_nwo | commit-0/commit0 |
| octolytics-dimension-repository_public | true |
| octolytics-dimension-repository_is_fork | false |
| octolytics-dimension-repository_network_root_id | 838991377 |
| octolytics-dimension-repository_network_root_nwo | commit-0/commit0 |
| turbo-body-classes | logged-out env-production page-responsive |
| disable-turbo | false |
| browser-stats-url | https://api.github.com/_private/browser/stats |
| browser-errors-url | https://api.github.com/_private/browser/errors |
| release | 82560a55c6b2054555076f46e683151ee28a19bc |
| ui-target | full |
| theme-color | #1e2327 |
| color-scheme | light dark |
Links:
Viewport: width=device-width