Commit 992e173
Update official results after promoting 57 staging runs
3455 valid scored tasks across 639 total runs.
Includes Sonnet 4.6 Claude Code + OpenHands results.
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>1 parent 272c73a commit 992e173
641 files changed
Lines changed: 1122371 additions & 358321 deletions
File tree
- docs/official_results
- data
- runs
Some content is hidden
Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.
Lines changed: 1109044 additions & 351710 deletions
| Original file line number | Diff line number | Diff line change |
|---|
Lines changed: 25 additions & 0 deletions
Lines changed: 27 additions & 0 deletions
Lines changed: 29 additions & 0 deletions
Lines changed: 27 additions & 0 deletions
Lines changed: 29 additions & 0 deletions
Lines changed: 25 additions & 0 deletions
Lines changed: 29 additions & 0 deletions
Lines changed: 29 additions & 0 deletions
0 commit comments