r/LocalLLaMA · rss · EN · 16:35 · 05·27
SWE-rebench Leaderboard Update: GPT-5.5, Opus 4.7, Cursor, Kimi K2.6, and More
SWE-rebench updated its leaderboard with 110 new Python tasks from GitHub PRs created in March, April, and part of May 2026, using the SWE-bench setup where models read issues, edit code, run tests, and must pass the full test suite.
#Code #Benchmarking #SWE-rebench #GPT-5.5
why featured
All three HKR checks (hook, knowledge, resonance) pass, but the post gives only the task count and the months covered; model scores, margins, and reproducibility details are not disclosed. A single Reddit leaderboard post stays in "all", below featured.
editor take
SWE-rebench claims 110 new Python tasks; Reddit returns a 403 for the post body, so GPT-5.5's ranking and pass rates remain unverifiable.
HKR breakdown
hook ✓ knowledge ✓ resonance ✓