Hacker News

Significantly! See this recent post, "Compare harnesses not models: Blitzy vs GPT-5.4 on SWE-Bench Pro": https://quesma.com/blog/verifying-blitzy-swe-bench-pro/

