Back to News
Advertisement
Advertisement

⚡ Community Insights

Discussion Sentiment

100% Positive

Analyzed from 76 words in the discussion.

Trending Topics

#opus#mini#swe#agent#claude#code#gpt#harness#performance#interesting

Discussion (2 Comments)Read Original on HackerNews

mjzhabout 2 hours ago
interesting, per https://cadbench.ai/leaderboard, gpt5.5 is the best, not the opus 4.7, why opus 4.7 is with mini-swe-agent, not claude code.
handcraftedabout 2 hours ago
GPT-5.5 and Opus 4.7 are comparable when using the same harness mini-swe-agent. GPT-5.5 demonstrates a significant performance delta only when integrated with the Codex module. We hypothesize that the superior performance of Opus 4.7 on mini-swe-agent relative to the more complex Claude Code harness stems from the tight feedback loop (edit-run-check), well suited for the CAD generation task.