OpenAI still leads in agentic terminal coding, but by less.

Claude can plan the work and then run hundreds of parallel subagents in a single session (and with Opus 4.8, the agents can run for even longer)

That’s one way to turn profitable before the IPO, I guess. Goodbye tokens.

  • unpossum@sh.itjust.worksOP
    link
    fedilink
    English
    arrow-up
    5
    arrow-down
    3
    ·
    2 days ago

    Gpt 5.4 xhigh isn’t too bad for automated reviews and the like, and 5.5 is fairly efficient for interactive coding. I prefer those to Claude and opus, the Anthropic models feel like they’re trying to hard to be human to me, but that’s personal preference I guess.

    Yeah, it’s not free (or the free models aren’t good enough), but the consensus at work is that this is a potential game changer, and we need to experiment to see what works and what doesn’t. So, the budget is there until things settle, and afterwards if things work out.