OpenAI still leads in agentic terminal coding, but by less.

Claude can plan the work and then run hundreds of parallel subagents in a single session (and with Opus 4.8, the agents can run for even longer).

That’s one way to turn profitable before the IPO, I guess. Goodbye tokens.

  • Echo Dot@feddit.uk
    7 hours ago

    Yeah, it’s interesting as long as you can completely disregard all of the negative impacts, but if you do that, I would argue you’re not assessing the technology in a fair manner.

    The Turing test was also designed back in the day when a computer was just a big box in a room. An AI passing the Turing test is just something to throw at the media; it’s not a meaningful experiment. The Apple II was able to pass the Turing test.

    • unpossum@sh.itjust.worksOP
      2 hours ago

      I’m sorry, but I don’t agree with your first point at all. Things can have negative sides and still be interesting.

      The Turing test, as I interpret it at least, is more of a philosophical than a technical thing, trying to provide a way to evaluate the thinking ability of someone or -thing without being able to look at its innards. I’ve always found it fascinating, but I can understand if people disagree (just don’t drag the Chinese room into it). However, if you don’t think a conversation with Claude is more interesting than a faux psychiatrist session with ELIZA, I don’t know where we could go from there 🤷