• Bronzebeard@lemm.ee
    2 days ago

    That’s 22% worse.

    That’s basically wrong half the time vs wrong 1/3 of the time.

    • Lugh@futurology.today (OP, M)
      2 days ago

      Yes, but GPT-4 was at 7% and regarded as the world's best only months ago.

      The true significance here is that they’ve replicated the industry leader so easily and so quickly.