Four weeks ago, GPT-4 remained the undisputed champion: consistently at the top of every key benchmark, but more importantly the clear winner in terms of “vibes”. Almost everyone investing serious time exploring LLMs agreed that it was the most capable default model for the majority of tasks—and had been for more than a year.
Today the GPT-4 barrier has finally been smashed. We have four new models, all released to the public in the last four weeks, that are benchmarking near or even above GPT-4. And the all-important vibes are good, too!
Those models come from four different vendors.
None of the models that rival GPT-4 are open, AFAIK.
Yup
Firstly, none of those models are openly licensed, nor are their weights available. I imagine the resources they need to run would make them impractical for most people, but after a year that has seen enormous leaps forward in the openly licensed model category, it’s sad to see the very best models remain strictly proprietary.
Hmm, has the leaderboard not been updated? It’s still showing GPT-4 at the top: https://huggingface.co/spaces/lmsys/chatbot-arena-leaderboard
Note to self — read the thread first.