Tag Archives: Chatbot Arena

Before launching, GPT-4o broke records on chatbot leaderboard under a secret name

Getty Images reader comments 31 On Monday, OpenAI employee William Fedus confirmed on X that a mysterious chat-topping AI chatbot known as “gpt-chatbot” that had been undergoing testing on LMSYS’s Chatbot Arena and frustrating experts was, in fact, OpenAI’s newly announced GPT-4o AI model. He also revealed that GPT-4o had topped the Chatbot Arena leaderboard,… Read More »

Mysterious “gpt2-chatbot” AI model appears suddenly, confuses experts

reader comments 25 On Sunday, word began to spread on social media about a new mystery chatbot named “gpt2-chatbot” that appeared in the LMSYS Chatbot Arena. Some people speculate that it may be a secret test version of OpenAI’s upcoming GPT-4.5 or GPT-5 large language model (LLM). The paid version of ChatGPT is currently powered… Read More »

Words are flowing out like endless rain: Recapping a busy week of LLM news

Enlarge / An image of a boy amazed by flying letters. reader comments 17 Some weeks in AI news are eerily quiet, but during others, getting a grip on the week’s events feels like trying to hold back the tide. This week has seen three notable large language model (LLM) releases: Google Gemini Pro 1.5… Read More »

“The king is dead”—Claude 3 surpasses GPT-4 on Chatbot Arena for the first time

reader comments 45 On Tuesday, Anthropic’s Claude 3 Opus large language model (LLM) surpassed OpenAI’s GPT-4 (which powers ChatGPT) for the first time on Chatbot Arena, a popular crowdsourced leaderboard used by AI researchers to gauge the relative capabilities of AI language models. “The king is dead,” tweeted software developer Nick Dobos in a post… Read More »