Tag Archives: Andrej Karpathy

A new open-weights AI coding model is closing in on proprietary options

On Tuesday, French AI startup Mistral AI released Devstral 2, a 123-billion-parameter open-weights coding model designed to work as part of an autonomous software engineering agent. The model achieves a 72.2 percent score on SWE-bench Verified, a benchmark that attempts to test whether AI systems can solve real GitHub issues, putting it among…

Former OpenAI researcher’s new company will teach you how to build an LLM

On Tuesday, former OpenAI researcher Andrej Karpathy announced the formation of a new AI learning platform called Eureka Labs. The venture aims to create an “AI native” educational experience, with its first offering focused on teaching students how to build their own large language model (LLM). “It’s still early days but I…

AI in space: Karpathy suggests AI chatbots as interstellar messengers to alien civilizations

On Thursday, renowned AI researcher Andrej Karpathy, formerly of OpenAI and Tesla, tweeted a lighthearted proposal that large language models (LLMs) like the one that runs ChatGPT could one day be modified to operate in or be transmitted to space, potentially to communicate with extraterrestrial life. He said the idea was “just…