tokens – Weekly Geek

OpenAI sidesteps Nvidia with unusually fast coding model on plate-sized chips

But 1,000 tokens per second is actually modest by Cerebras standards. The company has measured 2,100 tokens per second on Llama 3.1 70B and reported 3,000 tokens per second on OpenAI’s own open-weight gpt-oss-120B model, suggesting that Codex-Spark’s comparatively lower speed reflects the overhead of a larger or more complex model. AI coding agents have…

OpenAI introduces GPT-4 Turbo: Larger memory, lower cost, new knowledge

On Monday at the OpenAI DevDay event, company CEO Sam Altman announced a major update to its GPT-4 language model called GPT-4 Turbo, which can process a much larger amount of text than GPT-4 and features a knowledge cutoff of April 2023. He also introduced APIs for DALL-E 3, GPT-4 Vision,…