Tag Archives: GPU

AMD unveils powerful new AI chip to challenge Nvidia

On Thursday, AMD announced its new MI325X AI accelerator chip, which is set to roll out to data center customers in the fourth quarter of this year. At an event hosted in San Francisco, the company claimed the new chip offers “industry-leading” performance compared to Nvidia’s current H200 GPUs, which are widely used in data…

Researchers upend AI status quo by eliminating matrix multiplication in LLMs

Researchers claim to have developed a new way to run AI language models more efficiently by eliminating matrix multiplication from the process. This fundamentally redesigns neural network operations that are currently accelerated by GPU chips. The findings, detailed in a recent…

Nvidia jumps ahead of itself and reveals next-gen “Rubin” AI chips in keynote tease

On Sunday, Nvidia CEO Jensen Huang reached beyond Blackwell and revealed the company’s next-generation AI-accelerating GPU platform during his keynote at Computex 2024 in Taiwan. Huang also detailed plans for an annual…

Nvidia unveils Blackwell B200, the “world’s most powerful chip” designed for AI

On Monday, Nvidia unveiled the Blackwell B200 tensor core chip (the company’s most powerful single-chip GPU, with 208 billion transistors), which Nvidia claims can reduce AI inference operating costs (such as running ChatGPT) and energy consumption by up to…

Nvidia’s “Chat With RTX” is a ChatGPT-style app that runs on your own GPU

On Tuesday, Nvidia released Chat With RTX, a free personalized AI chatbot similar to ChatGPT that can run locally on a PC with an Nvidia RTX graphics card. It uses Mistral or Llama open-weights LLMs and can search through local files and answer questions about them. Chat With RTX works on Windows…

Nvidia CEO calls for “Sovereign AI” as his firm overtakes Amazon in market value

On Monday, Nvidia CEO Jensen Huang said that every country should control its own AI infrastructure so it can protect its culture, Reuters reports. He called this concept “Sovereign AI,” which an Nvidia blog post defined as each country owning “the production of their own intelligence.” Huang made…

Nvidia introduces the H200, an AI-crunching monster GPU that may speed up ChatGPT

On Monday, Nvidia announced the HGX H200 Tensor Core GPU, which utilizes the Hopper architecture to accelerate AI applications. It’s a follow-up of…

US surprises Nvidia by speeding up new AI chip export ban

On Tuesday, chip designer Nvidia announced in an SEC filing that new US export restrictions on its high-end AI GPU chips to China are now in effect sooner than expected, according to a report from Reuters. The curbs were…

Hungry for AI? New supercomputer contains 16 dinner-plate-size chips

On Monday, Cerebras Systems unveiled its 13.5 million core Andromeda AI supercomputer for deep learning, reports Reuters. According to Cerebras, Andromeda delivers over 1 exaflop (1 quintillion operations per second) of AI computational power…

Nvidia’s powerful H100 GPU will ship in October

At today’s GTC conference keynote, Nvidia announced that its H100 Tensor Core GPU is in full production and that tech partners such as Dell, Lenovo, Cisco, Atos, Fujitsu,…