Tag Archives: gemini

OpenAI releases new simulated reasoning models with full tool access

On Wednesday, OpenAI announced the release of two new models—o3 and o4-mini—that combine simulated reasoning capabilities with access to functions like web browsing and coding. These models mark the first time OpenAI’s reasoning-focused models can use every ChatGPT tool simultaneously, including visual analysis and image generation. OpenAI announced o3 in December, and until now, only… Read More »

Gemini hackers can deliver more potent attacks with a helping hand from… Gemini

The resulting dataset, which reflected a distribution of attack categories similar to the complete dataset, showed an attack success rate of 65 percent and 82 percent against Gemini 1.5 Flash and Gemini 1.0 Pro, respectively. By comparison, attack baseline success rates were 28 percent and 43 percent. Success rates for ablation, where only effects of… Read More »

Farewell Photoshop? Google’s new AI lets you edit images by asking.

Multimodal output opens up new possibilities Having true multimodal output opens up interesting new possibilities in chatbots. For example, Gemini 2.0 Flash can play interactive graphical games or generate stories with consistent illustrations, maintaining character and setting continuity throughout multiple images. It’s far from perfect, but character consistency is a new capability in AI assistants.… Read More »

Google and Meta update their AI models amid the rise of “AlphaChip”

Enlarge / There’s been a lot of AI news this week, and covering it sometimes feels like running through a hall full of danging CRTs, just like this Getty Images illustration. reader comments 8 It’s been a wildly busy week in AI news thanks to OpenAI, including a controversial blog post from CEO Sam Altman,… Read More »

Google’s “AI Overview” can give false, misleading, and dangerous answers

Enlarge / This is fine. Getty Images reader comments 80 If you use Google regularly, you may have noticed the company’s new AI Overviews providing summarized answers to some of your questions in recent days. If you use social media regularly, you may have come across many examples of those AI Overviews being hilariously or even… Read More »

Google strikes back at OpenAI with “Project Astra” AI agent prototype

Enlarge / A video still of Project Astra demo at the Google I/O conference keynote in Mountain View on May 14, 2024. reader comments 28 Just one day after OpenAI revealed GPT-4o, which it bills as being able to understand what’s taking place in a video feed and converse about it, Google announced Project Astra,… Read More »

Words are flowing out like endless rain: Recapping a busy week of LLM news

Enlarge / An image of a boy amazed by flying letters. reader comments 17 Some weeks in AI news are eerily quiet, but during others, getting a grip on the week’s events feels like trying to hold back the tide. This week has seen three notable large language model (LLM) releases: Google Gemini Pro 1.5… Read More »

Apple may hire Google to power new iPhone AI features using Gemini—report

Benj Edwards reader comments 72 On Monday, Bloomberg reported that Apple is in talks to license Google’s Gemini model to power AI features like Siri in a future iPhone software update coming later in 2024, according to people familiar with the situation. Apple has also reportedly conducted similar talks with ChatGPT maker OpenAI. The potential… Read More »

Google’s hidden AI diversity prompts lead to outcry over historically inaccurate images

Enlarge / Generations from Gemini AI from the prompt, “Paint me a historically accurate depiction of a medieval British king.” reader comments 223 On Thursday morning, Google announced it was pausing its Gemini AI image-synthesis feature in response to criticism that the tool was inserting diversity into its images in a historically inaccurate way, such… Read More »

Google goes “open AI” with Gemma, a free, open-weights chatbot family

reader comments 27 On Wednesday, Google announced a new family of AI language models called Gemma, which are free, open-weights models built on technology similar to the more powerful but closed Gemini models. Unlike Gemini, Gemma models can run locally on a desktop or laptop computer. It’s Google’s first significant open large language model (LLM)… Read More »