site:the-decoder.com - Search News

OpenAI's Operator and Computer-Using Agent bring autonomous AI agents closer to reality

OpenAI has just launched Operator, an AI assistant that can navigate the web on its own. The tool, currently only available to US ChatGPT Pro subscribers, represents a step toward AI assistants that ...

the-decoder3d

"Nerd sniping" and "think less" attacks emerge as AI models get more time to reason

A new study by OpenAI shows that AI models become more robust against manipulation attempts if they are given more time to "think". The researchers also discovered new methods of attack. A recent ...

the-decoder3d

AI image generation gets a boost by borrowing ideas from reasoning models

A team of researchers from NYU, MIT, and Google has found a way to improve AI-generated images by borrowing ideas from recent AI reasoning models like OpenAI's o1. Their approach enhances image ...

the-decoder3d

Perplexity announces new assistant for Android smartphones

Perplexity is stepping into Google's territory with a new AI assistant for Android that can control apps and handle tasks on its own. The move puts the startup in direct competition with Google's ...

the-decoder6d

OpenAI CEO Altman tells followers to "chill and cut expectations 100x" amid AGI hype

OpenAI's AI reasoning expert Noam Brown says there is "lots of vague AI hype" on social media. While acknowledging there are "good reasons to be optimistic" about AI progress, Brown emphasized that ...

the-decoder4d

OpenAI reportedly launching ChatGPT's first browser agent "Operator" this week

According to a report from The Information, OpenAI plans to launch "Operator" as a new ChatGPT feature for browser control later this week. The feature will offer several task categories, including ...

the-decoder6d

DeepSeek's latest R1-Zero model matches OpenAI's o1 in reasoning benchmarks

Chinese AI startup DeepSeek has released two new AI models that they say match OpenAI's o1 in performance. Along with their main models, DeepSeek-R1 and DeepSeek-R1-Zero, they've also launched six ...

the-decoder4d

Gemini 2.0 Flash Thinking: Google's smallest model takes lead in Chatbot Arena

Google's experimental AI model Gemini 2.0 Flash Thinking has jumped ahead of its competitors, scoring impressive results in math, science, and general performance tests. According to testing platform ...

the-decoder4d

Google's Gemini AI inches closer to becoming a virtual agent with multi-app integration

Google is rolling out several updates to its Gemini AI assistant for Android, focusing on how it handles multimedia, works with other apps, and becomes more accessible. The biggest addition is Gemini ...

the-decoder6d

Sakana AI's Transformer² is a new approach to help language models learn

While today's AI systems are typically trained once to handle various tasks like writing text and answering questions, they often struggle with new, unexpected challenges. Transformer² aims to solve ...

the-decoder5d

Moonshot AI unveils Kimi k1.5, China's next o1 competitor

Following DeepSeek-R1's release, another reasoning model has emerged from China. Moonshot AI's new multimodal Kimi k1.5 is showing impressive results against established AI models in complex reasoning ...

the-decoder4d

ElevenLabs shares early results from its AI support agent

ElevenLabs recently shared how well its AI support agent is performing. While the system handles most documentation-related questions successfully, it starts to struggle when dealing with more complex ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results