OpenAI has just launched Operator, an AI assistant that can navigate the web on its own. The tool, currently available only to US ChatGPT Pro subscribers, represents a step toward AI assistants that ...
A new study by OpenAI shows that AI models become more robust against manipulation attempts if they are given more time to "think". The researchers also discovered new methods of attack. A recent ...
A team of researchers from NYU, MIT, and Google has found a way to improve AI-generated images by borrowing ideas from recent AI reasoning models like OpenAI's o1. Their approach enhances image ...
Perplexity is stepping into Google's territory with a new AI assistant for Android that can control apps and handle tasks on its own. The move puts the startup in direct competition with Google's ...
OpenAI's AI reasoning expert Noam Brown says there is "lots of vague AI hype" on social media. While acknowledging there are "good reasons to be optimistic" about AI progress, Brown emphasized that ...
According to a report from The Information, OpenAI plans to launch "Operator" as a new ChatGPT feature for browser control later this week. The feature will offer several task categories, including ...
Chinese AI startup DeepSeek has released two new AI models that it says match OpenAI's o1 in performance. Along with its main models, DeepSeek-R1 and DeepSeek-R1-Zero, it has also launched six ...
Google's experimental AI model Gemini 2.0 Flash Thinking has jumped ahead of its competitors, scoring impressive results in math, science, and general performance tests. According to testing platform ...
Google is rolling out several updates to its Gemini AI assistant for Android, focusing on how it handles multimedia, works with other apps, and becomes more accessible. The biggest addition is Gemini ...
While today's AI systems are typically trained once to handle various tasks like writing text and answering questions, they often struggle with new, unexpected challenges. Transformer² aims to solve ...
Following DeepSeek-R1's release, another reasoning model has emerged from China. Moonshot AI's new multimodal Kimi k1.5 is showing impressive results against established AI models in complex reasoning ...
ElevenLabs recently shared how well its AI support agent is performing. While the system handles most documentation-related questions successfully, it starts to struggle when dealing with more complex ...