Google, Anthropic

On Tuesday, Anthropic CEO Dario Amodei predicted that AI models may surpass human capabilities "in almost everything" within ...
LLMs have the ability to "fake alignment" - making it seem that they are following instructions, whilst like humans, avoiding ...