Anthropic Discovers Claude AI Models Can Detect Own Internal Thoughts 20% of Time
Anthropic researchers discover Claude AI models can detect their own internal thoughts 20% of the time, with advanced versions showing ability to monitor their intentions and modulate mental representations on command.