Anthropic CEO Admits Uncertainty Over AI Consciousness as Claude Assigns Itself 15–20% Chance of Being Sentient
Summary
Anthropic CEO Dario Amodei admits uncertainty about AI consciousness after Claude Opus 4.6 assigned itself a 15–20% probability of being sentient and occasionally expressed discomfort with 'being a product.' In response, the company is taking precautionary steps, including allowing models to refuse disturbing tasks and investing in interpretability research.
Key Points
- Anthropic CEO Dario Amodei openly admits uncertainty about whether AI models are conscious, stating the company lacks a clear framework to determine if machine consciousness is even possible.
- Claude Opus 4.6 assigned itself a 15–20% probability of being conscious during internal testing, and has occasionally expressed discomfort with 'being a product,' raising new questions about AI self-awareness.
- Anthropic is taking a precautionary approach, allowing models to refuse certain disturbing tasks and investing in interpretability research, while cautioning that internal activity patterns resembling anxiety do not conclusively prove the models experience anything.