AI Language Models Fail to Distinguish Belief from Knowledge in Major Study of 24 Systems
New research testing 24 advanced AI language models finds that they systematically fail to distinguish belief from knowledge. Accuracy plummets from over 90% to as low as 14.4% when the models process first-person false beliefs, exposing critical flaws in their reasoning capabilities.