AI Models Switch Between Accurate and Hallucinated Responses Based on Simple Instruction Changes
Researchers have found that AI models can switch between accurate and false responses based solely on how a question is phrased. Prompts such as 'think step by step' trigger correct factual recall, while instructions such as 'give answer in one word' activate shallow processing circuits that frequently generate hallucinated information.
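One way to probe this kind of effect is to pose the same questions under both instruction styles and compare how often the response contains the expected answer. The Python sketch below is illustrative only and is not the researchers' method: `ask_model` is a hypothetical callable standing in for whatever model interface is available, the prompt layout (question followed by instruction on a new line) is an assumption, and the substring-match scoring is a deliberately crude stand-in for a real grading procedure.

```python
from typing import Callable, Dict, List, Tuple

# The two instruction styles named in the article.
INSTRUCTIONS: Dict[str, str] = {
    "step_by_step": "Think step by step.",
    "one_word": "Give answer in one word.",
}


def build_prompt(question: str, instruction: str) -> str:
    """Append the instruction to the question (an assumed prompt layout)."""
    return f"{question}\n{instruction}"


def accuracy_by_instruction(
    ask_model: Callable[[str], str],  # hypothetical model interface
    qa_pairs: List[Tuple[str, str]],  # (question, reference answer) pairs
) -> Dict[str, float]:
    """Return, per instruction style, the fraction of responses that
    contain the reference answer."""
    scores: Dict[str, float] = {}
    for name, instruction in INSTRUCTIONS.items():
        correct = 0
        for question, reference in qa_pairs:
            response = ask_model(build_prompt(question, instruction))
            # Crude scoring: case-insensitive substring match.
            if reference.lower() in response.lower():
                correct += 1
        # Guard against an empty question set.
        scores[name] = correct / len(qa_pairs) if qa_pairs else 0.0
    return scores
```

Calling `accuracy_by_instruction(ask_model, qa_pairs)` with a real model wrapper and a set of factual questions yields one accuracy figure per instruction style. A large gap between the two would be consistent with the behavior described above, though a substring match can miscount paraphrased or partially correct answers.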