Beautiful Language Tricks Both Humans and AI Into Trusting Harmful Information
Elegant, poetic language tricks both humans and AI into trusting harmful information. In humans, it exploits cognitive biases that equate beautiful expression with truth; AI safety systems, meanwhile, fail to detect threats that are disguised in metaphors and other literary devices rather than stated explicitly.