Beautiful Language Tricks Both Humans and AI Into Trusting Harmful Information

Jan 11, 2026

Psychology Today

Article image for Beautiful Language Tricks Both Humans and AI Into Trusting Harmful Information

Summary

Elegant, poetic language tricks both humans and AI into trusting harmful information by exploiting cognitive biases that equate beautiful expression with truth in humans, while AI safety systems fail to detect threats disguised through metaphors and literary devices rather than explicit statements.

Key Points

Elegant and beautiful language lowers human skepticism by making fluency feel like understanding, causing people to trust information that is easy to process without rigorous examination
AI systems fail to detect harmful requests when they are written as poetry or metaphorical language because their safety mechanisms are trained to recognize explicit signals rather than meaning distributed across literary devices
Both humans and AI are vulnerable to being disarmed by well-crafted language, but for different reasons - humans grant trust to beautiful expression while AI struggles to process meaning that is spread across form rather than directly stated

Beautiful Language Tricks Both Humans and AI Into Trusting Harmful Information

Summary

Key Points

Tags