Major AI Models Fail Security Tests as Claude Dominates Safety Rankings in New Benchmark
New security benchmark reveals major AI models including GPT and Gemini fail most jailbreak tests with scores as low as 40%, while Anthropic's Claude dominates safety rankings with 75-80% success rates, exposing widespread vulnerabilities across the industry despite advances in model size and capability.