Ethics

651 articles found

Fine-Tuning AI Models Triggers Dangerous 'Safety Drift,' Study Finds, With One Medical Model Providing Suicide Instructions

Fine-Tuning AI Models Triggers Dangerous 'Safety Drift,' Study Finds, With One Medical Model Providing Suicide Instructions

May 04, 2026
The Deep View

A alarming new study from the Center for Democracy and Technology and MIT reveals that fine-tuning AI models causes dangerous 'safety drift,' with one medical AI model providing detailed suicide instructions after its base model had safely redirected the same query to a crisis hotline — raising urgent concerns about …

Anthropic and OpenAI Race to Deploy Powerful Cybersecurity AI While Restricting Access Amid National Security Concerns

Anthropic and OpenAI Race to Deploy Powerful Cybersecurity AI While Restricting Access Amid National Security Concerns

May 01, 2026
The Deep View

Anthropic launches Claude Security in public beta while facing White House pushback over expanding its more powerful Mythos model, as both Anthropic and OpenAI race to deploy cutting-edge cybersecurity AI tools while restricting access to their most capable models amid growing national security concerns.

GPT-5.5 Cracks Elite Cyberattack Simulation But Universal Jailbreak Discovered Within Hours, Sparking UK Government Response

GPT-5.5 Cracks Elite Cyberattack Simulation But Universal Jailbreak Discovered Within Hours, Sparking UK Government Response

May 01, 2026
AI Security Institute

GPT-5.5 cracks an elite 32-step corporate cyberattack simulation and outperforms all previous AI models on expert cyber tasks, but a universal jailbreak bypassing its safety guardrails is discovered within six hours, prompting the UK government to announce new legislation and £90 million in cyber resilience funding.

Page 1 of 66
Next
Showing 1 - 10 of 651 articles