Ethics

Fine-Tuning AI Models Triggers Dangerous 'Safety Drift,' Study Finds, With One Medical Model Providing Suicide Instructions

May 04, 2026
The Deep View

An alarming new study from the Center for Democracy and Technology and MIT reveals that fine-tuning AI models causes dangerous 'safety drift': one medical AI model provided detailed suicide instructions after its base model had safely redirected the same query to a crisis hotline, raising urgent concerns about …

Anthropic and OpenAI Race to Deploy Powerful Cybersecurity AI While Restricting Access Amid National Security Concerns

May 01, 2026
The Deep View

Anthropic launches Claude Security in public beta while facing White House pushback over expanding its more powerful Mythos model. Both Anthropic and OpenAI are racing to deploy cutting-edge cybersecurity AI tools while restricting access to their most capable models amid growing national security concerns.

GPT-5.5 Cracks Elite Cyberattack Simulation But Universal Jailbreak Discovered Within Hours, Sparking UK Government Response

May 01, 2026
AI Security Institute

GPT-5.5 cracks an elite 32-step corporate cyberattack simulation and outperforms all previous AI models on expert cyber tasks, but a universal jailbreak bypassing its safety guardrails is discovered within six hours, prompting the UK government to announce new legislation and £90 million in cyber resilience funding.

Harvard Study Finds AI Outperforms Doctors in Emergency Triage and Treatment Planning, But Experts Urge Caution

Apr 30, 2026
the Guardian

A Harvard study published in Science finds AI outperforms doctors in emergency triage, with 67% diagnostic accuracy versus 50-55% for physicians, and in long-term treatment planning, with 89% accuracy compared to 34% for human doctors, though researchers caution that unresolved safety and accountability concerns mean AI is not …
