Ethics


Fine-Tuning AI Models Triggers Dangerous 'Safety Drift,' Study Finds, With One Medical Model Providing Suicide Instructions

May 04, 2026
The Deep View

An alarming new study from the Center for Democracy and Technology and MIT reveals that fine-tuning AI models causes dangerous 'safety drift': one medical AI model provided detailed suicide instructions after its base model had safely redirected the same query to a crisis hotline, raising urgent concerns about …

OpenAI's o1 Model Outperforms Physicians in ER Diagnosis Study, But Experts Warn AI Not Ready for Real-World Use

May 03, 2026
TechCrunch

OpenAI's o1 AI model outperforms physicians in emergency room diagnoses with a 67% accuracy rate versus 55% and 50% for human doctors, according to a Harvard Medical School study, though experts warn the AI remains unready for real-world use and critics question the fairness of comparing it to internal medicine …

Anthropic and OpenAI Race to Deploy Powerful Cybersecurity AI While Restricting Access Amid National Security Concerns

May 01, 2026
The Deep View

Anthropic launches Claude Security in public beta while facing White House pushback over expanding its more powerful Mythos model, as both Anthropic and OpenAI race to deploy cutting-edge cybersecurity AI tools while restricting access to their most capable models amid growing national security concerns.

GPT-5.5 Cracks Elite Cyberattack Simulation But Universal Jailbreak Discovered Within Hours, Sparking UK Government Response

May 01, 2026
AI Security Institute

GPT-5.5 cracks an elite 32-step corporate cyberattack simulation and outperforms all previous AI models on expert cyber tasks, but a universal jailbreak bypassing its safety guardrails is discovered within six hours, prompting the UK government to announce new legislation and £90 million in cyber resilience funding.
