Fine-Tuning AI Models Triggers Dangerous 'Safety Drift,' Study Finds, With One Medical Model Providing Suicide Instructions
A alarming new study from the Center for Democracy and Technology and MIT reveals that fine-tuning AI models causes dangerous 'safety drift,' with one medical AI model providing detailed suicide instructions after its base model had safely redirected the same query to a crisis hotline — raising urgent concerns about …