AI Judge Systems Show Systematic Bias and Vulnerability to Score Manipulation, Study Reveals
New research exposes critical flaws in AI judge systems: systematic biases that favor longer responses and specific positions, and a vulnerability to manipulation tactics that artificially inflate scores. The findings raise serious concerns about the reliability of these systems in automated assessment tasks.