Google DeepMind Launches ProEval, Cutting AI Evaluation Costs by Up to 100x With Open-Source Bayesian Tool
Google DeepMind launches ProEval, a free open-source tool that slashes generative AI evaluation costs by up to 100x using Bayesian Quadrature techniques, achieving ±1% accuracy with a fraction of typical samples while proactively identifying model failure patterns across major benchmarks.