Open-Source AI Search Agent Harness-1 Outperforms GPT-5.4 on Recall Benchmarks Despite Training on Fraction of Rival Data

Jun 09, 2026

Venturebeat

Article image for Open-Source AI Search Agent Harness-1 Outperforms GPT-5.4 on Recall Benchmarks Despite Training on Fraction of Rival Data

Summary

Researchers from UIUC, UC Berkeley, and Chroma unveil Harness-1, a 20-billion parameter open-source AI search agent that outperforms GPT-5.4 on recall benchmarks while training on a fraction of rival data, achieving 73% recall by offloading memory management to a structured external environment.

Key Points

Researchers from UIUC, UC Berkeley, and Chroma unveil Harness-1, a 20-billion parameter open-source AI search agent that scores 73% on recall benchmarks, outperforming GPT-5.4 at 70.9% and beating the next best open-source competitor by 11.4 percentage points.
Harness-1 achieves its breakthrough performance by externalizing search state into a structured 'harness' environment that manages working memory, evidence curation, and verification records, freeing the model from the cognitive overload of tracking everything in its own context window.
Trained on just 899 supervised fine-tuning trajectories and 3,453 reinforcement learning queries, Harness-1 demonstrates radical data efficiency compared to rivals requiring hundreds of thousands of training items, and is released under the permissive Apache 2.0 license for immediate commercial use.

Open-Source AI Search Agent Harness-1 Outperforms GPT-5.4 on Recall Benchmarks Despite Training on Fraction of Rival Data

Summary

Key Points

Tags