2606.05922v1 Jun 04, 2026 cs.AI

Retrospective Harness Optimization: Improving LLM Agents via Self-Preference over Trajectory Rollouts

Xianfeng Tang
Xianfeng Tang
Citations: 714
h-index: 11
Jingying Zeng
Jingying Zeng
Citations: 172
h-index: 8
Xiaohua Jia
Xiaohua Jia
Citations: 73
h-index: 4
Shujie Liu
Shujie Liu
Citations: 312
h-index: 8
Wenbo Pan
Wenbo Pan
City University of Hong Kong
Citations: 153
h-index: 5
Xiangyang Zhou
Xiangyang Zhou
Citations: 41
h-index: 2
Yan Lu
Yan Lu
Citations: 2
h-index: 1
Chin-Yew Lin
Chin-Yew Lin
Citations: 1,508
h-index: 7

AI agents rely on a harness of skills, tools, and workflows to solve complex problems. Continually improving this harness is essential for adapting to new tasks. However, existing optimization methods typically require ground-truth validation sets, yet such labeled data is difficult to acquire in practical deployment settings. To address this problem, we introduce Retrospective Harness Optimization (RHO), a self-supervised method that optimizes the agent harness using only past trajectories. Specifically, RHO selects a diverse coreset of challenging tasks from past trajectories and re-solves them in parallel. The agent analyzes these rollouts using self-validation and self-consistency, then generates candidate harness updates and selects the most effective one by its own pairwise self-preference. We evaluate RHO across three diverse domains, spanning software engineering, technical work, and knowledge work. Notably, a single optimization round improves the pass rate on SWE-Bench Pro from 59% to 78% without any external grading. Furthermore, our analysis demonstrates that RHO effectively targets prior failure modes. As a result, the optimized harness alters the agent's behavior patterns and sustains higher accuracy during long-horizon sessions.

0 Citations
0 Influential
5.5 Altmetric
27.5 Score
Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

Log in to request an AI analysis.

댓글

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!