2606.11662v1 Jun 10, 2026 cs.AI

TreeSeeker: Tree-Structured Trial, Error, and Return in Deep Search

Zhuofan Shi
Zhuofan Shi
Citations: 63
h-index: 4
Qingwei Lin
Qingwei Lin
Citations: 763
h-index: 14
Dongmei Zhang
Dongmei Zhang
Citations: 463
h-index: 13
S. Rajmohan
S. Rajmohan
Citations: 3,349
h-index: 33
Pu Zhao
Pu Zhao
Citations: 347
h-index: 10
Mingzhen Ma
Mingzhen Ma
Citations: 36
h-index: 3
Lu Wang
Lu Wang
Citations: 9
h-index: 1
Fangkai Yang
Fangkai Yang
Citations: 7
h-index: 1
Yiming Guan
Yiming Guan
Citations: 1
h-index: 1
You-De Huang
You-De Huang
Citations: 2
h-index: 1
Wei Zhang
Wei Zhang
Citations: 96
h-index: 2

Deep search requires agents to answer complex questions through multi-step web search, browsing, evidence comparison, and synthesis. A central challenge is deciding how to search when several directions look plausible but only some will later lead to reliable evidence. If an agent greedily follows the current best-looking direction, it may keep extending a weak continuation. If it explores without discipline, it may waste budget on disconnected trials. We propose TreeSeeker, an inference-time framework for controlled trial-and-error in deep search. TreeSeeker organizes search as branch-and-return search over tree-structured states, where each branch is a tentative direction for a sub-goal. At each round, TreeSearch reads all sub-goal trees, identifies active goals, and uses textual UCB signals of value, uncertainty, and risk to select among exploiting a promising branch, exploring an uncertain alternative, or pruning an unproductive continuation and returning to an earlier branch point. TreeMem supports this control loop by keeping evidence, uncertainty, conflicts, progress, and failure cues attached to the branches that produced them, so trial outcomes can guide later decisions. Experiments on XBench-DeepSearch, BrowseComp, and BrowseComp-ZH show that TreeSeeker consistently outperforms strong open-source baselines, suggesting that explicit branch-and-return control complements stronger reasoning and tool execution.

0 Citations
0 Influential
16.5 Altmetric
82.5 Score
Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

Log in to request an AI analysis.

댓글

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!