2605.05191v1 May 06, 2026 cs.AI

LongSeeker: 장기 예측 검색 에이전트를 위한 탄력적 컨텍스트 조율

LongSeeker: Elastic Context Orchestration for Long-Horizon Search Agents

Jiajun Wang

Citations: 7

h-index: 1

Yijun Lu

Citations: 37

h-index: 3

Siheng Chen

Citations: 317

h-index: 10

Yuwen Du

Citations: 89

h-index: 5

Rui Ye

Citations: 719

h-index: 12

Songhua Liu

Citations: 761

h-index: 12

장기 예측 검색 에이전트는 추론, 도구 사용 및 정보 관찰 과정에서 빠르게 증가하는 작업 컨텍스트를 관리해야 합니다. 모든 중간 정보를 무분별하게 누적하면 에이전트에 과부하가 걸려 비용이 증가하고 오류 발생 위험이 커질 수 있습니다. 우리는 효과적인 컨텍스트 관리는 적응적이어야 한다고 제안합니다. 즉, 에이전트의 경로의 일부는 현재 작업과의 관련성에 따라 다른 수준의 상세 정보로 유지되어야 합니다. 이러한 원칙을 구현하기 위해, 우리는 컨텍스트 관리, 추론 및 도구 사용을 통합하는 일반적인 에이전트 패러다임인 Context-ReAct을 소개합니다. Context-ReAct은 Skip, Compress, Rollback, Snippet 및 Delete의 다섯 가지 기본 연산을 제공하며, 이를 통해 에이전트는 작업 컨텍스트를 동적으로 재구성하여 중요한 증거를 보존하고, 해결된 정보를 요약하고, 쓸모없는 부분을 제거하고, 컨텍스트 크기를 제어할 수 있습니다. 우리는 Compress 연산이 표현적으로 완전함을 증명했으며, 다른 전문 연산들은 생성 비용과 환각 위험을 줄이는 효율성 및 정확성 보장을 제공합니다. 이러한 패러다임을 기반으로, 우리는 10,000개의 합성 경로를 사용하여 Qwen3-30B-A3B 모델을 미세 조정하여 개발된 장기 예측 검색 에이전트인 LongSeeker를 개발했습니다. 네 가지 대표적인 검색 벤치마크에서 LongSeeker는 BrowseComp에서 61.5%, BrowseComp-ZH에서 62.5%의 성능을 달성하여 Tongyi DeepResearch (43.2% 및 46.7%) 및 AgentFold (36.2% 및 47.3%)를 크게 능가했습니다. 이러한 결과는 적응형 컨텍스트 관리가 에이전트가 작업 메모리를 능동적으로 형성함으로써 보다 안정적이고 효율적인 장기 예측 추론을 달성할 수 있는 잠재력을 보여줍니다.

Original Abstract

Long-horizon search agents must manage a rapidly growing working context as they reason, call tools, and observe information. Naively accumulating all intermediate content can overwhelm the agent, increasing costs and the risk of errors. We propose that effective context management should be adaptive: parts of the agent's trajectory are maintained at different levels of detail depending on their current relevance to the task. To operationalize this principle, we introduce Context-ReAct, a general agentic paradigm for elastic context orchestration that integrates reasoning, context management, and tool use in a unified loop. Context-ReAct provides five atomic operations: Skip, Compress, Rollback, Snippet and Delete, which allow the agent to dynamically reshape its working context, preserving important evidence, summarizing resolved information, discarding unhelpful branches, and controlling context size. We prove that the Compress operator is expressively complete, while the other specialized operators provide efficiency and fidelity guarantees that reduce generation cost and hallucination risk. Building on this paradigm, we develop LongSeeker, a long-horizon search agent fine-tuned from Qwen3-30B-A3B on 10k synthesized trajectories. Across four representative search benchmarks, LongSeeker achieves 61.5% on BrowseComp and 62.5% on BrowseComp-ZH, substantially outperforming Tongyi DeepResearch (43.2% and 46.7%) and AgentFold (36.2% and 47.3%). These results highlight the potential of adaptive context management, showing that agents can achieve more reliable and efficient long-horizon reasoning by actively shaping their working memory.

2 Citations

0 Influential

6 Altmetric

32.0 Score

Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!