2601.18226v2 Jan 26, 2026 cs.AI

윤주에이전트 기술 보고서: 완전한 재현성을 갖춘, 초기 상태에서 자체 진화하는 에이전트 시스템을 위한 개방형 작업

Yunjue Agent Tech Report: A Fully Reproducible, Zero-Start In-Situ Self-Evolving Agent System for Open-Ended Tasks

Shijun Yang

Citations: 21

h-index: 2

Silei Zhao

Citations: 2

h-index: 1

Haotian Li

Citations: 70

h-index: 3

Weizhen Qi

Citations: 53

h-index: 2

Rui Hua

Citations: 43

h-index: 2

M. Song

Citations: 31

h-index: 3

Xiaojia Yang

Citations: 6

h-index: 2

Chao Peng

Citations: 502

h-index: 10

기존의 에이전트 시스템은 작업 분포가 지속적으로 변화하고 외부 감독이 부족한 개방형 환경에서 어려움을 겪는 경우가 많습니다. 이러한 시스템은 정적인 도구 세트나 오프라인 학습에 의존하기 때문에, 시스템의 능력 한계가 경직되어 있고 알려지지 않은 경우가 많습니다. 이러한 문제를 해결하기 위해, 우리는 In-Situ Self-Evolving (현장 자체 진화) 패러다임을 제안합니다. 이 접근 방식은 순차적인 작업 상호 작용을 지속적인 경험 스트림으로 취급하여, 시스템이 정답 레이블 없이 단기 실행 피드백을 장기적인, 재사용 가능한 능력으로 변환할 수 있도록 합니다. 이 프레임워크 내에서, 우리는 도구 진화를 능력 확장의 핵심 경로로 파악하고, 이는 검증 가능한 이진 피드백 신호를 제공합니다. 이 프레임워크 내에서, 우리는 반복적으로 도구를 합성, 최적화하고 재사용하여 새로운 과제를 해결하는 시스템인 Yunjue Agent를 개발했습니다. 진화 효율성을 최적화하기 위해, 우리는 병렬 배치 진화(Parallel Batch Evolution) 전략을 추가로 도입했습니다. 초기 상태(zero-start)에서 다섯 가지 다양한 벤치마크에 대한 실험적 평가 결과, 독점적인 기준 시스템보다 상당한 성능 향상을 보였습니다. 또한, 보완적인 초기 상태 설정(warm-start) 평가를 통해, 축적된 일반적인 지식을 새로운 영역으로 원활하게 이전할 수 있음을 확인했습니다. 마지막으로, 우리는 진화 수렴을 모니터링하는 새로운 지표를 제안하며, 이는 기존 최적화에서 손실 함수와 유사한 역할을 합니다. 우리는 우리의 코드, 시스템 추적 정보 및 진화된 도구를 공개하여, 탄력적인, 자체 진화하는 지능에 대한 미래 연구를 촉진하고자 합니다.

Original Abstract

Conventional agent systems often struggle in open-ended environments where task distributions continuously drift and external supervision is scarce. Their reliance on static toolsets or offline training lags behind these dynamics, leaving the system's capability boundaries rigid and unknown. To address this, we propose the In-Situ Self-Evolving paradigm. This approach treats sequential task interactions as a continuous stream of experience, enabling the system to distill short-term execution feedback into long-term, reusable capabilities without access to ground-truth labels. Within this framework, we identify tool evolution as the critical pathway for capability expansion, which provides verifiable, binary feedback signals. Within this framework, we develop Yunjue Agent, a system that iteratively synthesizes, optimizes, and reuses tools to navigate emerging challenges. To optimize evolutionary efficiency, we further introduce a Parallel Batch Evolution strategy. Empirical evaluations across five diverse benchmarks under a zero-start setting demonstrate significant performance gains over proprietary baselines. Additionally, complementary warm-start evaluations confirm that the accumulated general knowledge can be seamlessly transferred to novel domains. Finally, we propose a novel metric to monitor evolution convergence, serving as a function analogous to training loss in conventional optimization. We open-source our codebase, system traces, and evolved tools to facilitate future research in resilient, self-evolving intelligence.

2 Citations

0 Influential

5 Altmetric

27.0 Score

Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!