2603.25723v1 Mar 26, 2026 cs.CL

자연어 기반 에이전트 하니스

Natural-Language Agent Harnesses

Linyue Pan

Tsinghua University

Citations: 23

h-index: 2

Lexiao Zou

Citations: 96

h-index: 3

Hai-Tao Zheng

Citations: 35

h-index: 2

Shuo Guo

Citations: 33

h-index: 2

Jingcheng Ni

Citations: 26

h-index: 2

에이전트의 성능은 점점 더 extit{하니스 설계}에 의존하지만, 하니스 설계는 일반적으로 컨트롤러 코드와 런타임 관련 규칙에 숨겨져 있어, 이식, 비교 및 과학적 연구를 어렵게 만듭니다. 본 연구에서는 에이전트 하니스의 고수준 제어 로직을 이식 가능한 실행 파일 형태로 외부화할 수 있는지 질문합니다. 우리는 extbf{자연어 기반 에이전트 하니스 (Natural-Language Agent Harnesses, NLAH)}를 제안하며, 이는 편집 가능한 자연어로 하니스의 동작을 표현합니다. 또한 extbf{지능형 하니스 런타임 (Intelligent Harness Runtime, IHR)}을 제안합니다. 이는 명시적인 계약, 지속 가능한 아티팩트, 그리고 경량 어댑터를 통해 이러한 하니스를 실행하는 공유 런타임입니다. 우리는 코딩 및 컴퓨터 사용 벤치마크를 통해 운영 가능성, 모듈 제거, 코드-텍스트 하니스 변환 등의 측면에서 체계적인 평가를 수행했습니다.

Original Abstract

Agent performance increasingly depends on \emph{harness engineering}, yet harness design is usually buried in controller code and runtime-specific conventions, making it hard to transfer, compare, and study as a scientific object. We ask whether the high-level control logic of an agent harness can instead be externalized as a portable executable artifact. We introduce \textbf{Natural-Language Agent Harnesses} (NLAHs), which express harness behavior in editable natural language, and \textbf{Intelligent Harness Runtime} (IHR), a shared runtime that executes these harnesses through explicit contracts, durable artifacts, and lightweight adapters. Across coding and computer-use benchmarks, we conduct controlled evaluations of operational viability, module ablation, and code-to-text harness migration.

19 Citations

1 Influential

1.5 Altmetric

28.5 Score

Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!