2604.23190v1 Apr 25, 2026 cs.SE

RAT: 완전 자동 환경 구성 기반의 모든 프로그램 실행

RAT: RunAnyThing via Fully Automated Environment Configuration

Renhong Huang

Citations: 65

h-index: 3

Dongdong Hua

Citations: 1

h-index: 1

Yifei Sun

Citations: 1

h-index: 1

Sitao Ding

Citations: 0

h-index: 0

Hanyang Yuan

Citations: 18

h-index: 3

Daixin Wang

Citations: 20

h-index: 3

Yang Yang

Citations: 8

h-index: 2

자율 코드 에이전트의 핵심 과제 중 하나는 리포지토리 수준의 소프트웨어 엔지니어링 작업을 자동화하는 것인데, 이는 실행 환경 구성을 어렵게 하기 때문입니다. 그러나 수동 구성은 여전히 많은 노동력을 필요로 하는 병목 현상이며, 완전 자동화된 환경 구성으로의 전환이 필요합니다. 기존 방식은 종종 미리 정의된 아티팩트에 의존하거나 특정 프로그래밍 언어에 제한되어 있어 실제 리포지토리에 적용하기 어렵습니다. 본 논문에서는 임의의 리포지토리에서 자동 환경 구성을 위한 언어에 구애받지 않는 프레임워크인 RAT(RunAnyThing)을 제안합니다. RAT은 의미 기반 초기화, 계획 메커니즘, 특수 도구 세트, 그리고 견고한 샌드박스를 통합한 다단계 파이프라인을 특징으로 합니다. 또한, 엄격한 평가를 위해 실제 리포지토리의 분포와 다양성을 반영하는 벤치마크인 RATBench를 제안합니다. 광범위한 실험 결과, RAT은 최첨단 성능을 달성했으며, 강력한 기준 모델 대비 평균 29.6% 향상된 환경 설정 성공률(ESSR)을 보였습니다.

Original Abstract

Automating repository-level software engineering tasks is a foundational challenge for autonomous code agents, largely due to the difficulty of configuring executable environments. However, manual configuration remains a labor-intensive bottleneck, necessitating a transition toward fully automated environment configuration. Existing approaches often rely on pre-defined artifacts or are restricted to specific programming languages, limiting their applicability to real-world repositories. In this paper, we first propose RAT (RunAnyThing), a language-agnostic framework for automated environment configuration on arbitrary repositories. RAT features a multi-stage pipeline that integrates semantic initialization, a planning mechanism, specialized toolset, and a robust sandbox for configuration. Furthermore, to enable rigorous evaluation, we propose RATBench, a benchmark that reflects the the distribution and heterogeneity of real-world repositories. Extensive experiments demonstrate that RAT achieves state-of-the-art performance, improving the Environment Setup Success Rate (ESSR) by an average of 29.6% over strong baselines.

0 Citations

0 Influential

1.5 Altmetric

7.5 Score

Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!