2604.14609v1 Apr 16, 2026 cs.AI

엘 아젠테 포르자도르: 양자 시뮬레이션을 위한 작업 기반 에이전트 생성

El Agente Forjador: Task-Driven Agent Generation for Quantum Simulation

Ignacio Gustin

Citations: 9

h-index: 2

Zijian Zhang

Citations: 72

h-index: 1

Aiwei Yin

Citations: 5

h-index: 1

Amaan Baweja

Citations: 1

h-index: 1

Jiaru Bai

Citations: 72

h-index: 1

Varinia Bernales

Citations: 72

h-index: 1

Alán Aspuru-Guzik

Citations: 72

h-index: 1

과학 분야의 인공지능은 연구 개발 과정을 가속화할 수 있는 잠재력을 가지고 있습니다. 대규모 언어 모델(LLM)과 에이전트 기반 워크플로우의 발전은 다양한 과학적 작업의 효율성을 높이는 데 기여하고 있습니다. 그러나 현재 대부분의 에이전트 시스템은 정적이며 수동으로 구성된 도구 세트에 의존하는데, 이는 새로운 분야에 대한 적응성과 변화하는 라이브러리에 대한 대응성을 저해합니다. 본 연구에서는 '엘 아젠테 포르자도르'라는 다중 에이전트 프레임워크를 제시합니다. 이 프레임워크에서 범용 코딩 에이전트는 4단계의 워크플로우(도구 분석, 도구 생성, 작업 실행, 반복적인 솔루션 평가)를 통해 자동으로 계산 도구를 생성, 검증 및 재사용합니다. 양자 화학 및 양자 역학 분야의 24개 작업과 5가지 코딩 에이전트 구성에 대한 평가를 통해, 다음 세 가지 운영 모드를 비교했습니다. 첫째, 각 작업에 대한 도구를 즉석에서 생성하는 방식, 둘째, 미리 구성된 도구 세트를 재사용하는 방식, 셋째, 코딩 에이전트를 직접 사용하여 문제를 해결하는 방식입니다. 그 결과, 도구 생성 및 재사용 프레임워크가 기준 방식보다 일관되게 정확도를 향상시키는 것으로 나타났습니다. 또한, 더 강력한 코딩 에이전트에 의해 구축된 도구 세트를 재사용하면 API 비용을 줄이고, 상대적으로 성능이 낮은 코딩 에이전트의 솔루션 품질을 크게 향상시킬 수 있음을 확인했습니다. 추가적인 사례 연구를 통해, 서로 다른 분야에서 생성된 도구를 결합하여 복합적인 작업을 해결할 수 있음을 보여줍니다. 종합적으로, 이러한 결과는 LLM 기반 에이전트가 과학적 지식과 코딩 능력을 활용하여 재사용 가능한 과학 도구를 자율적으로 구축할 수 있음을 보여주며, 에이전트의 기능이 특정 구현 방식이 아닌, 해결하도록 설계된 작업에 의해 정의되는 패러다임을 제시합니다.

Original Abstract

AI for science promises to accelerate the discovery process. The advent of large language models (LLMs) and agentic workflows enables the expediting of a growing range of scientific tasks. However, most of the current generation of agentic systems depend on static, hand-curated toolsets that hinder adaptation to new domains and evolving libraries. We present El Agente Forjador, a multi-agent framework in which universal coding agents autonomously forge, validate, and reuse computational tools through a four-stage workflow of tool analysis, tool generation, task execution, and iterative solution evaluation. Evaluated across 24 tasks spanning quantum chemistry and quantum dynamics on five coding agent setups, we compare three operating modes: zero-shot generation of tools per task, reuse of a curriculum-built toolset, and direct problem-solving with the coding agents as the baseline. We find that our tool generation and reuse framework consistently improves accuracy over the baseline. We also show that reusing a toolset built by a stronger coding agent can reduce API cost and substantially raises the solution quality for weaker coding agents. Case studies further demonstrate that tools forged for different domains can be combined to solve hybrid tasks. Taken together, these results show that LLM-based agents can use their scientific knowledge and coding capabilities to autonomously build reusable scientific tools, pointing toward a paradigm in which agent capabilities are defined by the tasks they are designed to solve rather than by explicitly engineered implementations.

1 Citations

0 Influential

1 Altmetric

6.0 Score

Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!