2602.20639v1 Feb 24, 2026 cs.AI

구체화된 행동을 통한 과학적 발견에서의 LLM 활용

Grounding LLMs in Scientific Discovery via Embodied Actions

Jinfeng Zhou

Citations: 189

h-index: 5

Hongning Wang

Citations: 2,831

h-index: 18

Bo Zhang

Citations: 20

h-index: 3

Yuxuan Chen

Citations: 39

h-index: 4

Minlie Huang

Citations: 2,101

h-index: 20

Jianing Yin

University of Pennsylvania

Citations: 29

h-index: 3

대규모 언어 모델(LLM)은 과학적 발견 분야에서 상당한 잠재력을 보여주었지만, 이론적 추론과 검증 가능한 물리 시뮬레이션 간의 간극을 좁히는 데 어려움을 겪습니다. 기존의 솔루션은 수동적인 "실행-응답" 루프 방식으로 작동하며, 따라서 실시간 인지 능력이 부족하여 일시적인 이상 현상(예: 수치 불안정 또는 발산하는 진동)에 대한 에이전트의 대응을 저해합니다. 이러한 제한 사항을 해결하기 위해, 우리는 LLM을 구체화된 행동과 밀접하게 결합된 인지-실행 루프를 통해 에이전트로 변환하는 프레임워크인 EmbodiedAct를 제안합니다. EmbodiedAct를 MATLAB 환경에 구현하고, 복잡한 엔지니어링 설계 및 과학 모델링 작업에 적용하여 성능을 평가했습니다. 광범위한 실험 결과, EmbodiedAct는 기존의 기준 모델보다 훨씬 뛰어난 성능을 보이며, 장기 시뮬레이션에서 안정성과 신뢰성을 확보하고 과학 모델링의 정확도를 향상시키는 최첨단(SOTA) 성능을 달성했습니다.

Original Abstract

Large Language Models (LLMs) have shown significant potential in scientific discovery but struggle to bridge the gap between theoretical reasoning and verifiable physical simulation. Existing solutions operate in a passive "execute-then-response" loop and thus lacks runtime perception, obscuring agents to transient anomalies (e.g., numerical instability or diverging oscillations). To address this limitation, we propose EmbodiedAct, a framework that transforms established scientific software into active embodied agents by grounding LLMs in embodied actions with a tight perception-execution loop. We instantiate EmbodiedAct within MATLAB and evaluate it on complex engineering design and scientific modeling tasks. Extensive experiments show that EmbodiedAct significantly outperforms existing baselines, achieving SOTA performance by ensuring satisfactory reliability and stability in long-horizon simulations and enhanced accuracy in scientific modeling.

1 Citations

0 Influential

10 Altmetric

51.0 Score

Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!