2602.12056v1 Feb 12, 2026 cs.AI

LawThinker: 동적 환경에서의 심층 연구 법률 에이전트

LawThinker: A Deep Research Legal Agent in Dynamic Environments

Tongyu Wen

Citations: 29

h-index: 2

Zhicheng Dou

Citations: 2,389

h-index: 24

Xinyu Yang

Citations: 215

h-index: 3

Chenlong Deng

Citations: 762

h-index: 9

Binyu Xie

Citations: 32

h-index: 2

법률적 추론은 정확한 결과뿐만 아니라 절차적으로 부합하는 추론 과정도 필요로 한다. 그러나 기존 방법론들은 중간 추론 단계를 검증하는 메커니즘이 부족하여, 적용 불가능한 법령 인용과 같은 오류가 감지되지 않은 채 추론 사슬 전체로 전파되는 문제가 있다. 이를 해결하기 위해, 본 논문에서는 동적 사법 환경에 맞춰 '탐색-검증-기억(Explore-Verify-Memorize)' 전략을 채택한 자율 법률 연구 에이전트인 LawThinker를 제안한다. 핵심 아이디어는 모든 지식 탐색 단계 직후에 검증 과정을 필수적인 원자적(atomic) 연산으로 수행하는 것이다. DeepVerifier 모듈은 각 검색 결과를 지식의 정확성, 사실과 법률의 관련성, 절차적 준수성이라는 세 가지 차원에서 검사하며, 메모리 모듈을 통해 긴 호흡의 과제(long-horizon tasks)에서도 라운드 간 지식 재사용을 지원한다. 동적 벤치마크인 J1-EVAL에서의 실험 결과, LawThinker는 직접 추론 방식 대비 24%, 워크플로우 기반 방식 대비 11% 향상된 성능을 달성했으며, 특히 과정 중심 지표에서 두드러진 개선을 보였다. 또한 세 가지 정적 벤치마크에 대한 평가를 통해 모델의 일반화 능력을 추가로 확인하였다. 코드는 https://github.com/yxy-919/LawThinker-agent 에서 확인할 수 있다.

Original Abstract

Legal reasoning requires not only correct outcomes but also procedurally compliant reasoning processes. However, existing methods lack mechanisms to verify intermediate reasoning steps, allowing errors such as inapplicable statute citations to propagate undetected through the reasoning chain. To address this, we propose LawThinker, an autonomous legal research agent that adopts an Explore-Verify-Memorize strategy for dynamic judicial environments. The core idea is to enforce verification as an atomic operation after every knowledge exploration step. A DeepVerifier module examines each retrieval result along three dimensions of knowledge accuracy, fact-law relevance, and procedural compliance, with a memory module for cross-round knowledge reuse in long-horizon tasks. Experiments on the dynamic benchmark J1-EVAL show that LawThinker achieves a 24% improvement over direct reasoning and an 11% gain over workflow-based methods, with particularly strong improvements on process-oriented metrics. Evaluations on three static benchmarks further confirm its generalization capability. The code is available at https://github.com/yxy-919/LawThinker-agent .

2 Citations

0 Influential

47.455212266792 Altmetric

239.3 Score

Original PDF

AI Analysis

Korean Summary

LawThinker는 동적인 사법 환경에서 정확한 결과와 절차적 준수성을 모두 달성하기 위해 개발된 자율 법률 연구 에이전트입니다. 기존 모델들이 중간 추론 단계의 오류를 감지하지 못해 잘못된 법률을 인용하는 문제를 해결하기 위해 '탐색-검증-기억(Explore-Verify-Memorize)' 전략을 도입했습니다. 핵심 요소인 DeepVerifier 모듈은 지식 탐색 후 즉시 정확성, 관련성, 절차적 적합성을 검증하여 오류 전파를 차단합니다. 또한, 검증된 지식과 사건 맥락을 저장하는 메모리 모듈을 통해 긴 호흡의 법률 자문이나 재판 시뮬레이션에서도 일관된 추론을 유지합니다.

Key Innovations

탐색-검증-기억(Explore-Verify-Memorize) 전략을 통한 단계별 추론 강화
지식 정확성, 사실-법률 관련성, 절차적 준수성을 평가하는 DeepVerifier 모듈
법률 지식과 사건 문맥을 분리 관리하여 재사용성을 높인 이중 메모리 메커니즘
법률 탐색, 검증, 기억을 지원하는 15종의 특화된 법률 도구(Tools) 활용

Learning & Inference Impact

이 프레임워크는 학습 단계보다는 추론(Inference) 단계에 결정적인 영향을 미칩니다. 모델이 단순히 답변을 생성하는 것에 그치지 않고, 각 추론 단계마다 외부 도구를 호출하여 근거를 탐색하고 검증하도록 강제함으로써 환각(Hallucination)을 억제합니다. 검증 실패 시 추론을 수정하거나 재탐색하는 피드백 루프를 통해 법적 논리의 정합성을 높이며, 메모리 모듈을 활용해 긴 문맥의 대화에서도 이전 단계의 검증된 정보를 효율적으로 재사용하여 연산 효율과 일관성을 동시에 개선합니다.

Technical Difficulty

중급

Estimated implementation complexity based on methodology.

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!