2604.08284v1 Apr 09, 2026 cs.CL

대규모 언어 모델의 규칙 기반 지식에 대한 분산 다층 편집

Distributed Multi-Layer Editing for Rule-Level Knowledge in Large Language Models

Wenting Zhao

Citations: 4

h-index: 2

Yongshun Gong

Citations: 8

h-index: 2

Hao Sun

Citations: 95

h-index: 5

Yaqi Zhao

Citations: 7

h-index: 1

Yilong Yin

Citations: 45

h-index: 3

Yating Wang

Citations: 8

h-index: 1

대규모 언어 모델은 개별 사실뿐만 아니라, 기호 표현, 자연어 설명, 구체적인 예시를 포괄하는 추론을 지원하는 규칙을 저장합니다. 그러나 대부분의 모델 편집 방법은 사실 수준의 지식을 대상으로 하며, 특정 편집은 국소적인 개입을 통해 달성될 수 있다고 가정합니다. 이러한 가정은 규칙 수준의 지식에는 적용되지 않습니다. 왜냐하면 단일 규칙은 여러 상호 의존적인 형태로 일관성을 유지해야 하기 때문입니다. 본 연구에서는 규칙 수준의 지식 편집 문제를 메커니즘적 연구를 통해 탐구합니다. 이 연구를 지원하기 위해, 수학 및 물리학 분야의 80개에서 200개의 수동으로 검증된 규칙으로 RuleEdit 벤치마크를 확장했습니다. 세밀한 인과 관계 추적을 통해 트랜스포머 레이어 내에서 규칙 지식이 형태별로 조직되어 있음을 확인했습니다. 공식 및 설명은 초기 레이어에 집중되어 있는 반면, 예시는 중간 레이어와 더 관련성이 높습니다. 이러한 결과는 규칙 지식이 균일하게 국소화되어 있지 않으며, 따라서 단일 레이어 또는 연속적인 블록 개입을 통해 신뢰성 있게 편집될 수 없다는 것을 시사합니다. 이러한 통찰력을 바탕으로, 공식 및 설명에 대한 공유된 초기 레이어 업데이트와 예시에 대한 별도의 중간 레이어 업데이트를 적용하는 분산 다층 편집(DMLE) 방법을 제안합니다. DMLE는 표준 편집 지표에서 경쟁력을 유지하면서도, GPT-J-6B, Qwen2.5-7B, Qwen2-7B, 및 LLaMA-3-8B 모델에서 평균적으로 13.91%의 예시 이식성 및 50.19%의 규칙 이해도 향상을 보여주며, 가장 강력한 기준 모델보다 뛰어난 규칙 수준의 편집 성능을 달성합니다. 코드 및 추가 정보는 https://github.com/Pepper66/DMLE 에서 확인할 수 있습니다.

Original Abstract

Large language models store not only isolated facts but also rules that support reasoning across symbolic expressions, natural language explanations, and concrete instances. Yet most model editing methods are built for fact-level knowledge, assuming that a target edit can be achieved through a localized intervention. This assumption does not hold for rule-level knowledge, where a single rule must remain consistent across multiple interdependent forms. We investigate this problem through a mechanistic study of rule-level knowledge editing. To support this study, we extend the RuleEdit benchmark from 80 to 200 manually verified rules spanning mathematics and physics. Fine-grained causal tracing reveals a form-specific organization of rule knowledge in transformer layers: formulas and descriptions are concentrated in earlier layers, while instances are more associated with middle layers. These results suggest that rule knowledge is not uniformly localized, and therefore cannot be reliably edited by a single-layer or contiguous-block intervention. Based on this insight, we propose Distributed Multi-Layer Editing (DMLE), which applies a shared early-layer update to formulas and descriptions and a separate middle-layer update to instances. While remaining competitive on standard editing metrics, DMLE achieves substantially stronger rule-level editing performance. On average, it improves instance portability and rule understanding by 13.91 and 50.19 percentage points, respectively, over the strongest baseline across GPT-J-6B, Qwen2.5-7B, Qwen2-7B, and LLaMA-3-8B. The code is available at https://github.com/Pepper66/DMLE.

0 Citations

0 Influential

31.45879734614 Altmetric

157.3 Score

Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!