2605.03706v1 May 05, 2026 cs.CL

SAM-NER: 의미적 원형 매개 방식을 이용한 제로샷 개체명 인식

SAM-NER: Semantic Archetype Mediation for Zero-Shot Named Entity Recognition

Ruichu Cai

Citations: 83

h-index: 5

Boyan Xu

Citations: 346

h-index: 10

Zhifeng Hao

Citations: 276

h-index: 9

Jun-Wei Gan

Citations: 76

h-index: 2

Miao-miao Mai

Citations: 952

h-index: 9

제로샷 개체명 인식(ZS-NER)은 도메인 및 스키마 변화에 취약하며, 특히 새로운 레이블 정의가 대규모 언어 모델(LLM)의 내재적 의미 구조와 일치하지 않는 경우가 많습니다. 그 결과, 개체 언급을 세분화된 대상 레이블에 직접 매핑하면 체계적인 의미적 편향이 발생할 수 있으며, 특히 대상 스키마가 새롭거나 의미적으로 겹치는 경우 더욱 그렇습니다. 본 연구에서는 extbf{SAM-NER}라는 세 단계 프레임워크를 제안합니다. SAM-NER은 extit{의미적 원형 매개(Semantic Archetype Mediation)} 방식을 기반으로 하며, 중간 단계의 도메인 불변 원형 공간을 통해 도메인 간 전이를 안정화합니다. SAM-NER은 (i) 협력적 추출 및 합의 기반 노이즈 제거를 통해 높은 수준의 커버리지와 정확도를 갖는 개체 스팬을 얻는 extit{개체 발견(Entity Discovery)}을 수행하고, (ii) 개체들을 고수준 온톨로지 추상화로부터 추출된 보편적인 의미 원형 집합으로 투영하는 extit{추상 매개(Abstract Mediation)}를 수행하고, (iii) 제약 조건 및 정의에 기반한 추론을 통해 고정된 LLM을 사용하여 원형 수준 예측을 대상 도메인 유형으로 변환하는 extit{의미적 교정(Semantic Calibration)}을 적용합니다. CrossNER 벤치마크에서의 실험 결과, SAM-NER은 다양한 도메인 환경에서 기존의 강력한 ZS-NER 모델들을 지속적으로 능가하는 것으로 나타났습니다. 저희의 구현체는 다음 GitHub 주소에서 공개될 예정입니다: https://github.com/DMIRLAB-Group/SAM-NER.

Original Abstract

Zero-shot Named Entity Recognition (ZS-NER) remains brittle under domain and schema shifts, where unseen label definitions often misalign with a large language model's (LLM's) intrinsic semantic organization. As a result, directly mapping entity mentions to fine-grained target labels can induce systematic semantic drift, especially when target schemas are novel or semantically overlapping. We propose \textbf{SAM-NER}, a three-stage framework based on \emph{Semantic Archetype Mediation} that stabilizes cross-domain transfer through an intermediate, domain-invariant archetype space. SAM-NER: (i) performs \emph{Entity Discovery} via cooperative extraction and consensus-based denoising to obtain high-coverage, high-fidelity entity spans; (ii) conducts \emph{Abstract Mediation} by projecting entities into a compact set of universal semantic archetypes distilled from high-level ontological abstractions; and (iii) applies \emph{Semantic Calibration} to resolve archetype-level predictions into target-domain types through constrained, definition-aligned inference with a frozen LLM. Experiments on the CrossNER benchmark show that SAM-NER consistently outperforms strong prior ZS-NER baselines in cross-domain settings. Our implementation will be open-sourced at https://github.com/DMIRLAB-Group/SAM-NER.

0 Citations

0 Influential

30.493061443341 Altmetric

152.5 Score

Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!