2606.16276v1 Jun 15, 2026 cs.AI

SpecAlign: Efficient Specification-Grounded Alignment of Large Language Models via Synthetic Data

Xiangliang Zhang
Xiangliang Zhang
Citations: 997
h-index: 16
Zhengqing Yuan
Zhengqing Yuan
Citations: 976
h-index: 10
Wenjie Wang
Wenjie Wang
Citations: 69
h-index: 4
Yue Zhao
Yue Zhao
Citations: 7
h-index: 2
Yue Huang
Yue Huang
Citations: 662
h-index: 10
Shiyi Du
Shiyi Du
Citations: 47
h-index: 2
Han Bao
Han Bao
Citations: 93
h-index: 4
Yuchen Ma
Yuchen Ma
Citations: 70
h-index: 4
Yanfang Ye
Yanfang Ye
Citations: 193
h-index: 7

As large language models (LLMs) are increasingly deployed in real-world applications, alignment is no longer governed by a single universal notion of safety or helpfulness, but instead by provider- or application-specific model specifications. These specifications are typically long, structured, and frequently updated, yet existing alignment pipelines lack a systematic mechanism to operationalize them as training signals. In this paper, we propose specification-grounded alignment, a new alignment paradigm that treats provider-authored model specifications as the primary alignment target rather than abstract principles or static benchmarks. To instantiate this paradigm, we introduce SpecAlign, a framework that synthesizes alignment data directly from specification documents. SpecAlign combines structured rule annotation, controllable specification instantiation, and multi-agent adversarial data synthesis to generate fine-grained, boundary-aware preference pairs that capture both compliant behaviors and meaningful specification violations. Experiments across multiple model specifications and backbone models demonstrate that training with SpecAlign consistently improves rule compliance while preserving general capabilities and avoiding over-conservative behavior. These results suggest that grounding alignment in explicit model specifications enables rapid, precise, and scalable adaptation of LLM behavior to evolving policy requirements.

0 Citations
0 Influential
8 Altmetric
40.0 Score
Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

Log in to request an AI analysis.

댓글

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!