2605.25796v1 May 25, 2026 cs.CR

SAMark: A Self-Anchored Text Watermarking with Paragraph-Level Paraphrase Robustness

Yibo Yan

Citations: 1,360

h-index: 19

Xuming Hu

Citations: 760

h-index: 16

Wenjie Qu

Citations: 206

h-index: 8

Jiaheng Zhang

Citations: 199

h-index: 8

Kening Zheng

Citations: 245

h-index: 7

Jiahao Huo

Citations: 466

h-index: 9

Philip S. Yu

Citations: 125

h-index: 4

Ming Zhou

Citations: 565

h-index: 3

Semantic-level watermarking (SWM) improves robustness against text modifications by treating sentences as the basic unit. However, robustness to paragraph-level paraphrasing remains difficult because such attacks globally disrupt watermark signals by changing sentence order. In this work, we propose SAMark, a self-anchored watermarking framework that removes the dependency on sentence order by establishing a step-independent green region in semantic space. To improve detectability, we introduce a multi-channel hyperbolic scoring mechanism that amplifies watermark signals while suppressing noise from weakly aligned candidates. We further propose a diversity-aware filtering strategy that combines hard filtering with soft regularization, extending beyond simple n-gram repetition filters to address semantic redundancy. Experimental results show that SAMark achieves up to 90.2% TP@FP1% under typical paragraph-level paraphrasing attacks, outperforming the strongest prior baseline by more than 30% on average, while maintaining generation quality competitive with unwatermarked text and breaking the robustness-quality trade-off that limits prior methods.

0 Citations

0 Influential

9.5 Altmetric

47.5 Score

Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!