2605.25796v1 May 25, 2026 cs.CR

SAMark: A Self-Anchored Text Watermarking with Paragraph-Level Paraphrase Robustness

Yibo Yan
Yibo Yan
Citations: 1,360
h-index: 19
Xuming Hu
Xuming Hu
Citations: 760
h-index: 16
Wenjie Qu
Wenjie Qu
Citations: 206
h-index: 8
Jiaheng Zhang
Jiaheng Zhang
Citations: 199
h-index: 8
Kening Zheng
Kening Zheng
Citations: 245
h-index: 7
Jiahao Huo
Jiahao Huo
Citations: 466
h-index: 9
Philip S. Yu
Philip S. Yu
Citations: 125
h-index: 4
Ming Zhou
Ming Zhou
Citations: 565
h-index: 3

Semantic-level watermarking (SWM) improves robustness against text modifications by treating sentences as the basic unit. However, robustness to paragraph-level paraphrasing remains difficult because such attacks globally disrupt watermark signals by changing sentence order. In this work, we propose SAMark, a self-anchored watermarking framework that removes the dependency on sentence order by establishing a step-independent green region in semantic space. To improve detectability, we introduce a multi-channel hyperbolic scoring mechanism that amplifies watermark signals while suppressing noise from weakly aligned candidates. We further propose a diversity-aware filtering strategy that combines hard filtering with soft regularization, extending beyond simple n-gram repetition filters to address semantic redundancy. Experimental results show that SAMark achieves up to 90.2% TP@FP1% under typical paragraph-level paraphrasing attacks, outperforming the strongest prior baseline by more than 30% on average, while maintaining generation quality competitive with unwatermarked text and breaking the robustness-quality trade-off that limits prior methods.

0 Citations
0 Influential
9.5 Altmetric
47.5 Score
Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

Log in to request an AI analysis.

댓글

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!