2601.17569v1 Jan 24, 2026 cs.CL

개인 맞춤형 생성에서의 사용자 프라이버시 향상: 서버 측에서 생성된 초안에 대한 클라이언트 측 검색 기반 수정

Improving User Privacy in Personalized Generation: Client-Side Retrieval-Augmented Modification of Server-Side Generated Speculations

Citations: 1,237

h-index: 14

Citations: 753

h-index: 14

개인화는 대규모 언어 모델(LLM)의 출력 결과를 개별 사용자의 선호도와 배경 지식에 맞추는 데 매우 중요합니다. 최첨단 솔루션은 검색 증강(retrieval augmentation)을 기반으로 하며, 이는 LLM에 사용될 관련 컨텍스트를 사용자 프로필에서 검색하는 방식입니다. 이러한 방법은 검색된 개인 정보를 클라우드 제공업체에 노출시키는 것과 로컬 모델의 성능에 의존하는 것 사이의 균형을 맞춥니다. 본 논문에서는 서버 측 LLM에 개인 프로필을 공개하지 않고도 고품질의 개인화를 제공하는 상호 작용 프레임워크인 $P^3$을 소개합니다. $P^3$에서, 대규모 서버 측 모델은 사용자 쿼리에만 기반하여 $k$개의 초안 토큰 시퀀스를 생성하고, 검색 기능을 통해 사용자의 개인 프로필에 접근할 수 있는 소규모 클라이언트 측 모델이 이러한 초안을 평가하고 수정하여 사용자 선호도를 더 잘 반영합니다. 이 과정은 종료 토큰이 생성될 때까지 반복됩니다. 세 가지 개인 맞춤형 질문 답변 데이터 세트로 구성된 최신 벤치마크인 LaMP-QA에 대한 실험 결과, $P^3$은 비개인화된 서버 측 모델과 개인화된 클라이언트 측 기본 모델 모두를 능가하며, 평균적으로 7.4%에서 9%의 통계적으로 유의미한 성능 향상을 달성했습니다. 특히, $P^3$은 전체 프로필이 대규모 서버 측 모델에 노출되는 '유출' 시나리오의 유용성의 90.3%에서 95.7%를 복구합니다. 링크 가능성 및 속성 추론 공격을 포함한 프라이버시 분석 결과, $P^3$은 비개인화된 서버 측 모델의 프라이버시를 유지하며, 개인 정보 없이 쿼리를 제출하는 것과 비교하여 미미한 추가적인 정보 유출(1.5%–3.5%)만 발생합니다. 또한, 이 프레임워크는 엣지 배포에 효율적이며, 클라이언트 측 모델이 총 토큰의 9.2%만 생성합니다. 이러한 결과는 $P^3$이 향상된 프라이버시를 갖춘 개인 맞춤형 생성에 대한 실용적이고 효과적인 솔루션을 제공한다는 것을 보여줍니다.

Original Abstract

Personalization is crucial for aligning Large Language Model (LLM) outputs with individual user preferences and background knowledge. State-of-the-art solutions are based on retrieval augmentation, where relevant context from a user profile is retrieved for LLM consumption. These methods deal with a trade-off between exposing retrieved private data to cloud providers and relying on less capable local models. We introduce $P^3$, an interactive framework for high-quality personalization without revealing private profiles to server-side LLMs. In $P^3$, a large server-side model generates a sequence of $k$ draft tokens based solely on the user query, while a small client-side model, with retrieval access to the user's private profile, evaluates and modifies these drafts to better reflect user preferences. This process repeats until an end token is generated. Experiments on LaMP-QA, a recent benchmark consisting of three personalized question answering datasets, show that $P^3$ consistently outperforms both non-personalized server-side and personalized client-side baselines, achieving statistically significant improvements of $7.4%$ to $9%$ on average. Importantly, $P^3$ recovers $90.3%$ to $95.7%$ of the utility of a ``leaky'' upper-bound scenario in which the full profile is exposed to the large server-side model. Privacy analyses, including linkability and attribute inference attacks, indicate that $P^3$ preserves the privacy of a non-personalized server-side model, introducing only marginal additional leakage ($1.5%$--$3.5%$) compared to submitting a query without any personal context. Additionally, the framework is efficient for edge deployment, with the client-side model generating only $9.2%$ of the total tokens. These results demonstrate that $P^3$ provides a practical, effective solution for personalized generation with improved privacy.

0 Citations

0 Influential

7 Altmetric

35.0 Score

Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!