2605.28066v1 May 27, 2026 cs.CL

PromptEmbedder:: Efficient and Transferable Text Embedding via Dual-LLM Soft Prompting

Yu-Che Tsai
Yu-Che Tsai
Citations: 197
h-index: 8
Kuan-Yu Chen
Kuan-Yu Chen
Citations: 6
h-index: 1
Shou-De Lin
Shou-De Lin
Citations: 33
h-index: 2
Yurui Chang
Yurui Chang
Citations: 31
h-index: 3
Yuanzhou Chen
Yuanzhou Chen
Citations: 38
h-index: 4
Ching-Yu Tsai
Ching-Yu Tsai
Citations: 9
h-index: 2
Yu-Hsiang Chuang
Yu-Hsiang Chuang
Citations: 0
h-index: 0

Large Language Models (LLMs) have demonstrated remarkable efficacy in text embedding, yet current adaptation methods like LoRA face significant bottlenecks in computational efficiency and cross-architecture transferability. Whenever a new backbone emerges, existing approaches require costly retraining from scratch. To address this, we propose PromptEmbedder, a novel dual-LLM framework that decouples embedding knowledge from specific backbone weights. PromptEmbedder utilizes a Prompting LLM to generate instruction-aware soft prompts for a frozen Embedding LLM via a differentiable generation process with continuous relaxation, ensuring full gradient flow during contrastive training. By localizing task-specific knowledge within the Prompting LLM, adapting to new architectures requires only retraining a lightweight linear alignment matrix. Evaluations on the MTEB benchmark show that PromptEmbedder achieves comparable performance with LoRA finetuning while reducing GPU memory by 40% and accelerating training by 3.7x. Our approach establishes a scalable, architecture-agnostic paradigm for efficient LLM-based representation learning.

0 Citations
0 Influential
4 Altmetric
20.0 Score
Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

Log in to request an AI analysis.

댓글

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!