2605.29940v1 May 28, 2026 cs.AI

Make LLM Learn to Synthesize from Streaming Experiences through Feedback

Zhen Bi
Zhen Bi
Citations: 15
h-index: 2
Jungang Lou
Jungang Lou
Citations: 44
h-index: 3
Zhixuan Chu
Zhixuan Chu
Citations: 3
h-index: 1
Zihao Xue
Zihao Xue
Citations: 6
h-index: 2
Longtao Huang
Longtao Huang
Citations: 31
h-index: 2
Zeyu Yang
Zeyu Yang
Citations: 3
h-index: 1
Bin Zhu
Bin Zhu
Citations: 132
h-index: 5
Yan Wang
Yan Wang
Citations: 12
h-index: 2
Zhen-Hua Hu
Zhen-Hua Hu
Citations: 12
h-index: 2
Xiongtao Zhang
Xiongtao Zhang
Citations: 264
h-index: 9

Large language models (LLMs) have been widely adopted for synthetic data generation, significantly reducing annotation costs. However, most existing studies treat synthesis as a set of isolated tasks and overlook a more fundamental question: whether a model can learn to synthesize by accumulating experience from past tasks and transferring it to future ones. In this work, we introduce StreamSynth, a new setting in which synthesis tasks arrive sequentially and experience from historical tasks provides informative signals for future synthesis. To address this setting, we propose SynLearner, a general framework that enables synthesis models to acquire reusable synthesis experience over a task stream. Instead of generating data independently for each task, SynLearner encourages the model to explore diverse synthesis patterns, learn from feedback, and balance sample quality with set-level diversity as tasks evolve. Extensive experiments across multiple benchmarks show that SynLearner effectively leverages experience from earlier tasks to improve synthesis performance on later ones, exhibiting consistent cross-task transferability. These findings provide evidence for the feasibility of StreamSynth and highlight synthetic data generation as an experience-driven process that can benefit from task streams.

0 Citations
0 Influential
4.5 Altmetric
22.5 Score
Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

Log in to request an AI analysis.

댓글

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!