Recursive learning without collapse: a weighting-based stabilization framework
Authors
Paper Information
-
Journal:
Journal of the Royal Statistical Society Series B -
DOI:
10.1093/jrsssb/qkag099 -
Published:
July 01, 2026 -
Added to Tracker:
Jul 02, 2026
Abstract
Abstract Recent studies identified an intriguing phenomenon in recursive generative model training known as model collapse, where models trained on data generated by previous models exhibit severe performance degradation. Addressing this issue and developing more effective training strategies have become central challenges in generative model research. In this paper, we investigate this phenomenon within a novel framework, where generative models are iteratively trained on a combination of newly collected real data and synthetic data from the previous training step. To develop an optimal training strategy for integrating real and synthetic data, we evaluate the performance of a weighted training scheme in various scenarios, including Gaussian distribution estimation, generalized linear models, and non-parametric estimation. We theoretically characterize the impact of the mixing proportion and weighting scheme of synthetic data on the final model’s performance. Our key finding is that, across different settings, the optimal weighting scheme under different proportions of synthetic data asymptotically follows a unified expression, revealing a fundamental trade-off between leveraging synthetic data and model performance. In some cases, the optimal weight assigned to real data corresponds to the reciprocal of the golden ratio. Finally, we validate our theoretical results on extensive simulated datasets and a real tabular dataset.
Author Details
Guang Cheng
AuthorShirong Xu
AuthorHengzhi He
AuthorCitation Information
APA Format
Guang Cheng
,
Shirong Xu
&
Hengzhi He
(2026)
.
Recursive learning without collapse: a weighting-based stabilization framework.
Journal of the Royal Statistical Society Series B
, 10.1093/jrsssb/qkag099.
BibTeX Format
@article{paper1359,
title = { Recursive learning without collapse: a weighting-based stabilization framework },
author = {
Guang Cheng
and Shirong Xu
and Hengzhi He
},
journal = { Journal of the Royal Statistical Society Series B },
year = { 2026 },
doi = { 10.1093/jrsssb/qkag099 },
url = { https://doi.org/10.1093/jrsssb/qkag099 }
}