JASA
On the Algorithmic Bias of Aligning Large Language Models with RLHF: Preference Collapse and Matching Regularization
Authors
Cong Fang
Weijie J. Su
Jiancong Xiao
Ziniu Li
Xingyu Xie
Emily Getzen
Qi Long
Research Topics
High-Dimensional Statistics
Computational Statistics
Paper Information
-
Journal:
Journal of the American Statistical Association -
DOI:
10.1080/01621459.2025.2555067 -
Added to Tracker:
Sep 24, 2025
Author Details
Cong Fang
AuthorWeijie J. Su
AuthorJiancong Xiao
AuthorZiniu Li
AuthorXingyu Xie
AuthorEmily Getzen
AuthorQi Long
AuthorResearch Topics & Keywords
High-Dimensional Statistics
Research AreaComputational Statistics
Research AreaCitation Information
APA Format
Cong Fang
,
Weijie J. Su
,
Jiancong Xiao
,
Ziniu Li
,
Xingyu Xie
,
Emily Getzen
&
Qi Long
.
On the Algorithmic Bias of Aligning Large Language Models with RLHF: Preference Collapse and Matching Regularization.
Journal of the American Statistical Association
, 10.1080/01621459.2025.2555067.
BibTeX Format
@article{paper541,
title = { On the Algorithmic Bias of Aligning Large Language Models with RLHF: Preference Collapse and Matching Regularization },
author = {
Cong Fang
and Weijie J. Su
and Jiancong Xiao
and Ziniu Li
and Xingyu Xie
and Emily Getzen
and Qi Long
},
journal = { Journal of the American Statistical Association },
doi = { 10.1080/01621459.2025.2555067 },
url = { https://www.tandfonline.com/doi/full/10.1080/01621459.2025.2555067 }
}