JASA

On the Algorithmic Bias of Aligning Large Language Models with RLHF: Preference Collapse and Matching Regularization

Authors
Jiancong Xiao Ziniu Li Xingyu Xie Emily Getzen Cong Fang Qi Long Weijie J. Su
Research Topics
High-Dimensional Statistics Computational Statistics
Paper Information
  • Journal:
    Journal of the American Statistical Association
  • DOI:
    10.1080/01621459.2025.2555067
  • Added to Tracker:
    Sep 24, 2025
Author Details
Jiancong Xiao
Author
Ziniu Li
Author
Xingyu Xie
Author
Emily Getzen
Author
Cong Fang
Author
Qi Long
Author
Weijie J. Su
Author
Research Topics & Keywords
High-Dimensional Statistics
Research Area
Computational Statistics
Research Area
Citation Information
APA Format
Jiancong Xiao , Ziniu Li , Xingyu Xie , Emily Getzen , Cong Fang , Qi Long & Weijie J. Su . On the Algorithmic Bias of Aligning Large Language Models with RLHF: Preference Collapse and Matching Regularization. Journal of the American Statistical Association , 10.1080/01621459.2025.2555067.
BibTeX Format
@article{paper541,
  title = { On the Algorithmic Bias of Aligning Large Language Models with RLHF: Preference Collapse and Matching Regularization },
  author = { Jiancong Xiao and Ziniu Li and Xingyu Xie and Emily Getzen and Cong Fang and Qi Long and Weijie J. Su },
  journal = { Journal of the American Statistical Association },
  doi = { 10.1080/01621459.2025.2555067 },
  url = { https://www.tandfonline.com/doi/full/10.1080/01621459.2025.2555067 }
}