JMLR

Bayesian Inference of Contextual Bandit Policies via Empirical Likelihood

Authors

Jiangrong Ouyang Mingming Gong Howard Bondell

Research Topics

Bayesian Statistics

View Full Paper

Paper Information

Journal:
Journal of Machine Learning Research
Added to Tracker:
Mar 03, 2026

Abstract

Policy inference plays an essential role in the contextual bandit problem. In this paper, we use empirical likelihood to develop a Bayesian inference method for the joint analysis of multiple contextual bandit policies in finite sample regimes. The proposed inference method is robust to small sample sizes and is able to provide accurate uncertainty measurements for policy value evaluation. In addition, it allows for flexible inferences on policy comparison with full uncertainty quantification. We demonstrate the effectiveness of the proposed inference method using Monte Carlo simulations and its application to an adolescent body mass index data set.

Author Details

Jiangrong Ouyang

Author

Mingming Gong

Author

Howard Bondell

Author

Research Topics & Keywords

Bayesian Statistics

Research Area

Citation Information

APA Format


                                
                                    
                                    Jiangrong Ouyang
                                
                                    
                                        , 
                                    
                                    Mingming Gong
                                
                                    
                                         & 
                                    
                                    Howard Bondell
                                
                                . 
                                Bayesian Inference of Contextual Bandit Policies via Empirical Likelihood. 
                                Journal of Machine Learning Research
                                .

BibTeX Format


@article{paper973,

  title = { Bayesian Inference of Contextual Bandit Policies via Empirical Likelihood },

  author = { 
                                
                                    Jiangrong Ouyang
                                
                                     and Mingming Gong
                                
                                     and Howard Bondell
                                
                                },

  journal = { Journal of Machine Learning Research },



  url = { https://www.jmlr.org/papers/v27/23-0958.html }

}

Back to Papers

View Full Paper More from JMLR