JMLR
Bayesian Inference of Contextual Bandit Policies via Empirical Likelihood
Authors
Jiangrong Ouyang
Mingming Gong
Howard Bondell
Research Topics
Bayesian Statistics
Paper Information
-
Journal:
Journal of Machine Learning Research -
Added to Tracker:
Mar 03, 2026
Abstract
Policy inference plays an essential role in the contextual bandit problem. In this paper, we use empirical likelihood to develop a Bayesian inference method for the joint analysis of multiple contextual bandit policies in finite sample regimes. The proposed inference method is robust to small sample sizes and is able to provide accurate uncertainty measurements for policy value evaluation. In addition, it allows for flexible inferences on policy comparison with full uncertainty quantification. We demonstrate the effectiveness of the proposed inference method using Monte Carlo simulations and its application to an adolescent body mass index data set.
Author Details
Jiangrong Ouyang
AuthorMingming Gong
AuthorHoward Bondell
AuthorResearch Topics & Keywords
Bayesian Statistics
Research AreaCitation Information
APA Format
Jiangrong Ouyang
,
Mingming Gong
&
Howard Bondell
.
Bayesian Inference of Contextual Bandit Policies via Empirical Likelihood.
Journal of Machine Learning Research
.
BibTeX Format
@article{paper973,
title = { Bayesian Inference of Contextual Bandit Policies via Empirical Likelihood },
author = {
Jiangrong Ouyang
and Mingming Gong
and Howard Bondell
},
journal = { Journal of Machine Learning Research },
url = { https://www.jmlr.org/papers/v27/23-0958.html }
}