JMLR

Bayesian Inference of Contextual Bandit Policies via Empirical Likelihood

Authors
Jiangrong Ouyang Mingming Gong Howard Bondell
Research Topics
Bayesian Statistics
Paper Information
  • Journal:
    Journal of Machine Learning Research
  • Added to Tracker:
    Mar 03, 2026
Abstract

Policy inference plays an essential role in the contextual bandit problem. In this paper, we use empirical likelihood to develop a Bayesian inference method for the joint analysis of multiple contextual bandit policies in finite sample regimes. The proposed inference method is robust to small sample sizes and is able to provide accurate uncertainty measurements for policy value evaluation. In addition, it allows for flexible inferences on policy comparison with full uncertainty quantification. We demonstrate the effectiveness of the proposed inference method using Monte Carlo simulations and its application to an adolescent body mass index data set.

Author Details
Jiangrong Ouyang
Author
Mingming Gong
Author
Howard Bondell
Author
Research Topics & Keywords
Bayesian Statistics
Research Area
Citation Information
APA Format
Jiangrong Ouyang , Mingming Gong & Howard Bondell . Bayesian Inference of Contextual Bandit Policies via Empirical Likelihood. Journal of Machine Learning Research .
BibTeX Format
@article{paper973,
  title = { Bayesian Inference of Contextual Bandit Policies via Empirical Likelihood },
  author = { Jiangrong Ouyang and Mingming Gong and Howard Bondell },
  journal = { Journal of Machine Learning Research },
  url = { https://www.jmlr.org/papers/v27/23-0958.html }
}