JMLR

Asymptotic Inference for Multi-Stage Stationary Treatment Policy with Variable Selection

Authors

Donglin Zeng Yufeng Liu Daiqi Gao

View Full Paper

Paper Information

Journal:
Journal of Machine Learning Research
Added to Tracker:
Sep 08, 2025

Abstract

Dynamic treatment regimes or policies are a sequence of decision functions over multiple stages that are tailored to individual features. One important class of treatment policies in practice, namely multi-stage stationary treatment policies, prescribes treatment assignment probabilities using the same decision function across stages, where the decision is based on the same set of features consisting of time-evolving variables (e.g., routinely collected disease biomarkers). Although there has been extensive literature on constructing valid inference for the value function associated with dynamic treatment policies, little work has focused on the policies themselves, especially in the presence of high-dimensional features. We aim to fill the gap in this work. Specifically, we first obtain the multi-stage stationary treatment policy by minimizing the negative augmented inverse probability weighted estimator of the value function to increase asymptotic efficiency. An $L_1$ penalty is applied on the policy parameters to select important features. We then construct one-step improvements of the policy parameter estimators for valid inference. Theoretically, we show that the improved estimators are asymptotically normal, even if nuisance parameters are estimated at a slow convergence rate and the dimension of the features increases with the sample size. Our numerical studies demonstrate that the proposed method estimates a sparse policy with a near-optimal value function and conducts valid inference for the policy parameters.

Author Details

Donglin Zeng

Author

Yufeng Liu

Author

Daiqi Gao

Author

Citation Information

APA Format


                                
                                    
                                    Donglin Zeng
                                
                                    
                                        , 
                                    
                                    Yufeng Liu
                                
                                    
                                         & 
                                    
                                    Daiqi Gao
                                
                                . 
                                Asymptotic Inference for Multi-Stage Stationary Treatment Policy with Variable Selection. 
                                Journal of Machine Learning Research
                                .

BibTeX Format


@article{paper488,

  title = { Asymptotic Inference for Multi-Stage Stationary Treatment Policy with Variable Selection },

  author = { 
                                
                                    Donglin Zeng
                                
                                     and Yufeng Liu
                                
                                     and Daiqi Gao
                                
                                },

  journal = { Journal of Machine Learning Research },



  url = { https://www.jmlr.org/papers/v26/23-0660.html }

}

Back to Papers

View Full Paper More from JMLR