JMLR

On Model Identification and Out-of-Sample Prediction of PCR with Applications to Synthetic Controls

Authors

Devavrat Shah Anish Agarwal Dennis Shen

Research Topics

Statistical Learning

View Full Paper

Paper Information

Journal:
Journal of Machine Learning Research
Added to Tracker:
Sep 08, 2025

Abstract

We analyze principal component regression (PCR) in a high-dimensional error-in-variables setting with fixed design. Under suitable conditions, we show that PCR consistently identifies the unique model with minimum $\ell_2$-norm. These results enable us to establish non-asymptotic out-of-sample prediction guarantees that improve upon the best known rates. In the course of our analysis, we introduce a natural linear algebraic condition between the in- and out-of-sample covariates, which allows us to avoid distributional assumptions for out-of-sample predictions. Our simulations illustrate the importance of this condition for generalization, even under covariate shifts. Accordingly, we construct a hypothesis test to check when this condition holds in practice. As a byproduct, our results also lead to novel results for the synthetic controls literature, a leading approach for policy evaluation. To the best of our knowledge, our prediction guarantees for the fixed design setting have been elusive in both the high-dimensional error-in-variables and synthetic controls literatures.

Author Details

Devavrat Shah

Author

Anish Agarwal

Author

Dennis Shen

Author

Research Topics & Keywords

Statistical Learning

Research Area

Citation Information

APA Format


                                
                                    
                                    Devavrat Shah
                                
                                    
                                        , 
                                    
                                    Anish Agarwal
                                
                                    
                                         & 
                                    
                                    Dennis Shen
                                
                                . 
                                On Model Identification and Out-of-Sample Prediction of PCR with Applications to Synthetic Controls. 
                                Journal of Machine Learning Research
                                .

BibTeX Format


@article{paper538,

  title = { On Model Identification and Out-of-Sample Prediction of PCR with Applications to Synthetic Controls },

  author = { 
                                
                                    Devavrat Shah
                                
                                     and Anish Agarwal
                                
                                     and Dennis Shen
                                
                                },

  journal = { Journal of Machine Learning Research },



  url = { https://www.jmlr.org/papers/v26/23-0102.html }

}

Back to Papers

View Full Paper More from JMLR