JMLR

Assumption-lean and data-adaptive post-prediction inference

Authors
Jiacheng Miao Xinran Miao Yixuan Wu Jiwei Zhao Qiongshi Lu
Research Topics
Statistical Learning
Paper Information
  • Journal:
    Journal of Machine Learning Research
  • Added to Tracker:
    Sep 08, 2025
Abstract

A primary challenge facing modern scientific research is the limited availability of gold-standard data, which can be costly, labor-intensive, or invasive to obtain. With the rapid development of machine learning (ML), scientists can now employ ML algorithms to predict gold-standard outcomes using variables that are easier to obtain. However, these predicted outcomes are often used directly in subsequent statistical analyses, ignoring imprecision and heterogeneity introduced by the prediction procedure. This will likely result in false positive findings and invalid scientific conclusions. In this work, we introduce PoSt-Prediction Adaptive inference (PSPA) that allows valid and powerful inference based on ML-predicted data. Its “assumption-lean” property guarantees reliable statistical inference without assumptions on the ML prediction. Its “data-adaptive” feature guarantees an efficiency gain over existing methods, regardless of the accuracy of ML prediction. We demonstrate the statistical superiority and broad applicability of our method through simulations and real-data applications.

Author Details
Jiacheng Miao
Author
Xinran Miao
Author
Yixuan Wu
Author
Jiwei Zhao
Author
Qiongshi Lu
Author
Research Topics & Keywords
Statistical Learning
Research Area
Citation Information
APA Format
Jiacheng Miao , Xinran Miao , Yixuan Wu , Jiwei Zhao & Qiongshi Lu . Assumption-lean and data-adaptive post-prediction inference. Journal of Machine Learning Research .
BibTeX Format
@article{paper476,
  title = { Assumption-lean and data-adaptive post-prediction inference },
  author = { Jiacheng Miao and Xinran Miao and Yixuan Wu and Jiwei Zhao and Qiongshi Lu },
  journal = { Journal of Machine Learning Research },
  url = { https://www.jmlr.org/papers/v26/24-0056.html }
}