Biometrika Nov 19, 2025

Sparse higher order partial least squares for simultaneous variable selection, dimension reduction and tensor denoising

Authors
Kwangmoon Park Sündüz Keleş
Research Topics
High-Dimensional Statistics
Paper Information
  • Journal:
    Biometrika
  • DOI:
    10.1093/biomet/asaf082
  • Published:
    November 19, 2025
  • Added to Tracker:
    Feb 10, 2026
Abstract

Abstract Motivated by the challenge of estimating effects of DNA methylation on 3D genomic contacts captured by multi-modal single cell Hi-C data, we consider the tensor response partial least squares (PLS) model with Y=B×1X+F, where the correlated high-dimensional predictors X∈Rn×d1 and sparse and noisy high-dimensional responses Y∈Rn×∏mdm are observed, the low rank and sparse partial least squares coefficient tensor B∈R∏mdm is unknown, and F∈Rn×∏mdm is the noise tensor. In this work, we study the problem of estimating the partial least squares coefficient B and identifying its active entries in the tensor partial least squares framework. We show that the consistency of the existing tensor partial least squares estimator (Zhao et al., 2012) cannot be guaranteed under a high-dimensional regime in which both the number of predictors and the response tensor dimensions grow faster than the sample size. To address this high-dimensionality, we propose the sparse higher order partial least squares estimator and an accompanying algorithm that simultaneously perform variable selection, dimension reduction and tensor response denoising. We establish asymptotic guarantees for SHOPS in the high-dimensional regime and validate these results through comprehensive simulation studies, demonstrating SHOPS‘s advantages over baseline approaches. Finally, application of SHOPS to the motivating multi-modal single-cell Hi-C data reveals novel biological insights into gene regulation by multiple regulatory elements.

Author Details
Kwangmoon Park
Author
Sündüz Keleş
Author
Research Topics & Keywords
High-Dimensional Statistics
Research Area
Citation Information
APA Format
Kwangmoon Park & Sündüz Keleş (2025) . Sparse higher order partial least squares for simultaneous variable selection, dimension reduction and tensor denoising. Biometrika , 10.1093/biomet/asaf082.
BibTeX Format
@article{paper878,
  title = { Sparse higher order partial least squares for simultaneous variable selection, dimension reduction and tensor denoising },
  author = { Kwangmoon Park and Sündüz Keleş },
  journal = { Biometrika },
  year = { 2025 },
  doi = { 10.1093/biomet/asaf082 },
  url = { https://doi.org/10.1093/biomet/asaf082 }
}