Estimating Ratios of Means of Multicategory Data Observed with Sample and Category Perturbations
Authors
Paper Information
-
Journal:
Biometrika -
DOI:
10.1093/biomet/asag009 -
Published:
February 13, 2026 -
Added to Tracker:
Feb 14, 2026
Abstract
Summay We consider the problem of estimating ratios of means of a multivariate outcome across covariates when the data are observed with unknown sample-specific and category-specific perturbations. Our model admits a partially identifiable estimand, and we establish full identifiability by imposing interpretable parameter constraints. To reduce bias and guarantee the existence of estimators in the presence of sparse observations, we apply an asymptotically negligible and constraint-invariant penalty to the loss function. We develop a fast coordinate-descent algorithm for estimation, and an augmented Lagrangian algorithm for estimation under null hypotheses. We construct a model-robust score test and demonstrate valid inference even for small sample sizes and under violated distributional assumptions. The flexibility of the approach, and comparisons with related methods, are illustrated through a simulation study and a meta-analysis of microbial associations with colorectal cancer.
Author Details
D S Clausen
AuthorS V Teichman
AuthorA D Willis
AuthorCitation Information
APA Format
D S Clausen
,
S V Teichman
&
A D Willis
(2026)
.
Estimating Ratios of Means of Multicategory Data Observed with Sample and Category Perturbations.
Biometrika
, 10.1093/biomet/asag009.
BibTeX Format
@article{paper902,
title = { Estimating Ratios of Means of Multicategory Data Observed with Sample and Category Perturbations },
author = {
D S Clausen
and S V Teichman
and A D Willis
},
journal = { Biometrika },
year = { 2026 },
doi = { 10.1093/biomet/asag009 },
url = { https://doi.org/10.1093/biomet/asag009 }
}