Feature Learning in Finite-Width Bayesian Deep Linear Networks with Multiple Outputs and Convolutional Layers
Authors
Research Topics
Paper Information
-
Journal:
Journal of Machine Learning Research -
Added to Tracker:
Jul 15, 2025
Abstract
Deep linear networks have been extensively studied, as they provide simplified models of deep learning. However, little is known in the case of finite-width architectures with multiple outputs and convolutional layers. In this manuscript, we provide rigorous results for the statistics of functions implemented by the aforementioned class of networks, thus moving closer to a complete characterization of feature learning in the Bayesian setting. Our results include: (i) an exact and elementary non-asymptotic integral representation for the joint prior distribution over the outputs, given in terms of a mixture of Gaussians; (ii) an analytical formula for the posterior distribution in the case of squared error loss function (Gaussian likelihood); (iii) a quantitative description of the feature learning infinite-width regime, using large deviation theory. From a physical perspective, deep architectures with multiple outputs or convolutional layers represent different manifestations of kernel shape renormalization, and our work provides a dictionary that translates this physics intuition and terminology into rigorous Bayesian statistics.
Author Details
Federico Bassetti
AuthorMarco Gherardi
AuthorAlessandro Ingrosso
AuthorMauro Pastore
AuthorPietro Rotondo
AuthorResearch Topics & Keywords
Bayesian Statistics
Research AreaCitation Information
APA Format
Federico Bassetti
,
Marco Gherardi
,
Alessandro Ingrosso
,
Mauro Pastore
&
Pietro Rotondo
.
Feature Learning in Finite-Width Bayesian Deep Linear Networks with Multiple Outputs and Convolutional Layers.
Journal of Machine Learning Research
.
BibTeX Format
@article{JMLR:v26:24-1158,
author = {Federico Bassetti and Marco Gherardi and Alessandro Ingrosso and Mauro Pastore and Pietro Rotondo},
title = {Feature Learning in Finite-Width Bayesian Deep Linear Networks with Multiple Outputs and Convolutional Layers},
journal = {Journal of Machine Learning Research},
year = {2025},
volume = {26},
number = {88},
pages = {1--35},
url = {http://jmlr.org/papers/v26/24-1158.html}
}