Unified Discrete Diffusion for Categorical Data
Authors
Paper Information
-
Journal:
Journal of Machine Learning Research -
Added to Tracker:
Dec 30, 2025
Abstract
Discrete diffusion models have attracted significant attention for their application to naturally discrete data, such as language and graphs. While discrete-time discrete diffusion has been established for some time, it was only recently that Campbell et al. (2022) introduced the first framework for continuous-time discrete diffusion. However, their training and backward sampling processes significantly differ from those of the discrete-time version, requiring nontrivial approximations for tractability. In this paper, we first introduce a series of generalizations and simplifications of the evidence lower bound (ELBO) that facilitate more accurate and easier optimization both discrete- and continuous-time discrete diffusion. We further establish a unification of discrete- and continuous-time discrete diffusion through shared forward process and backward parameterization. Thanks to this unification, the continuous-time diffusion can now utilize the exact and efficient backward process developed for the discrete-time case, avoiding the need for costly and inexact approximations. Similarly, the discrete-time diffusion now also employ the MCMC corrector, which was previously exclusive to the continuous-time case. Extensive experiments and ablations demonstrate the significant improvement, and we open-source our code at: https://github.com/LingxiaoShawn/USD3.
Author Details
Lingxiao Zhao
AuthorXueying Ding
AuthorLijun Yu
AuthorLeman Akoglu
AuthorCitation Information
APA Format
Lingxiao Zhao
,
Xueying Ding
,
Lijun Yu
&
Leman Akoglu
.
Unified Discrete Diffusion for Categorical Data.
Journal of Machine Learning Research
.
BibTeX Format
@article{paper715,
title = { Unified Discrete Diffusion for Categorical Data },
author = {
Lingxiao Zhao
and Xueying Ding
and Lijun Yu
and Leman Akoglu
},
journal = { Journal of Machine Learning Research },
url = { https://www.jmlr.org/papers/v26/25-0171.html }
}