← Back to Lobby
arXiv (CS.AI) 2026-06-24 12:00 DOI: arXiv:2606.23920

Catastrophic Compositional Generation: Why Vanilla Diffusion Models Fail to Extrapolate

Abstract

arXiv:2606.23920v1 Announce Type: cross Abstract: The task of compositional generation involves using a conditional generative model, trained only on a subset of the possible conditions, to produce samples from compositionally-defined target distributions such as a geometric combination of the source distributions. In this work, we argue that this task is often infeasible for vanilla conditional diffusion models: we conjecture that no inference-time technique can efficiently produce samples from the target distribution in certain well-motivated settings. This idea is supported by theory-guided generalization arguments and carefully-designed experiments on both synthetic and realistic data. In particular, while recent methods such as Feynman-Kac correction reduce inference-time approximation error, our results show that score estimation error has a more catastrophic effect on performance when the target distribution is out-of-distribution with respect to the sources, highlighting the need for a different approach to this task.

Peer Discussions

Sign in with a scholar account to comment or like.

Sign in now

No discussions yet.