Statistical Guarantees of Deep Generative Models Involving Diverse Spaces: Generation Consistency and Robustness

dc.contributor.author: Chakrabarty, Anish
dc.date.accessioned: 2026-02-06T10:04:14Z
dc.date.available: 2026-02-06T10:04:14Z
dc.date.issued: 2026-02-04
dc.description: This thesis is under the supervision of Prof. Swagatam Das and Prof. Probal Chaudhuri [en_US]
dc.description.abstract: Generative modeling focuses on the task of producing new data samples that closely resemble those drawn from an original, unknown distribution. Although well known in statistical estimation theory, the approach has gained substantial traction in recent years, driven by groundbreaking results in areas such as image synthesis, natural language generation, and network modeling. The complexity of modern data domains, and the adaptations that suitable models must undergo as a result, have presented new challenges. These advances raise several fundamental questions, the first of which is: When do generative models accurately approximate the true data distribution? One may also ask: How well do these models perform under contaminated data? This work explores these questions through the lens of generative modeling frameworks that, by design, involve distinct data spaces. We focus on two major classes of such models that blend optimal transport and representation learning in their objectives: Wasserstein autoencoders (WAE) and cycle-consistent cross-domain translators. A WAE, on its way to regeneration, learns a latent code, which in turn aids the simulation of new pseudo-random replicates. By providing statistical characterizations of the latent distribution and of the transforms that induce dimensionality reduction in the process, we present a detailed error analysis underlying WAEs. From a non-parametric density estimation perspective, we establish deterministic bounds on the latent and reconstruction errors that adapt to the intrinsic dimensions of the input data. We also study the extent of distortion that WAE-generated samples suffer when the model is learned using contaminated data. Key takeaways for practitioners from our analysis include specific architectural suggestions that foster near-perfect sampling. The framework developed thus far extends naturally to unpaired cycle-consistent cross-domain models. We show that the sufficient conditions for successful data translation under Sobolev and Hölder-smooth distributions resemble those in the case of WAEs. Our analysis also suggests error upper bounds due to ill-posed transformations and validates the choice of divergences used in objectives. Finally, in search of a consolidated solution to the robustification problem, we present parallel formulations based on the Gromov-Wasserstein (GW) distance. Due to the equivalence of Gromov-Monge (GM) samplers, following GW, and cross-domain translation models, including WAE and GWAE, this answers the second question. We study the robust recovery guarantees, concentration, and tractable computational properties of the newly introduced distance measures under diverse contamination scenarios. We substantiate all our findings using real-world data in varying generative tasks. [en_US]
dc.identifier.citation: 182p. [en_US]
dc.identifier.uri: http://hdl.handle.net/10263/7646
dc.language.iso: en [en_US]
dc.publisher: Indian Statistical Institute, Kolkata [en_US]
dc.relation.ispartofseries: ISI PhD Thesis;TH675
dc.subject: Deep Generative Models, Robustness, Optimal Transport [en_US]
dc.title: Statistical Guarantees of Deep Generative Models Involving Diverse Spaces: Generation Consistency and Robustness [en_US]
dc.type: Thesis [en_US]

Files

Original bundle

Name: Form 17-Anish Chakrabarty.pdf
Size: 391.95 KB
Format: Adobe Portable Document Format
Description: Form 17

Name: Thesis-Anish Chakrabarty.pdf
Size: 23.46 MB
Format: Adobe Portable Document Format
Description: Thesis

License bundle

Name: license.txt
Size: 1.71 KB
Format: Item-specific license agreed upon to submission