🗓 Session 1C — 🕦 11:30 AM SGT.
Title: Learning to Discretize Denoising Diffusion ODEs.
Come by if you're into #GenerativeAI / #DiffusionModels
🗓 Session 1C — 🕦 11:30 AM SGT.
Title: Learning to Discretize Denoising Diffusion ODEs.
Come by if you're into #GenerativeAI / #DiffusionModels
LD3 can be applied to diffusion models in other domains, such as molecular docking.
LD3 can be applied to diffusion models in other domains, such as molecular docking.
LD3 can be trained on a single GPU in under one hour. For smaller datasets like CIFAR-10, training can be completed in less than 6 minutes.
LD3 can be trained on a single GPU in under one hour. For smaller datasets like CIFAR-10, training can be completed in less than 6 minutes.
LD3 significantly improves sample quality.
LD3 significantly improves sample quality.
This surrogate loss is theoretically close to the original distillation objective, leading to better convergence and avoiding underfitting.
This surrogate loss is theoretically close to the original distillation objective, leading to better convergence and avoiding underfitting.
A potential problem with the student model is its limited capacity. To address this, we propose a soft surrogate loss, simplifying the student's optimization task.
A potential problem with the student model is its limited capacity. To address this, we propose a soft surrogate loss, simplifying the student's optimization task.
[1/n]
We propose LD3, a lightweight framework that learns the optimal time discretization for sampling from pre-trained Diffusion Probabilistic Models (DPMs).
[1/n]
We propose LD3, a lightweight framework that learns the optimal time discretization for sampling from pre-trained Diffusion Probabilistic Models (DPMs).