Skip to content

Modernize networks, data norm, and DDIM

Just implement everything from this paper.

image

  • Replace resnet w ViT, lower LR on vision encoder
  • Replace U-Net w DiT
  • Adjusted DDIM denoising
  • Data normalization and clipping
Edited by Kevin Haninger