We train latent diffusion models, replacing the commonly-used U-Net backbone with a transformer that operates on latent patches. We analyze the scalability of our Diffusion Transformers (DiTs) through ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results