Vision Foundation Models Can Be Good Tokenizers for Latent Diffusion Models
Paper
•
2510.18457
•
Published
•
3
Pretrained checkpoints, features, and samples for VFM-VAE,
introduced in the paper:
Tianci Bi et al., “Vision Foundation Models Can Be Good Tokenizers for Latent Diffusion Models”, arXiv:2510.18457