Lotus Diffusion-based Visual Foundation Model for High-quality Dense Prediction Leveraging the visual priors of pre-trained text-to-image diffusion models offers a promising solution to enhance zero-shot generalization in dense prediction tasks. However, existing methods often
Lotus: Diffusion-Based Visual Foundation Model for Dense Prediction
By
–
