We proposed UPainting to unify simple and complex scene image generation. The model integrates cross-modal guidance into a text-conditional diffusion model. UPainting outperforms other models in caption similarity & image fidelity. More details: https://upainting.github.io
UPainting Unifies Simple and Complex Scene Image Generation