AI Dynamics

Global AI News Aggregator

Tencent SRPO: Aligning Diffusion Models with Human Preferences

Tencent has released SRPO on Hugging Face. The accompanying work, "Directly Aligning the Full Diffusion Trajectory with Fine-Grained Human Preference," fine-tunes the FLUX.1-dev model with optimized denoising and online reward adjustment, improving its human-evaluated realism and aesthetic quality by over 3x.

→ View original post on X: @_akhaliq
