AI Dynamics

Global AI News Aggregator

A2D-VL 7B: Diffusion-Based Parallel Vision-Language Model

We trained a state-of-the-art diffusion VLM, A2D-VL 7B for parallel generation by finetuning an existing autoregressive VLM on the diffusion language modeling task, using the masked diffusion framework which "noises" tokens by masking them and "de-noises" tokens by predicting the

→ View original post on X — @runwayml,

Commentaires

Leave a Reply

Your email address will not be published. Required fields are marked *