AI Dynamics

Global AI News Aggregator

AVFormer Achieves State-of-the-Art Audiovisual Speech Recognition

Presenting AVFormer, a simple method for injecting visual information into frozen speech models for zero-shot audiovisual (AV) automatic speech recognition (ASR). Read about how AVFormer achieves state-of-the-art AV-ASR performance and more → https://
goo.gle/3IU40P3

→ View original post on X — @googleai,

Commentaires

Leave a Reply

Your email address will not be published. Required fields are marked *