AI Dynamics

Global AI News Aggregator

About

AVFormer Achieves State-of-the-Art Audiovisual Speech Recognition

Presenting AVFormer, a simple method for injecting visual information into frozen speech models for zero-shot audiovisual (AV) automatic speech recognition (ASR). Read about how AVFormer achieves state-of-the-art AV-ASR performance and more → https://
goo.gle/3IU40P3

→ View original post on X — @googleai