Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong General Audio Event Taggers paper page: https://
huggingface.co/papers/2307.03
183
… In this paper, we focus on Whisper, a recent automatic speech recognition model trained with a massive 680k hour labeled speech corpus recorded
Whisper-AT: Noise-Robust Speech Recognition and Audio Event Tagging
By
–
