6/ MERT – an acoustic music understanding model with large-scale self-supervised training; it incorporates a superior combination of teacher models to outperform conventional speech and audio approaches.
MERT: Self-Supervised Acoustic Music Understanding Model
By
–
