4/ Spectron – a spoken language model trained end-to-end to directly process spectrograms; it can be fine-tuned to generate high-quality accurate spoken language; surpasses existing spoken language models in speaker preservation and semantic coherence.
Spectron: End-to-End Spoken Language Model Surpasses Existing Systems
By
–
