Fuck yeah! MaskGCT – New open SoTA Text to Speech model! 🔥
— Vaibhav (VB) Srivastav (@reach_vb) October 30, 2024
> Zero-shot voice cloning
> Emotional TTS
> Trained on 100K hours of data
> Long form synthesis
> Variable speed synthesis
> Bilingual – Chinese & English
> Available on Hugging Face
Fully non-autoregressive… pic.twitter.com/CAUX6cTiAG