Yep! It’s SoTA or near-SoTA at the moment. It serves as the key piece in a TTS/TTA pipeline, since it lets us encode audio into discrete representations and then perform language modelling on those tokens.
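To make "discrete representations" a bit more concrete, here is a toy sketch of the general idea, not the model's actual method: cut a signal into frames and replace each frame with the index of its nearest codebook vector. The helper names (`frame_signal`, `quantize`), the frame size, and the random codebook are all assumptions for illustration; a real neural codec learns both its encoder and its codebook.

```python
import math
import random


def frame_signal(samples, frame_size):
    """Split a list of samples into non-overlapping frames of frame_size."""
    last_start = len(samples) - frame_size
    return [samples[i:i + frame_size] for i in range(0, last_start + 1, frame_size)]


def quantize(frames, codebook):
    """Map each frame to the index of its nearest codebook vector (Euclidean)."""
    tokens = []
    for frame in frames:
        distances = [math.dist(frame, code) for code in codebook]
        tokens.append(distances.index(min(distances)))
    return tokens


# Toy "audio": a 5 Hz sine sampled at 64 Hz for one second (hypothetical values).
samples = [math.sin(2 * math.pi * 5 * t / 64) for t in range(64)]
frames = frame_signal(samples, frame_size=4)

# Hypothetical codebook of 8 random vectors; a trained codec would learn these.
random.seed(0)
codebook = [[random.uniform(-1, 1) for _ in range(4)] for _ in range(8)]

# Each frame becomes one integer token, the kind of sequence a language model can consume.
tokens = quantize(frames, codebook)
print(tokens)
```

The resulting list of integers plays the same role as word-piece ids in text: a language model can be trained to predict the next token, and a decoder would map tokens back to audio.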
SoTA Audio Encoding for TTS/TTA Language Modeling Pipeline