for sure, the probabilistic breakdown is the same: given text sequence x and its embedding e, p(x) = p(x | e) p(e). this looks a lot like a latent variable model with latent embedding e in my case p(x | e) is done by vec2text with openAI embeddings; we just need to learn p(e)
Probabilistic Models and Text Embeddings: Learning p(e)
By
–