This was a neat little read. The idea is to do data augmentation with embeddings: they randomly sample around an embedding and then decode the samples back to text with vec2text. The trick is sampling randomly without leaving the embedding manifold, so they sample within an embedding "cone".
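To make the "cone" idea concrete, here is a rough sketch of one way to sample around an embedding while staying within a fixed angle of it. This is my own guess at the geometry, not the paper's exact procedure: the function name `sample_in_cone`, the `max_angle_rad` parameter, and the choice to draw the angle uniformly are all assumptions. The vec2text decoding step is not shown.

```python
import numpy as np


def sample_in_cone(embedding, max_angle_rad, n_samples, rng=None):
    """Draw vectors within `max_angle_rad` of `embedding`, keeping its norm.

    Hypothetical helper: the angle is drawn uniformly in [0, max_angle_rad],
    which is NOT uniform over the spherical cap; the paper may do otherwise.
    """
    rng = np.random.default_rng() if rng is None else rng
    embedding = np.asarray(embedding, dtype=np.float64)
    norm = np.linalg.norm(embedding)
    if norm == 0:
        raise ValueError("embedding must be non-zero")
    axis = embedding / norm  # unit vector along the cone axis

    # Random directions, with the component along the axis removed,
    # so each row is a unit vector orthogonal to the axis.
    noise = rng.standard_normal((n_samples, embedding.shape[0]))
    noise -= np.outer(noise @ axis, axis)
    noise /= np.linalg.norm(noise, axis=1, keepdims=True)

    # Rotate the axis toward each orthogonal direction by a random angle.
    angles = rng.uniform(0.0, max_angle_rad, size=(n_samples, 1))
    samples = np.cos(angles) * axis + np.sin(angles) * noise

    # Restore the original norm so only the direction is perturbed.
    return norm * samples
```

Each returned row has the same norm as the input and a cosine similarity of at least `cos(max_angle_rad)` with it, so a smaller angle keeps the augmented samples closer to the original meaning. The sampled vectors would then be passed to a vec2text inverter to get text back.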
Data Augmentation via Embedding Cone Sampling with Vec2Text