@jxmnop - AI Dynamics

machine learning research question:
what’s an idea that you think would catch on, if only someone spent the money to test it at scale? i’ll go first: tokenization-free transformers
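The post does not say which tokenization-free approach is meant. One common reading, assumed here purely for illustration, is to feed the model raw UTF-8 bytes, so the "vocabulary" is just the 256 possible byte values and no learned tokenizer is needed. The helper names `bytes_to_ids` and `ids_to_text` are hypothetical.

```python
def bytes_to_ids(text: str) -> list[int]:
    """Map text to a sequence of integer ids in [0, 255], one per UTF-8 byte."""
    return list(text.encode("utf-8"))


def ids_to_text(ids: list[int]) -> str:
    """Invert bytes_to_ids; invalid byte sequences are replaced, not raised."""
    return bytes(ids).decode("utf-8", errors="replace")


# Non-ASCII characters expand to several bytes, so sequences get longer
# than with a subword tokenizer; that is the usual cost of this approach.
ids = bytes_to_ids("héllo")
roundtrip = ids_to_text(ids)
```

The trade-off in this sketch is a tiny, fixed vocabulary in exchange for longer input sequences, which is one reason such a model would be expensive to test at scale.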

→ View original post on X — @jxmnop

14 December 2023

Fixed Embedding Spaces and Conditional Generation in AI Models

By @jxmnop – 13 December 2023 20h39

thankful twitter hasn't implemented a peer review process yet. unlike latent diffusion, in this case the embedding space is fixed (it's OpenAI ada 2 in my notebook). I think this is kind of like conditional generation

Probabilistic Models and Text Embeddings: Learning p(e)

By @jxmnop – 13 December 2023 17h34

for sure, the probabilistic breakdown is the same: given text sequence x and its embedding e, p(x) = p(x | e) p(e). this looks a lot like a latent variable model with latent embedding e. in my case p(x | e) is done by vec2text with OpenAI embeddings; we just need to learn p(e)
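The factorization can be spelled out in one extra step. This is a sketch under an assumption the post does not state: the embedding is a deterministic function e = f(x) of the text, treated as discrete for simplicity, so the joint p(x, e) is zero unless e = f(x).

```latex
% Assumption: e = f(x) is a deterministic embedding of x (discrete case).
\begin{align}
p(x) &= \sum_{e} p(x, e) \\
     &= p\bigl(x,\, e = f(x)\bigr) \\
     &= p\bigl(x \mid e = f(x)\bigr)\, p\bigl(e = f(x)\bigr)
\end{align}
% Generation then follows by ancestral sampling:
%   e \sim p(e), \qquad x \sim p(x \mid e)
```

With p(x | e) handled by an embedding inverter, the only term left to learn is the prior p(e).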

Language Model Fine-Tuning in Embedding Space

By @jxmnop – 13 December 2023 16h42

fun idea I tested out this morning: Language model fine-tuning in embedding space. here's the idea: learn a model of *embeddings* of a certain text distribution; then, to generate text, sample an embedding and map it back to text with vec2text. this lets us generate language without
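The recipe above can be sketched roughly as follows. The post does not say what model of embeddings was used, so a diagonal Gaussian is swapped in here as the simplest stand-in. The `invert` argument is a hypothetical callable standing in for an embedding-to-text inverter such as vec2text, the toy data replaces real text embeddings, and rescaling samples to unit length is an assumption that the target embedding space is unit-normalized.

```python
import numpy as np


def fit_gaussian(embeddings):
    """Fit a diagonal Gaussian to the rows of `embeddings` (a stand-in for p(e))."""
    mean = embeddings.mean(axis=0)
    std = embeddings.std(axis=0) + 1e-6  # avoid zero variance
    return mean, std


def sample_embeddings(mean, std, n, seed=0):
    """Draw n vectors from the fitted Gaussian and rescale each to unit length."""
    rng = np.random.default_rng(seed)
    samples = mean + std * rng.standard_normal((n, mean.shape[0]))
    return samples / np.linalg.norm(samples, axis=1, keepdims=True)


def generate(mean, std, n, invert, seed=0):
    """Sample embeddings, then map them back to text with `invert` (hypothetical)."""
    return invert(sample_embeddings(mean, std, n, seed))


# Toy stand-in for embeddings of a text distribution (not real model output).
rng = np.random.default_rng(0)
toy = rng.standard_normal((200, 16)) + 3.0
toy /= np.linalg.norm(toy, axis=1, keepdims=True)

mean, std = fit_gaussian(toy)
samples = sample_embeddings(mean, std, 5)

# Placeholder inverter: a real one would return decoded text per vector.
placeholder_invert = lambda E: [f"<text for vector {i}>" for i in range(len(E))]
texts = generate(mean, std, 5, invert=placeholder_invert)
```

A richer density model could replace the Gaussian without changing the overall shape of the pipeline: fit p(e), sample, invert.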

BPE Tokenization Dies: 2024 Marks Major Transformer Evolution

By @jxmnop – 11 December 2023 17h07

i can see it now: 2024 will be remembered as the year BPE died. tokenization is by far the clunkiest part of a transformer; one last remaining bit of inelegance in an otherwise hyperoptimized model architecture. time for it to go
