another incredibly underrated paper: Thinking Like Transformers (Weiss et al, 2021) presents RASP: a programming language that compiles to transformer *weights*. can implement sort(), bincount(), etc. seems important. why don't interpretability people care about this?
RASP Programming Language Compiles Algorithms to Transformer Weights
By
–
Leave a Reply