Research we co-authored on subliminal learning (how LLMs can pass on traits like preferences or misalignment through hidden signals in data) was published today in @Nature. Read the paper: https://nature.com/articles/s41586-026-10319-8…
LLMs Pass Hidden Traits Through Subliminal Learning Signals