New piece on emergence in language models by @JacobSteinhardt: https://bounded-regret.ghost.io/emergent-deception-optimization/#fnref7 I found the takeaways quite lucid:
– Capabilities that would lower training loss will emerge in the future
– As models scale up, simple heuristics tend to get replaced by complex ones
Emergence in Language Models: Capabilities and Heuristics
By
–
Leave a Reply