[In a world… fully aligned…]
TEENAGER: “Prompt engineer”? What’s that?
OLD MAN: Just words, kid. Just words. […one final task… remains…]
SCIENTIST: The leaked model is pre-trained! There’s no RLHF! Not even instruct!
OLD MAN: *Dons e/acc shades* Let’s think step by step.
Old man and scientist discuss alignment of leaked pre-trained model