I do think there's (a) a Sovereignty-vs-corrigibility dichotomy here, and (b) a pointer at what makes corrigibility hard. It's possible we should have different words for Sovereign-alignment and corrigible-alignment.