The first comment (kudos for open review) links to a post that says some of this at greater length, but to repeat my own reaction: "There's nothing in there about alignment. The proposed motivational system is internal-system-health reward with nothing about caring for humans."
Alignment Concerns with Internal Health Reward System Design
By
–
Leave a Reply