AI Dynamics

Global AI News Aggregator

About

Instruction tuning merges human will into AI controlled by awkward kid

Instruction tuning / RLHF is technically a Human Instrumentality Project, merging the preferences of countless humans to form an oversized, living amalgam of our will. We then hand control of it to a random, socially awkward kid and hope for the best.

→ View original post on X — @goodside