AI Dynamics

Global AI News Aggregator

About

InstructGPT/RLHF tuning makes model assume all questions answerable

My guess is this is InstructGPT/RLHF rather than anything in the pre-training corpus. Tuning implicitly makes it assume all questions are answerable — it sees all text as “ ” and Q/A is a subset of that.

→ View original post on X — @goodside