My guess is that this comes from InstructGPT/RLHF tuning rather than anything in the pre-training corpus. The tuning implicitly makes the model assume all questions are answerable: it sees all text as “ ” and Q/A is a subset of that.
InstructGPT/RLHF tuning makes model assume all questions answerable