There’s conditioning from the dialog syntax that it’s being naively given in the same format that the instruct-tuned version receives. It’s seen these in pre-training, but the association isn’t strong enough apparently to make it act like a chatbot even most of the time.
Pretraining and conditioning issues in instruct-tuned dialogue models
By
–