Am old enough to remember when @GeoffreyHinton told me I was stupid for saying that LLMs regurgitate training data. He was wrong. LLM regurgitation is now one of the best-established findings in the field. Excerpt below from a new DeepMind paper; every single one of the
DeepMind paper provides evidence for LLM training data regurgitation
