if you find this area exciting, there is some fascinating concurrent work which examines the problem of inferring training *distributions* from LLMs: https://x.com/iamgroot42/status/1934636691639472630 … (cc @chhaviyadav_ @kamalikac et al!)
Inferring Training Distributions from Large Language Models