The influence distributions are heavy-tailed, with the tail approximately following a power law. Most of the total influence is concentrated in a small fraction of training sequences. Even so, the influence remains diffuse: any single sequence has only a slight effect on the final outputs.
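To make the two claims concrete (concentrated, yet diffuse), here is a minimal sketch on synthetic data. It does not use real influence scores: the values are drawn from a Pareto distribution with an assumed tail exponent of 1.5, and the sample size, the exponent, and the helper names `top_share` and `hill_tail_exponent` are illustrative choices, not taken from the original work. Real influence scores can also be negative; this toy example assumes non-negative magnitudes.

```python
import math
import random


def top_share(values, fraction):
    """Share of the total held by the largest `fraction` of the values."""
    ordered = sorted(values, reverse=True)
    k = max(1, int(len(ordered) * fraction))
    return sum(ordered[:k]) / sum(ordered)


def hill_tail_exponent(values, k):
    """Hill estimator of the power-law tail exponent from the k largest values."""
    ordered = sorted(values, reverse=True)
    threshold = ordered[k]  # the (k+1)-th largest value
    return k / sum(math.log(v / threshold) for v in ordered[:k])


# Assumption: synthetic stand-in for per-sequence influence magnitudes.
ALPHA = 1.5          # hypothetical tail exponent
N_SEQUENCES = 100_000

rng = random.Random(0)
influences = [rng.paretovariate(ALPHA) for _ in range(N_SEQUENCES)]

share_top_1pct = top_share(influences, 0.01)
max_single_share = max(influences) / sum(influences)
alpha_hat = hill_tail_exponent(influences, k=1000)

print(f"share held by top 1% of sequences: {share_top_1pct:.3f}")
print(f"share held by the single largest:  {max_single_share:.4f}")
print(f"estimated tail exponent (Hill):    {alpha_hat:.2f}")
```

In a sample like this, the top 1% of sequences holds far more than 1% of the total, while the single most influential sequence still accounts for only a small part of it. That is the sense in which a distribution can be both heavy-tailed and diffuse.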
Training Data Influence Distributions Follow Heavy-Tailed Power Laws