Additional response 1: These arguments also fail to explain many emergent abilities. Consider the plots below, which show a U-shaped phenomenon: performance actually decreases across several model scales, then suddenly spikes up again.
U-shaped Performance: Emergent Abilities in Model Scaling