those too, but i think more importantly i'm asking about the relationship between model parameters, training steps (whatever that means in the embeddings case), and downstream performance