Instead of building the evals out inside llm.c, it might be faster to export the model weights into "common infra" and run the evals there. I don't have time to get to it right away, but I opened an Issue a few days ago so that someone can potentially take a look.
Exporting Model Weights for Faster Evaluation Infrastructure