Could evolution replace gradients for LLM training? Introducing EGGROLL, a new paper that makes gradient-free training actually practical 100x training throughput vs vanilla ES 100x memory reduction 91% inference speed
Pure integer training Trending #1 on alphaXiv
EGGROLL: Gradient-Free LLM Training Achieves 100x Throughput
By
–
