One possible exercise: You have a one-hidden-layer MLP so large that its weights only fit when split across 4 machines. Describe how you’d run the forward and backward passes. What would you do if each machine had a small probability of breaking down?
Distributed MLP Training Across Multiple Machines: Architecture
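One way to approach the exercise is to split the hidden units across the four machines, so each machine holds a column slice of the first weight matrix and the matching row slice of the second. The sketch below simulates that in a single process with NumPy. The ReLU activation, the squared-error loss, the names `shard_params` and `forward_backward`, and the choice to split along the hidden dimension are all assumptions made for illustration, not part of the exercise statement.

```python
import numpy as np

N_WORKERS = 4  # the exercise's four machines, simulated here in one process


def shard_params(W1, b1, W2, n=N_WORKERS):
    """Split the hidden units across workers: W1 by columns, W2 by rows."""
    return [
        {"W1": w1, "b1": bb, "W2": w2}
        for w1, bb, w2 in zip(
            np.split(W1, n, axis=1), np.split(b1, n), np.split(W2, n, axis=0)
        )
    ]


def forward_backward(shards, b2, x, target):
    """One pass for the assumed loss 0.5 * sum((y - target)**2)."""
    # Forward: each worker needs only the input x and its own weight slices.
    caches, partials = [], []
    for s in shards:
        z = x @ s["W1"] + s["b1"]   # this worker's slice of the pre-activations
        h = np.maximum(z, 0.0)      # assumed ReLU activation
        caches.append((z, h))
        partials.append(h @ s["W2"])  # partial contribution to the output

    # The only forward communication: sum the partial outputs (an all-reduce).
    y = sum(partials) + b2
    dy = y - target
    loss = 0.5 * np.sum(dy ** 2)

    # Backward: dy is sent to every worker; each worker then computes the
    # gradients of its own slices without talking to the others.
    grads = []
    for s, (z, h) in zip(shards, caches):
        dz = (dy @ s["W2"].T) * (z > 0)
        grads.append({"W1": x.T @ dz, "b1": dz.sum(axis=0), "W2": h.T @ dy})
    return loss, grads, dy.sum(axis=0)  # last value is the gradient of b2


# Small check against an unsharded pass with the same weights.
rng = np.random.default_rng(0)
d_in, hidden, d_out, batch = 5, 8, 3, 6
W1 = rng.normal(size=(d_in, hidden))
b1 = rng.normal(size=hidden)
W2 = rng.normal(size=(hidden, d_out))
b2 = rng.normal(size=d_out)
x = rng.normal(size=(batch, d_in))
target = rng.normal(size=(batch, d_out))

loss, grads, db2 = forward_backward(shard_params(W1, b1, W2), b2, x, target)

z_ref = x @ W1 + b1
h_ref = np.maximum(z_ref, 0.0)
dy_ref = h_ref @ W2 + b2 - target
dz_ref = (dy_ref @ W2.T) * (z_ref > 0)
ref_loss = 0.5 * np.sum(dy_ref ** 2)
ref_dW1, ref_db1, ref_dW2 = x.T @ dz_ref, dz_ref.sum(axis=0), h_ref.T @ dy_ref
```

Concatenating the per-worker gradients along the hidden dimension should reproduce the unsharded gradients, which is what makes this split attractive: one sum of partial outputs in the forward pass, one broadcast of `dy` in the backward pass, and no other communication unless the gradient with respect to `x` is needed. For the breakdown question, one plausible answer under this layout is that each worker's slices are independent state, so they can be checkpointed or replicated per worker, and a step interrupted by a failure can be rerun after the lost slice is restored.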