This explanation was kind of polarising — some people found it pointlessly abstruse, others found it worked for them when others didn't: https://
thinc.ai/docs/backprop1
01
… If you get down to the bit with the two tables, it explains how a max function can implement nand, while a linear
Explanation of backpropagation: max function implements NAND gates
By
–
Leave a Reply