Pretty amazing that we can get such good results even without batchnorm — and not using a proper resnet init like Fixup either! Maybe I should try lsuv with batchnorm-free resnet sometime…
Training ResNets Without Batchnorm: Achieving Strong Results
By
–
Leave a Reply