Identity mappings in deep residual networks

Identity mappings in deep residual networks


Identity mappings in deep residual networks

pre-activation ResNet

  1. the optimization is further eased because f is an identity mapping
  2. using BN as pre-activation improves regularization of the models

相关文章: