Posts by Category

Deep Learning

BatchNorm vs LayerNorm: Theory, Assumptions, and Dynamics

3 minute read

Published: