Skip to content
Better HN
Show HN: Why Neural Networks Need He Init, Clipping, and Momentum | Better HN