Math | 17 JAN, 2020
Provable Benefit of Orthogonal Initialization in Optimizing Deep Linear Networks
By Wei Hu