Xavier Glorot and Yoshua Bengio paper

Book

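A paper by Xavier Glorot and Yoshua Bengio, referenced for its analysis of weight initialization in deep networks, particularly with tanh activations. Its central point is that the scale of the initial weights determines whether activations and gradients keep a usable magnitude as they propagate through many layers.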
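As a rough illustration, the scheme commonly called Xavier or Glorot uniform initialization draws each weight from U(-limit, limit) with limit = sqrt(6 / (fan_in + fan_out)), which gives a weight variance of 2 / (fan_in + fan_out). The sketch below uses only the Python standard library; the function names `glorot_uniform` and `variance` and the layer sizes are hypothetical and chosen for illustration, not taken from the paper or from any library.

```python
import math
import random


def glorot_uniform(fan_in, fan_out, rng=None):
    """Return a fan_in x fan_out matrix (list of rows) drawn from U(-limit, limit).

    Assumption: limit = sqrt(6 / (fan_in + fan_out)), the usual "Glorot uniform" rule.
    """
    rng = rng or random.Random()
    limit = math.sqrt(6.0 / (fan_in + fan_out))
    return [[rng.uniform(-limit, limit) for _ in range(fan_out)]
            for _ in range(fan_in)]


def variance(values):
    """Population variance of a flat list of numbers."""
    mean = sum(values) / len(values)
    return sum((v - mean) ** 2 for v in values) / len(values)


# Hypothetical layer sizes, used only to check the sampled statistics.
fan_in, fan_out = 256, 128
weights = glorot_uniform(fan_in, fan_out, random.Random(0))
flat = [w for row in weights for w in row]

limit = math.sqrt(6.0 / (fan_in + fan_out))
target_var = 2.0 / (fan_in + fan_out)  # variance of U(-limit, limit) is limit**2 / 3

print(f"limit        = {limit:.4f}")
print(f"target var   = {target_var:.6f}")
print(f"empirical var = {variance(flat):.6f}")
```

The empirical variance should land close to the target because a uniform distribution on (-a, a) has variance a²/3. Balancing fan_in and fan_out in the denominator is a compromise between keeping the forward activations and the backpropagated gradients at a similar scale.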

Mentioned in 1 video