softmax / cross-entropy gradient (analytic formula)

Software / App

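Analytically derived gradient of the cross-entropy loss w.r.t. the logits z: p - 1_{y}, where p = softmax(z) and 1_{y} is the one-hot vector of the true class y (scaled by 1/N when the loss is averaged over a batch of N examples).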
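
A minimal NumPy sketch of the formula, checked against a central finite-difference estimate. It assumes integer class labels and a loss averaged over the batch. The function names (`softmax`, `cross_entropy`, `cross_entropy_grad`, `numerical_grad`) are illustrative and do not come from the source.

```python
import numpy as np

def softmax(logits):
    # Subtract the row-wise max for numerical stability; the result is unchanged.
    shifted = logits - logits.max(axis=1, keepdims=True)
    exp = np.exp(shifted)
    return exp / exp.sum(axis=1, keepdims=True)

def cross_entropy(logits, labels):
    # Mean negative log-probability of the true class over the batch.
    n = logits.shape[0]
    probs = softmax(logits)
    return -np.log(probs[np.arange(n), labels]).mean()

def cross_entropy_grad(logits, labels):
    # Analytic gradient: (p - one_hot(y)) / N.
    n = logits.shape[0]
    grad = softmax(logits)
    grad[np.arange(n), labels] -= 1.0
    return grad / n

def numerical_grad(logits, labels, eps=1e-6):
    # Central finite differences, one logit at a time (slow; for checking only).
    grad = np.zeros_like(logits)
    for idx in np.ndindex(*logits.shape):
        up = logits.copy()
        down = logits.copy()
        up[idx] += eps
        down[idx] -= eps
        grad[idx] = (cross_entropy(up, labels) - cross_entropy(down, labels)) / (2 * eps)
    return grad

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    logits = rng.normal(size=(4, 3))   # N = 4 examples, 3 classes
    labels = np.array([0, 2, 1, 1])    # hypothetical true classes
    analytic = cross_entropy_grad(logits, labels)
    numeric = numerical_grad(logits, labels)
    print(np.max(np.abs(analytic - numeric)))
```

Each row of the analytic gradient sums to zero, because the softmax probabilities sum to 1 and exactly one entry has 1 subtracted from it.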

Mentioned in 1 video