matrix multiplication gradients (A @ B rules)

Tool / ProductMentioned in 1 video

The standard backward relationships for D = A @ B + C (dA = dD @ B.T, dB = A.T @ dD, dC = sum(dD)), derived by hand from small examples.