In stochastic gradient descent (SGD), why is the negative gradient direction the direction in which the function decreases fastest?

Origin blog.csdn.net/qq_28057379/article/details/105178156