Why do deep networks (VGG, ResNet) not apply a softmax (probability normalization) function at the end, and instead output directly from the final FC layer?

Origin blog.csdn.net/qq_43374694/article/details/132588508