Why do deep networks (vgg, resnet) not use the softmax (probability normalization) function in the end, but directly add the fc layer?

NoSuchKey

おすすめ

転載: blog.csdn.net/qq_43374694/article/details/132588508