Why does the LSTM model use both sigmoid and tanh activation functions, rather than using only sigmoid or only tanh throughout?


Origin blog.csdn.net/m0_47256162/article/details/132175760