In the text summarization project, the loss is masked. If the loss is 0, it means that no gradient update is performed. How to deal with <PAD> after padding?

NoSuchKey

Guess you like

Origin blog.csdn.net/wtl1992/article/details/131607789