Application of Data Augmentation to Text Classification Tasks

Data augmentation is an important technique in natural language processing (NLP), which is used to enhance the diversity and quantity of data sets to improve the generalization performance and robustness of the model.

I mainly use two comparison methods, one is Roberta + data enhancement (random replacement, deletion, insertion, exchange); the other is Roberta + data enhancement (random replacement, deletion, insertion), and then use it after comparison to see the specific effect.

Table of contents

1. Roberta+ data enhancement (random replacement, deletion, insertion, exchange)

Guess you like

Origin blog.csdn.net/weixin_43734080/article/details/131217152