425776024 / nlpcda

一键中文数据增强包 ; NLP数据增强、bert数据增强、EDA:pip install nlpcda
Apache License 2.0
1.78k stars 169 forks source link

如果能支持定义stopwords 就更好了 #32

Open XiaoqingNLP opened 1 year ago

XiaoqingNLP commented 1 year ago

作者开发的中文版数据增强功能很好用,看了一下类实现,如果能支持stopwords 会更好,即部分词不进行增强,例如库nlpaug

import nlpaug.augmenter.word as naw
aug_swap = naw.RandomWordAug(action="swap", stopwords=stopwords, aug_p=0.1)
425776024 commented 1 year ago

作者开发的中文版数据增强功能很好用,看了一下类实现,如果能支持stopwords 会更好,即部分词不进行增强,例如库nlpaug

import nlpaug.augmenter.word as naw
aug_swap = naw.RandomWordAug(action="swap", stopwords=stopwords, aug_p=0.1)

感谢提意见,最近1年工作、精力,没有花时间在github上