qinwf / jiebaR

Chinese text segmentation with R. R语言中文分词 (文档已更新 🎉 :https://qinwenfeng.com/jiebaR/ )
Other
342 stars 110 forks source link

请教一下分词后报错是什么原因 #68

Open DDDchiyukawaii opened 4 years ago

DDDchiyukawaii commented 4 years ago

第一,环境信息 R version 3.6.1 (2019-07-05) Platform: x86_64-w64-mingw32/x64 (64-bit) Running under: Windows 10 x64 (build 15063)

Matrix products: default

locale: [1] LC_COLLATE=Chinese (Simplified)_China.936 [2] LC_CTYPE=Chinese (Simplified)_China.936
[3] LC_MONETARY=Chinese (Simplified)_China.936 [4] LC_NUMERIC=C
[5] LC_TIME=Chinese (Simplified)_China.936

attached base packages: [1] stats graphics grDevices utils datasets methods base

other attached packages: [1] jiebaR_0.10.99 jiebaRD_0.1

loaded via a namespace (and not attached): [1] compiler_3.6.1 Rcpp_1.0.2

第二,全部错误信息 word=scan('1.txt',sep="n",what="",encoding="UTF-8") engine=worker() word_seg=segment(word,engine) 2019-11-07 10:47:50 ../inst/include/lib/PreFilter.hpp:21 ERROR decode failed.

处理一批类似的文档,结果有的分词成功,有的会这样报错 新手第一次接触编程,请问是什么原因?

LUOQINYUE commented 4 years ago

我也有这个报错,想知道是什么原因