vectorizer.fit_transform 错误

问题定位：在 TFIDF_space.py 文件的 stpwrdlst = readfile(stopword_path).splitlines()一行；原因：读入停用词文件，使用“rb”模式读字节了，改为“r”模式就好了运行环境：win10, pycharm中解决方案：

在 Tools.py 中新建函数：

# 读取文件
def readfile_str(path):  
with open(path, "r",encoding="utf8") as fp:  
    content = fp.read()  
return content

在 TFIDF_space.py 中导入函数 from Tools import readfile_str,
TFIDF_space.py 文件中 stpwrdlst = readfile(stopword_path).splitlines()替换为 stpwrdlst = readfile_str(stopword_path).splitlines()

sheldonresearch / chinese_text_classification