houbb / sensitive-word

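👮‍♂️The sensitive word tool for Java. (Sensitive words / banned words / illegal words / profanity. A high-performance Java sensitive-word filtering framework based on the DFA algorithm. Please do not post content involving politics, advertising, marketing, firewall circumvention, or anything that violates national laws and regulations. A high-performance sensitive-word detection and filtering component with Traditional/Simplified Chinese conversion, full-width/half-width conversion, Chinese-to-pinyin conversion, fuzzy search, and other features.)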
https://houbb.github.io/opensource/sensitive-word
Apache License 2.0
4.1k stars 545 forks

Why does initializing a default SensitiveWordBs object use so much memory? #62

Closed lxw6933 closed 3 months ago

lxw6933 commented 3 months ago

Initializing an empty object and printing its size:
long objectSize = ObjectSizeCalculator.getObjectSize(sensitiveWordBs);
log.info("sensitiveWordBs size:【{}】byte", objectSize);
Output:
sensitiveWordBs size:【35043720】byte

houbb commented 3 months ago

The built-in dictionary is loaded by default; it can be removed through configuration.
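
To illustrate why a loaded dictionary dominates the object's size, here is a minimal sketch of a character trie, the kind of structure a DFA-style matcher is typically built on. It is not the library's actual implementation: the class `TrieFootprint` and its methods are hypothetical names made up for this example. It only shows that the node count, and therefore the memory held by the instance, grows with the number of words loaded, while an empty dictionary leaves just the root.

```java
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical illustration only; not code from sensitive-word.
public class TrieFootprint {

    /** One trie node: child links keyed by character, plus an end-of-word flag. */
    static final class Node {
        final Map<Character, Node> next = new HashMap<>();
        boolean end;
    }

    /** Builds a trie from the given words; shared prefixes share nodes. */
    static Node build(List<String> words) {
        Node root = new Node();
        for (String word : words) {
            Node cur = root;
            for (char c : word.toCharArray()) {
                cur = cur.next.computeIfAbsent(c, k -> new Node());
            }
            cur.end = true;
        }
        return root;
    }

    /** Counts every node reachable from n, including n itself. */
    static int countNodes(Node n) {
        int total = 1;
        for (Node child : n.next.values()) {
            total += countNodes(child);
        }
        return total;
    }

    public static void main(String[] args) {
        Node empty = build(List.of());
        Node small = build(List.of("bad", "bat", "badge"));
        System.out.println("empty dictionary nodes: " + countNodes(empty));
        System.out.println("three-word dictionary nodes: " + countNodes(small));
    }
}
```

Each node here carries its own `HashMap`, so a dictionary with tens of thousands of entries can plausibly reach tens of megabytes. For the library itself, the project README describes supplying custom word lists through `wordDeny` and `wordAllow` on `SensitiveWordBs`; consult it for the exact way to drop the built-in dictionary.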