Open mike820324 opened 9 years ago
My first thought is to store each comic as:
key: $uuid, value: { site: $url, name: comicName, description: comicDescription, link: comicLink }
and then build an inverted index from the name and description. To build the inverted index, I need Chinese word segmentation and a LevelDB update method.
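A rough sketch of that idea follows, with two loud assumptions: a plain in-memory dict stands in for LevelDB, and overlapping character bigrams stand in for real Chinese segmentation (a proper segmenter would give better keywords). The names `tokenize`, `InvertedIndex`, `put`, and `search` are made up for illustration and are not part of any existing module.

```python
import re
from collections import defaultdict

# Assumption: CJK text is approximated by the basic CJK Unified Ideographs block.
CJK_RUN = re.compile(r"[\u4e00-\u9fff]+")
LATIN_WORD = re.compile(r"[a-z0-9]+")


def tokenize(text):
    """Return lowercase Latin words plus overlapping bigrams of each CJK run."""
    tokens = LATIN_WORD.findall(text.lower())
    for run in CJK_RUN.findall(text):
        if len(run) == 1:
            tokens.append(run)
        else:
            tokens.extend(run[i:i + 2] for i in range(len(run) - 1))
    return tokens


class InvertedIndex:
    """In-memory stand-in for the key/value store described above."""

    def __init__(self):
        self.docs = {}                    # uuid -> record
        self.postings = defaultdict(set)  # token -> set of uuids

    def _tokens(self, record):
        return set(tokenize(record["name"] + " " + record["description"]))

    def put(self, uuid, record):
        """Insert or update a record, dropping stale postings first."""
        old = self.docs.get(uuid)
        if old is not None:
            for token in self._tokens(old):
                self.postings[token].discard(uuid)
                if not self.postings[token]:
                    del self.postings[token]
        self.docs[uuid] = record
        for token in self._tokens(record):
            self.postings[token].add(uuid)

    def search(self, query):
        """Return the uuids whose text contains every token of the query."""
        tokens = tokenize(query)
        if not tokens:
            return set()
        result = None
        for token in tokens:
            ids = self.postings.get(token, set())
            result = set(ids) if result is None else result & ids
        return result
```

The update path is the part that matters for the "update method" above: when a record is rewritten, its old tokens have to be removed from the postings before the new ones are added, otherwise stale keywords keep matching.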
The Index-search module is a well-tested library. The problem is that it uses natural to generate keywords and for some of its classifier algorithms, and unfortunately natural does not currently support Chinese.
The current indexer design is not very good: adding a new comic website requires a lot of copy and paste. I really should redesign the indexer.
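One possible direction for the redesign, sketched under assumptions: each site contributes only a small scraper function, and the shared loop that assigns UUIDs and builds records lives in one place. Everything here (`Comic`, `register`, `SCRAPERS`, `build_records`, and the `https://comics.example` URL) is hypothetical and only illustrates the shape, not the existing indexer.

```python
import uuid
from dataclasses import dataclass
from typing import Callable, Dict, Iterable


@dataclass
class Comic:
    site: str
    name: str
    description: str
    link: str


# Hypothetical registry: site URL -> function that yields that site's comics.
SCRAPERS: Dict[str, Callable[[], Iterable[Comic]]] = {}


def register(site_url):
    """Decorator that registers a site-specific scraper under its URL."""
    def decorator(fn):
        SCRAPERS[site_url] = fn
        return fn
    return decorator


@register("https://comics.example")
def example_site():
    # A real scraper would fetch and parse pages; this one returns fixed data.
    yield Comic(
        site="https://comics.example",
        name="Example Comic",
        description="Placeholder entry",
        link="https://comics.example/1",
    )


def build_records():
    """Shared logic: run every registered scraper and key each comic by a UUID."""
    records = {}
    for scrape in SCRAPERS.values():
        for comic in scrape():
            records[str(uuid.uuid4())] = {
                "site": comic.site,
                "name": comic.name,
                "description": comic.description,
                "link": comic.link,
            }
    return records
```

With this shape, adding a new comic website would mean writing one decorated function rather than copying the whole indexing routine.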