drogozhang / Criminal-Intelligence-QA-System

Demo for advanced Java final project in 18-19 1 of Canghong Jin
24 stars 8 forks

Some questions about ur word2vec detecting #1

Closed zhoustan closed 3 years ago

zhoustan commented 5 years ago

It's a good project that I'm interested in.

But could you explain in more detail how your team uses the model to do the text extraction? I mean the files in neo4jcsv.

Looking forward to your answer! Thank you very much!

drogozhang commented 5 years ago

The files in neo4jcsv are the import files used to load the data into the neo4j database, which is then queried with Cypher, the neo4j query language. There is no text detection problem here, only text classification.

For a natural language query like "2017年龙湾区的冰毒数量" ("the amount of methamphetamine in Longwan District in 2017"), I first run JIEBA word segmentation. Then I use a pre-trained word vector model (links are given in README.md) to map each word, such as "年" ("year"), to a 300-dimensional vector. Stacking these vectors vertically gives one matrix that can be used as input to a CNN text classifier. For a BiLSTM or LSTM, the vectors are fed in as the input x, one per time step.
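As a rough illustration of the segmentation-then-embedding step described above, here is a minimal sketch in plain Python. It is not the project's actual code. The token list is hard-coded as an assumed segmentation (in practice it would come from a segmenter such as `jieba.lcut`, whose real output may differ), the embedding table is a hypothetical stand-in filled with random values instead of the pre-trained vectors, and mapping unknown words to a zero vector is also an assumption.

```python
import random

EMBED_DIM = 300  # vector dimension stated in the reply above


def build_toy_embeddings(vocab, dim=EMBED_DIM, seed=0):
    """Hypothetical stand-in for the pre-trained word vectors (random values)."""
    rng = random.Random(seed)
    return {word: [rng.uniform(-1.0, 1.0) for _ in range(dim)] for word in vocab}


def sentence_to_matrix(tokens, embeddings, dim=EMBED_DIM):
    """Stack one vector per token into a len(tokens) x dim matrix.

    Tokens missing from the table map to a zero vector (an assumption,
    not something the original reply specifies).
    """
    zero = [0.0] * dim
    return [list(embeddings.get(token, zero)) for token in tokens]


# Assumed segmentation of the example query; real JIEBA output may differ.
tokens = ["2017", "年", "龙湾区", "的", "冰毒", "数量"]

# "的" is deliberately left out of the toy vocabulary to show the fallback.
embeddings = build_toy_embeddings(["2017", "年", "龙湾区", "冰毒", "数量"])

matrix = sentence_to_matrix(tokens, embeddings)

# Rows correspond to tokens (time steps), columns to embedding dimensions.
print(len(matrix), len(matrix[0]))
```

A CNN classifier would take the whole matrix as a single input, while an LSTM or BiLSTM would read its rows one at a time as the inputs at successive time steps.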