kbastani / graphify

Graphify is a Neo4j unmanaged extension used for document and text classification using graph-based hierarchical pattern recognition.
http://graphify.github.io/graphify
Apache License 2.0
382 stars 89 forks source link

Classification time proportional to ? #20

Open letronje opened 9 years ago

letronje commented 9 years ago

I have observed that as we train graphify more and more, the size of the neo4j database on disk keeps increasing and beyond a point, each classification request takes more than a few minutes and makes it almost unusable.

Is there a way to train graphify for more accuracy but at the same time keep the classification time within usable limits ( like say 30 seconds or under a minute ? )

To understand the slowup, could you tell me which of the following parameters affect the classification time for a text given to it and how ?