dileepajayakody / semanticvectors

Automatically exported from code.google.com/p/semanticvectors

want to see document vector : tf-idf #76

GoogleCodeExporter closed 9 years ago

GoogleCodeExporter commented 9 years ago
What steps will reproduce the problem?
1. How do I get the vector for a document (a text file) after indexing it and 
applying tf-idf and LSA?
2. I also want to know the number of dimensions of a document vector.
3. Do document vectors and term vectors have the same number of dimensions?

What is the expected output? What do you see instead?
I want to see the document vector.

What version of the product are you using? On what operating system?
I am using "semanticvectors-4.0" on Ubuntu 12.04.

Please provide any additional information below.

Original issue reported on code.google.com by rohitdee...@gmail.com on 10 Nov 2013 at 1:03

GoogleCodeExporter commented 9 years ago
For this you'll want to set the -searchtype flag to printquery. And you may 
need to use the -matchcase option if your pathnames have any uppercase 
characters. So it's something like:

{{{
~/Data/Bible: java pitt.search.semanticvectors.Search -matchcase -queryvectorfile docvectors.bin -searchtype printquery bible_chapters/Genesis/Chapter_1
Opening query vector store from file: docvectors.bin
Searching term vectors, searchtype PRINTQUERY
Found vector for 'bible_chapters/Genesis/Chapter_1'
RealVector.  Dense.  Coordinates are:
-0.08940928 0.048319492 0.0840949 -0.1529306 0.0111201005 0.0010032 -0.06975765 
...
}}}

Original comment by dwidd...@gmail.com on 10 Nov 2013 at 4:42

GoogleCodeExporter commented 9 years ago
(Hope that makes sense - the {{{   }}} notation was meant to make the part in 
between look like code, but apparently it doesn't! Please add further comments 
if you have any more trouble with this.)

Original comment by dwidd...@gmail.com on 10 Nov 2013 at 4:46
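
On the second question in the report (the number of dimensions), one possible approach is to count the coordinates that printquery prints. The sketch below is not part of SemanticVectors. It assumes the output has the shape shown earlier in this thread: a line ending in "Coordinates are:" followed by whitespace-separated numbers, possibly wrapped over several lines. The function name `parse_printquery_coordinates` and the `sample` text are made up for illustration, and `sample` holds only the seven values quoted above (the real output was truncated with "..."), so the count it prints is not the true dimension of that vector.

```python
from typing import List


def parse_printquery_coordinates(output: str) -> List[float]:
    """Return the numbers that follow 'Coordinates are:' in the given text.

    Assumption: the coordinates are whitespace-separated floats, as in the
    sample output quoted in this thread. Parsing stops at the first token
    that is not a number (for example a trailing '...').
    """
    marker = "Coordinates are:"
    _, found, tail = output.partition(marker)
    if not found:
        raise ValueError("no 'Coordinates are:' marker in the output")

    coords: List[float] = []
    for token in tail.split():
        try:
            coords.append(float(token))
        except ValueError:
            break  # first non-numeric token ends the coordinate list
    return coords


# Hypothetical sample: only the values visible in the thread, not a full vector.
sample = """Found vector for 'bible_chapters/Genesis/Chapter_1'
RealVector.  Dense.  Coordinates are:
-0.08940928 0.048319492 0.0840949 -0.1529306 0.0111201005 0.0010032 -0.06975765
"""

coords = parse_printquery_coordinates(sample)
print(len(coords))  # number of coordinates found in the sample
```

To use it on real output, save the full Search output to a file and pass the file's contents to the function. Doing the same for a term's printed vector would give a second count to compare against, which is one way to check the third question empirically.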