llovelylip / galagosearch

Automatically exported from code.google.com/p/galagosearch
BSD 3-Clause "New" or "Revised" License
0 stars 0 forks source link

Term count statistics aren't easy to dump #19

Closed GoogleCodeExporter closed 9 years ago

GoogleCodeExporter commented 9 years ago
It would be convenient to extract term counts from the index using
dump-index, but this isn't implemented.

Workaround:
Use dump-index to dump posting lists, then use a script to convert this
data into term count information.

Original issue reported on code.google.com by trevor.s...@gmail.com on 10 May 2009 at 9:28

GoogleCodeExporter commented 9 years ago

Original comment by trevor.s...@gmail.com on 21 May 2009 at 5:47

GoogleCodeExporter commented 9 years ago

There is now a function akin to indri's xc:

galago xcount -x=term

Additionally there is a document count function:

galago doccount -x=term

Original comment by sjh...@gmail.com on 21 Jun 2011 at 3:26