sotheanithsok / Habeas

A complete implementation of large scale search engine including on-disk indexing, multiple queries options, and user interfaces.
MIT License
0 stars 0 forks source link

Building The Index #54

Closed jblacklock closed 5 years ago

jblacklock commented 5 years ago

"Your PositionalInvertedIndex must be written to disk using the DiskIndexWriter class from Homework 5. Your index file must be constructed in the following pattern: dft d tf t,d p1 p2 · · · pi, where d is the document ID, tf t,d is the term frequency of a term in the document (i.e., the number of positions that the term appears at), and p1 p2 · · · pi are each of the i positions of that term in that document. All document IDs and positions must be written as gaps."

sellabae commented 5 years ago

Implemented in #53 and merged to master with the update on document frequency