jorendorff / indexor

A toy search engine
3 stars 3 forks source link

results return only words starting with A or B #2

Closed Courey closed 8 years ago

Courey commented 8 years ago

lamesearch runs without barking, but results returned are limited to those starting with A or B. Cause unknown at this point.

Courey commented 8 years ago

There are words beginning with other letters (and even other alphabets) in the index, so the indexing does not seem to be the problem.

Courey commented 8 years ago

We were limiting the files tokenized to the first 250 because the entire sample takes a while to load. That first 250 includes a-belfast. So anything from belfast-z was left out for shorter load times.