propublica / Capitol-Words

Scraping, parsing and indexing the daily Congressional Record to support phrase search over time, and by legislator and date
BSD 3-Clause "New" or "Revised" License
122 stars 34 forks source link

xfer backend to elasticsearch #54

Open drinks opened 12 years ago

drinks commented 12 years ago

Try to offload some ngram/shingle functionality to the lucene analyzer