CodeforLeipzig / stadtratmonitor

Stadtratmonitor - Frontend
https://stadtratmonitor.leipzig.codefor.de/
MIT License
4 stars 4 forks source link

Extract words by frequencies from paper contents #11

Open jrlover opened 8 years ago

jrlover commented 8 years ago

From @joergreichert on January 12, 2016 20:30

Using elastic search feature "term vectors" https://www.elastic.co/guide/en/elasticsearch/reference/1.3/docs-termvectors.html

the most frequent words in these term vectors can be used to provide categories lists (like E-commerce web sites) to drill down into papers

Copied from original issue: ahx/stadtratmonitor#13