uhh-lt / dats

Discourse Analysis Tool Suite
Apache License 2.0
15 stars 2 forks source link

Remove unused data from dbs #415

Closed bigabig closed 4 days ago

bigabig commented 4 days ago

this removes word frequencies from sdoc data (as we have a wordfrequencies sql table) and we remove large chunks of data stored in elasticsearch.

from now on, only the content is stored in elasticsearch