BIDS-projects / topic-modeling

Categorization of various data science institutions into several different topics
Apache License 2.0

LDA weighting #11

Closed chewisinho closed 8 years ago

chewisinho commented 8 years ago

The main changes: This PR assumes that the database has a 'deg_sep' column. It adds 'deg_sep' as an attribute of DocumentItem and uses a weighting function to control the importance of each document. The file models.py contains several statistical models for adjusting the weighting, each with adjustable parameters.
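For illustration, here is a minimal sketch of how a weighting function of this kind could work. It is not the code from this PR: it assumes 'deg_sep' is a non-negative integer (read here as degrees of separation), and the `DocumentItem` fields, the functions `exponential_weight` and `inverse_weight`, and their parameters `rate` and `power` are all hypothetical stand-ins for whatever models.py actually defines.

```python
import math
from dataclasses import dataclass


@dataclass
class DocumentItem:
    # Hypothetical stand-in for the project's DocumentItem.
    text: str
    deg_sep: int  # assumed: non-negative, smaller means more relevant


def exponential_weight(deg_sep, rate=0.5):
    """Weight decays exponentially with deg_sep; `rate` is adjustable."""
    return math.exp(-rate * deg_sep)


def inverse_weight(deg_sep, power=1.0):
    """Weight decays polynomially with deg_sep; `power` is adjustable."""
    return 1.0 / (1.0 + deg_sep) ** power


def weighted_counts(doc, weight_fn=exponential_weight, **params):
    """Scale a document's term counts by its weight.

    Scaling bag-of-words counts is one simple way to let a document
    contribute more or less to a topic model; the PR may do it differently.
    """
    weight = weight_fn(doc.deg_sep, **params)
    counts = {}
    for token in doc.text.lower().split():
        counts[token] = counts.get(token, 0.0) + weight
    return counts
```

Under these assumptions, a document with `deg_sep == 0` keeps its full counts, while documents further away contribute progressively less. Swapping `weight_fn` or changing `rate` / `power` adjusts how quickly the influence drops off.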