ContinuumIO / topik

A Topic Modeling toolbox
BSD 3-Clause "New" or "Revised" License

change DictionaryCorpus implementation from list to dict #50

Closed (youngblood closed 8 years ago)

msarahan commented 9 years ago

For clarity, this: https://github.com/ContinuumIO/topik/blob/master/topik/intermediaries/raw_data.py#L256

This implies changes to:

- Import: https://github.com/ContinuumIO/topik/blob/master/topik/intermediaries/raw_data.py#L317
- Updating docs (inserting tokenized data). This improvement will be very good: https://github.com/ContinuumIO/topik/blob/master/topik/intermediaries/raw_data.py#L291
- Yielding results. This will change to dictionary iteration; use for doc in self._documents.items(): https://github.com/ContinuumIO/topik/blob/master/topik/intermediaries/raw_data.py#L279
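A minimal sketch of what a dict-backed corpus might look like, covering the three changes listed above (import, updating docs, yielding results). This is not topik's actual code: the class body, the method names `import_from_iterable` and `append_to_record`, the integer ids, and the `"text"`/`"tokens"` field names are all assumptions made for illustration. Only the `self._documents` attribute name and the `.items()` iteration come from the comment.

```python
class DictionaryCorpus:
    """Hypothetical dict-backed corpus; a sketch, not topik's implementation."""

    def __init__(self, content_field, iterable=None):
        self.content_field = content_field
        # Documents are keyed by id instead of being stored in a list.
        self._documents = {}
        if iterable is not None:
            self.import_from_iterable(iterable)

    def import_from_iterable(self, iterable):
        # Import: assign each incoming text the next integer id (assumed scheme).
        for text in iterable:
            doc_id = len(self._documents)
            self._documents[doc_id] = {self.content_field: text}

    def append_to_record(self, doc_id, field, value):
        # Updating docs: a direct key lookup replaces a scan over a list.
        self._documents[doc_id][field] = value

    def __iter__(self):
        # Yielding results: .items() gives (id, document) pairs,
        # so the loop unpacks both rather than binding a single name.
        for doc_id, doc in self._documents.items():
            yield doc_id, doc[self.content_field]


corpus = DictionaryCorpus("text", ["first document", "second document"])
corpus.append_to_record(1, "tokens", ["second", "document"])
for doc_id, text in corpus:
    print(doc_id, text)
```

Note that iterating over `.items()` produces key/value tuples, so a loop written as `for doc in self._documents.items()` would bind `doc` to a tuple; unpacking into two names, as in the sketch, avoids indexing into it afterwards.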