StanfordHCI / termite

(development moved to new repos)
BSD 3-Clause "New" or "Revised" License
115 stars 36 forks source link

Use sparse matrix representation throughout the entire data processing pipeline. #4

Closed jcchuang closed 11 years ago

jcchuang commented 11 years ago

Right now, there are two ways to load a term-topic probability matrix: By calling either MatrixState.importMatrix() or MatrixState.importEntries().

The former loads a 2D array of numbers and the latter loads an 1D array of non-zero entries. For matrices with a large number of zeros (in our case), the latter is the more efficient representation. However, the client code currently passes the matrix from MatrixState to MatrixModel using a full matrix representation, nullifying any advantage we get from loading a sparse matrix.

jcchuang commented 11 years ago

Resolved in 0b13bb50bbacb58c44a6a3141d860c9d07942b91