quinngroup / dr1dl-pyspark

Dictionary Learning in PySpark
Apache License 2.0
1 stars 1 forks source link

P3: Column-wise whitening of S #45

Closed magsol closed 8 years ago

magsol commented 8 years ago

The very first step of the algorithm, before the loops even begin, is to whiten the columns of the input matrix S. This means subtracting off the mean and rescaling the columns to have unit norms.

Luckily, thunder-project has the perfect function: http://thunder-project.org/thunder/docs/generated/thunder.RowMatrix.html#thunder.RowMatrix.zscore . Make sure we specify axis = 1 (the column axis) and this will perform the whitening.