lensacom / sparkit-learn

PySpark + Scikit-learn = Sparkit-learn
Apache License 2.0
1.15k stars 255 forks source link

Implement ArrayRDD.var() method #21

Open kszucs opened 9 years ago

kszucs commented 9 years ago

see http://docs.scipy.org/doc/numpy/reference/generated/numpy.ndarray.var.html#numpy.ndarray.var

There is already an implementation in https://github.com/lensacom/sparkit-learn/blob/master/splearn/feature_selection/variance_threshold.py#L62 but it should use ndarray.var instead of mean_variance_axis.