databricks / spark-tfocs

A Spark port of TFOCS: Templates for First-Order Conic Solvers (cvxr.com/tfocs)
Apache License 2.0

Add and document DVector and DMatrix types to clarify RDD[Vector] usage, additional vector comments and cleanups. #8

Closed: staple closed this 9 years ago

staple commented 9 years ago

Previously, RDD[Vector] was used to represent both distributed vectors and distributed matrices. This PR introduces the DVector and DMatrix types to make clear which of the two is meant wherever the distributed RDD[Vector] format is used. In addition, new comments describe the usage requirements for these datatypes.
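For readers unfamiliar with the pattern, here is a minimal sketch of how two type aliases over the same RDD[Vector] representation can document intent. It is not the code from this PR: the object name `DistributedTypesSketch`, the helpers `localDot`, `dot`, and `times`, and the convention that a DVector element is a chunk of the vector while a DMatrix element is one row are all assumptions made for illustration.

```scala
import org.apache.spark.{SparkConf, SparkContext}
import org.apache.spark.mllib.linalg.{Vector, Vectors}
import org.apache.spark.rdd.RDD

object DistributedTypesSketch {
  // Hypothetical aliases: same underlying type, different meaning.
  // Assumed convention: each element of a DVector is a contiguous chunk of the vector.
  type DVector = RDD[Vector]
  // Assumed convention: each element of a DMatrix is one row of the matrix.
  type DMatrix = RDD[Vector]

  // Dot product of two local vectors of equal length.
  def localDot(a: Vector, b: Vector): Double =
    a.toArray.zip(b.toArray).map { case (p, q) => p * q }.sum

  // Dot product of two distributed vectors.
  // Usage requirement: both must have the same number of partitions and the
  // same number of elements per partition, because RDD.zip requires it.
  def dot(a: DVector, b: DVector): Double =
    a.zip(b).map { case (x, y) => localDot(x, y) }.sum()

  // Matrix times local vector: one output chunk per partition of rows.
  def times(m: DMatrix, x: Vector): DVector =
    m.mapPartitions(rows => Iterator(Vectors.dense(rows.map(localDot(_, x)).toArray)))

  def main(args: Array[String]): Unit = {
    val conf = new SparkConf().setMaster("local[2]").setAppName("dtypes-sketch")
    val sc = new SparkContext(conf)
    try {
      val m: DMatrix = sc.parallelize(
        Seq(Vectors.dense(1.0, 2.0), Vectors.dense(3.0, 4.0), Vectors.dense(5.0, 6.0)), 2)
      val y: DVector = times(m, Vectors.dense(1.0, 1.0))
      // Flatten the chunks to inspect the full result vector.
      println(y.collect().flatMap(_.toArray).mkString(", "))
      println(dot(y, y))
    } finally {
      sc.stop()
    }
  }
}
```

Because both aliases expand to RDD[Vector], the compiler will not reject a DMatrix passed where a DVector is expected. In a sketch like this the names only document intent in signatures, so the usage requirements still have to be stated in comments.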

Also removes the old, unused function implementations operating on RDD[Double] vectors.

Also documents and clarifies various vector operations throughout the code, and includes a few optimizations.