quinngroup / dr1dl-pyspark

Dictionary Learning in PySpark
Apache License 2.0

Top R sorting in PySpark #37

Closed magsol closed 8 years ago

magsol commented 8 years ago

The PySpark API provides a couple of sorting primitives:

The last one in particular looks promising for our uses (here's a StackOverflow question on its use).

magsol commented 8 years ago

This is actually not needed: the topR operation is performed on a vector, which will be held in memory, so no distributed sort is required.
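For reference, a minimal sketch of what an in-memory top-R selection could look like in plain Python. The function names `top_r_indices` and `keep_top_r` are hypothetical and not taken from this repository, and ranking by raw value (rather than by magnitude) is an assumption, since the thread does not say which ordering the topR step uses.

```python
import heapq


def top_r_indices(vector, r):
    """Return the indices of the r largest entries of an in-memory vector.

    Hypothetical helper: ranks by raw value, which is an assumption.
    """
    # heapq.nlargest does a partial selection, so no full sort
    # (distributed or otherwise) is needed.
    return heapq.nlargest(r, range(len(vector)), key=lambda i: vector[i])


def keep_top_r(vector, r):
    """Return a copy of the vector with all but the r largest entries zeroed."""
    keep = set(top_r_indices(vector, r))
    return [x if i in keep else 0.0 for i, x in enumerate(vector)]


v = [0.3, 2.5, -1.0, 4.1, 0.0, 1.7]
print(top_r_indices(v, 3))  # indices of the three largest entries
print(keep_top_r(v, 3))     # same vector with the rest set to 0.0
```

If the vector is already a NumPy array on the driver, a partial sort such as `numpy.argpartition` would probably be the more natural choice; the standard-library version above is only meant to show that the step is a local operation.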