Closed: magsol closed this issue 8 years ago
The PySpark API provides a couple of sorting primitives:
The last one in particular looks promising for our uses (here's a StackOverflow question on its use).
This is actually not needed: the topR operation is performed on a single vector, which is held in memory, so no distributed sort is required.
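For illustration, here is a minimal sketch of an in-memory top-R selection on a plain Python list, with no Spark involved. It assumes that topR means "pick the R entries with the largest magnitude", which the thread does not spell out; the function name `top_r_indices` and the sample vector are hypothetical.

```python
import heapq


def top_r_indices(vector, r):
    """Return the indices of the r largest-magnitude entries, largest first.

    Assumption: topR ranks entries by absolute value. This is a guess
    at the semantics, not something stated in the issue.
    """
    # heapq.nlargest runs in O(n log r) and never sorts the full vector.
    return heapq.nlargest(r, range(len(vector)), key=lambda i: abs(vector[i]))


v = [0.5, -3.0, 1.2, 0.0, 2.7]
print(top_r_indices(v, 2))
```

Because the vector fits in memory on a single node, a local selection like this avoids a shuffle entirely.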