rheem-ecosystem / rheem

Rheem - a cross-platform data processing system
https://rheem-ecosystem.github.io
5 stars 0 forks source link

KeyBy Functionality #7

Open JorgeQuiane opened 7 years ago

JorgeQuiane commented 7 years ago

Currently, we support a KeyBy functionality for the Join operator that looks like this:

dataQuanta1.keyBy(_._1)
  .join(dataQuanta2.keyBy(_._2))
  .assemble((dq1, dq2) => (dq2._1, dq1._2))

We need to extend this functionality to other aggregation operators, such as Reduce:

dq.keyBy(_._1).reduce( … )