Instruction Implementation Meta Issue - Githubissues

stratosphere / incubator-systemml

Mirror of Apache SystemML (Incubating)

Apache License 2.0

1 stars 4 forks source link

Instruction Implementation Meta Issue #2

Open fschueler opened 8 years ago

fschueler commented 8 years ago

This issue tracks the status of instructions. Please insert the instruction that you're working on and your name into the table and don't forget to mark it as finished once you're done.

Instruction	Assignee	Status	Comment
TSMMFLInstruction	@fschueler	:white_check_mark:
ReblockFLInstruction	@fschueler	:white_check_mark:
CSVReblockFLInstruction	@fschueler	:white_check_mark:
MatrixScalarFLInstruction	@carabolic	:white_check_mark:	Should work for all supported scalar operations; test wit `MLContext` is missing
WriteFLInstruction	@fschueler	:white_check_mark:
RandFLInstruction	@fschueler	:white_check_mark:
MapmmFLInstruction	@carabolic	:white_check_mark:
MapmmChainFLInstruction	@carabolic	:white_check_mark:
ArithmeticBinaryFLInstruction	@fschueler	:white_check_mark:
MatrixMatrixArithmeticFLInstruction	@fschueler	:white_check_mark:
AggregateUnaryFLInstruction	@fschueler	:white_check_mark:

Symbol	Explanation
:white_check_mark:	completed
:construction:	in progress
:x:	will/can not be implemented

carabolic commented 8 years ago

I see a lot of times Spark instructions are manually optimized to use a broadcast/ map-side join. I think this optimization is not needed for Flink. I think we should always use the Flink operator with the best semantics and give the Flink optimizer the enough meta information to find the perfect plan. For instance the matrix multiplication (Mapmm and MapmmChain) could be extended to use Flink's JoinHints.