We would like to have a reusable implementation of multinomial logistic regression (training and prediction), such that users can easily employ it in their DaphneDSL scripts.
Steps:
Translate the DML built-in functions multiLogReg() and multiLogRegPredict() from Apache SystemDS to DaphneDSL. The translation can be done semi-automatically using DAPHNE's dml2daph tool (currently not on main, but a usable version of the Python script can be found in PR #576).
The translated code should be made available to DAPHNE users as importable DaphneDSL scripts in scripts/algorithms/. Inspiration on the format can be taken from scripts/algorithms/decisionTree_.daph.
Furthermore, script-level test cases should be added that test if the resulting multiLogReg implementation yields meaningful results. Inspiration for the test cases could be taken from the test cases in Apache SystemDS as well, as we did for the decision tree test cases in test/api/cli/algorithms/DecisionTreeRandomForestTest.cpp.
Hints:
Note that there is already a variant of multiLogReg in scripts/algorithms/multiLogReg.daph. However, this script was an early manual translation with many todos left open; it should be replaced by a new translation.
Any changes made to the translation tool dml2daph in the context of this translation task would be welcome contributions as well.
Side note: MultiLogReg is particularly interesting to us at the moment because it is required for SystemDS's clustered classification script, which we want to be able to run soon.
We would like to have a reusable implementation of multinomial logistic regression (training and prediction), such that users can easily employ it in their DaphneDSL scripts.
Steps:
multiLogReg()
andmultiLogRegPredict()
from Apache SystemDS to DaphneDSL. The translation can be done semi-automatically using DAPHNE'sdml2daph
tool (currently not on main, but a usable version of the Python script can be found in PR #576).scripts/algorithms/
. Inspiration on the format can be taken fromscripts/algorithms/decisionTree_.daph
.test/api/cli/algorithms/DecisionTreeRandomForestTest.cpp
.Hints:
scripts/algorithms/multiLogReg.daph
. However, this script was an early manual translation with many todos left open; it should be replaced by a new translation.dml2daph
in the context of this translation task would be welcome contributions as well.Side note: MultiLogReg is particularly interesting to us at the moment because it is required for SystemDS's clustered classification script, which we want to be able to run soon.