Closed GoogleCodeExporter closed 9 years ago
I think such validation go beyond what ValidityCheckTask is intended to do.
This task verifies whether the input data, the features and the type of
learning (single- vs. multi-label) produce a valid task setup.
It should not verify whether each individual feature produces an outcome that
is *compatible* with the specific learning algorithm used, since that is also
framework-dependent and we don't want dependencies from TC core to the machine
learning frameworks.
Maybe we should disable string features altogether. Please re-open if you have
better ideas.
Original comment by daxenber...@gmail.com
on 15 Aug 2014 at 4:21
I agree with Johannes.
Just some additional comments:
- We should maybe not disable String features altogether, as the instanceId is
also one :)
- It would be nice to fail-fast on such setups, but as this is a problem that
occurs deep in the ML framework, I also don't see how we could avoid that.
Original comment by torsten....@gmail.com
on 15 Aug 2014 at 6:44
Original issue reported on code.google.com by
Tobias.H...@gmail.com
on 15 Aug 2014 at 1:44