Open andrasfuchs opened 2 years ago
Update: I'm not 100% sure, but it looks like I get this exception when I use SdcaLogisticRegressionOva
, LbfgsLogisticRegressionOva
or LbfgsMaximumEntropyMulti
, but it works fine with FastTreeOva
, FastForestOva
and LightGbmMulti
.
@JakeRadMSFT think this is due to how you guys are building the pipeline itself? Or do we need to go do into the ML.NET code itself?
I used Visual Studio 2022 v17.0.4 with the ML.NET Model Builder 2022 v16.9.2.2205603 extension with my C# console application that includes these mbconfigs to generate the ML models: https://drive.google.com/file/d/1CZJitEjEd3GEhBbHZUMOgEbRg3GaPsMD/view?usp=sharing
Here is the data that I used for training and for the model accuracy measurements: https://drive.google.com/file/d/1C2Of2UIHN2y7l5J-SvEvacFhTAHMCCcm/view?usp=sharing
To get the exception, you need to stop the training when SdcaLogisticRegressionOva
, LbfgsLogisticRegressionOva
or LbfgsMaximumEntropyMulti
has best accuracy.
Hi guys, did you have the chance to look into this? Do you need any more data from me to reproduce the problem?
Until then, is there any workaround for this issue, can I remove some of the training algorithms from the training loop of the model builder? Or is there a way to manually choose the algorithm for the code/model generator in model builder after training?
@LittleLittleCloud didn't we see this issue somewhere else? What was the root cause?
@LittleLittleCloud can you also share with @andrasfuchs how to remove training algorithms from AutoML?
I'll take a look, meanwhile, to disable trainers you can refer this comment https://github.com/dotnet/machinelearning-modelbuilder/issues/1998#issuecomment-1026240486
System Information (please complete the following information):
Describe the bug I get
IndexOutOfRangeException
when callingPredictionEngine
'sPredict
method. The exception is thrown in the AddFeature method of the BufferBuilder class.To Reproduce Steps to reproduce the behavior:
ModelInput
with values and callMLModel.Predit(ModelInput)
methodExpected behavior I was expecting to have a
ModelOutput
object returned, but I got the above exception instead. It would be also great to have a more detailed exception to help the bug hunting if it is caused by misconfiguration on the user's part.Screenshots, Code, Sample Projects
Additional context Probably not a critical information, but my data source is a ~360 MB big CSV file with 5120 float feature columns, 6 label columns (of which 5 are ignored for the training) and it has 5500 rows.