Closed RomainGoussault closed 3 years ago
This means that we cannot create scenarios where each partner owns specific data. It could be great to add a warning, an error, and/or a sentence about that in the doc
I'll add a note in the documentation @RomainGoussault. I don't think it should be "fixed".
When running the titanic example with this config file: https://github.com/SubstraFoundation/distributed-learning-contributivity/blob/3ab411b6e79fcb34e71294b8c6a2ae98bdf1f8c7/tests/config_end_to_end_test_titanic.yml, it fails (see stacktrace below).
The issue is that sometimes the data we fit only contains one class and the sklearn solver does not like that and always want to have 2 class. https://stackoverflow.com/questions/40524790/valueerror-this-solver-needs-samples-of-at-least-2-classes-in-the-data-but-the
Note that in this specific example (dataset_proportion=0.2 and minibatch_cout=10), the batch size is 4 which very small.
@arthurPignet @bowni