Open MichaelRoeder opened 1 month ago
The issue stems from the fact that for the sampled learning problem, the root state (Top concept) has zero quality. For a given E^+:[3] E^-:[3], Quality(T,E+, E-) must be strictly greater than 0. Yet, on that particular dataset, and the learning problem, this is not satisfied.
Thank you. I will improve the assertion error message to ensure that the reason behind this error is clear
I understand that the top concept shouldn't have a quality of zero. That is clearly a problem. So I guess this means that we should check in detail why the scoring method returns a quality of zero although the algorithm was able to select positive and negative examples beforehand, right? :thinking:
the algorithm was able to select positive and negative examples beforehand, right?
The error might stem from the fact after sampling positive and negative examples, we do not ensure that the sampled sets are different form each other, e.g.,
lp = (str_dl_concept_i,set(random.sample(individuals_i, size_of_examples)),set(random.sample(individuals_j, size_of_examples)))
Problem
DRILL crashes sometimes during its training phase because of an assertion that is not fulfilled.
How to recreate
Please message Quan if you need the embedding file or any other additional information.