FRosner / DataGenerator

Tool for generating various test data for machine learning and data mining algorithms.
Apache License 2.0
10 stars 2 forks source link

Categorial distribution sometimes samples value with probability 0 #204

Closed FRosner closed 10 years ago

FRosner commented 10 years ago

Integration test failed.

bildschirmfoto 2013-12-11 um 14 51 37

If you sample from a distribution with probabilities (0,0,1) e.g., sometimes 0.00000005 gets treated as 0 and the first value gets sampled. This can be avoided by comparing with maximum precision.

FRosner commented 10 years ago

Failure still happening. There seems to be a different problem.

FRosner commented 10 years ago

The problem is that 0.00000004 as random value gets treated as 0 and thus generating a discrete feature value 0 with cumulative probabilities (0,0,1), because 0 = 0.