It was mentioned, in paper Gradient-based boosting for statistical relational learning: The relational dependency network case, that the initial potential had to be set to -1.8 in order to capture the uniform distribution. Where does this value -1.8 come from?
Hello,
It was mentioned, in paper Gradient-based boosting for statistical relational learning: The relational dependency network case, that the initial potential had to be set to -1.8 in order to capture the uniform distribution. Where does this value -1.8 come from?
Thank you.