xoopR / distr6

R6 object-oriented interface for probability distributions.
https://xoopr.github.io/distr6/
Other
99 stars 23 forks source link

Defining Large Rate in Possion #63

Closed w090613 closed 5 years ago

w090613 commented 5 years ago

As listed in Wiki, for entropy of poisson, there are two equations used by the size of rate. I am wondering how large this lambda should be. Some of the papers on google has mentioned rate=10 but the approximation is to another equation slightly different from wiki one. See https://pure.tue.nl/ws/portalfiles/portal/1959440/Metis199989.pdf for reference

RaphaelS1 commented 5 years ago

As your own experiment. Could you test the equation for different thresholds for large lambda? So try both equations for rate = 1,10,20,30,50,100

RaphaelS1 commented 5 years ago

From my own mini experiment just testing the absolute difference between the Poisson distribution and its Normal approximation at varying lambda, and from previous conventions, I am in favour of rate = 30. But I will defer to @fkiraly for a more mathematical answer.

fkiraly commented 5 years ago

Should this not be part of issue #62 ?

fkiraly commented 5 years ago

Also, what equation/threshold is this about?

RaphaelS1 commented 5 years ago

Should this not be part of issue #62 ?

I asked Xiaowen to separate them as this is a more general question about "when is the Poisson lambda parameter considered 'large'" which is an old question with no general consensus (although I was taught when lambda = 30).

Also, what equation/threshold is this about?

Entropy of Poisson. For large lambda, there is a closed form analytic expression.

fkiraly commented 5 years ago

@RaphaelS1, the closed form is explicit only "up to lower order terms", i.e., an error term whose asymptotics you can derive. Without an explicit upper bound, though, you can't just set it to (numeric) zero. It may require some calculations/derivations which we may want to avoid at this point?

w090613 commented 5 years ago

Hi @RaphaelS1 , from my current code, the entropy using two different equations when rate =20 differ to an order of 10^-6 and when rate =30 differ to an order of 10^-7. The lowest difference is when rate is around 83 differing to an order of 10^-9.

RaphaelS1 commented 5 years ago

Thanks for that @w090613. Let's use rate = 30 in the future when needed but for now as per #62 we won't include entropy.