Closed maxbiostat closed 5 years ago
Thanks. A PR would be great, with a couple of minor caveats:
I was sure I did this (but clearly I didn't). So perhaps I'm remembering somewhere else in the code. But that's up to me to find out.
I'm busy this week, but should be able to look at it next week?
Can you squash all commits in your repo into a single commit?
Squashing done. Please take your time to review this. Thanks for implementing these techniques in an open-source package. Least I can do is contribute a little bit. I'm still a bit puzzled why pulling the frequencies and values from `internal` wouldn't work; maybe I'm doing something wrong. I reckon you planned to have some sort of compression, and that's why `internal` includes `freq` and `values`.
@maxbiostat Thanks for the PR #80. I've merged and made a few changes.
One of my plans is for the discrete distributions to be able to handle much larger datasets. This means avoiding using the underlying data and only using freq/values, i.e. the data compression idea.
Happy to take further PRs for the other discrete distributions. Thanks
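To make the freq/values idea concrete, here is a minimal sketch in Python of what compressing a discrete dataset could look like. It is only an illustration: the helper name `compress` is hypothetical, and the names `values` and `freq` simply mirror the terms used in this thread, not the package's actual internals.

```python
from collections import Counter

def compress(data):
    """Tabulate discrete data into distinct values and their frequencies.

    Hypothetical helper for illustration only.
    """
    counts = Counter(data)
    values = sorted(counts)             # distinct observed values, ascending
    freq = [counts[v] for v in values]  # how many times each value occurs
    return values, freq

data = [1, 1, 2, 1, 3, 2, 1, 7, 1, 2]
values, freq = compress(data)
```

After this step, anything that depends only on the distinct values and their counts can work from `values` and `freq` and no longer needs the full data vector, which is what would matter for much larger datasets.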
I was wondering whether implementing a compressed likelihood for the discrete distributions would speed up estimation. The basic idea is to multiply the log-likelihood contribution of each distinct value `x` by the frequency of `x`, to avoid computing the same thing over and over. I have an implementation over on my fork. I'm happy to submit a PR for you to review and fix so that it conforms to the package's standards. Here is some supporting info:
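As a rough illustration of the idea (a sketch in Python, not the implementation on the fork and not the package's code), the compressed log-likelihood weights each distinct value's log-probability by its frequency. A Poisson model is used here purely as a stand-in for a discrete distribution, and the function names are hypothetical.

```python
import math
from collections import Counter

def poisson_logpmf(x, lam):
    # log P(X = x) for a Poisson(lam) variable; stand-in discrete model
    return x * math.log(lam) - lam - math.lgamma(x + 1)

def naive_loglik(data, lam):
    # one log-pmf evaluation per observation
    return sum(poisson_logpmf(x, lam) for x in data)

def compressed_loglik(values, freq, lam):
    # one log-pmf evaluation per distinct value, weighted by its count
    return sum(f * poisson_logpmf(v, lam) for v, f in zip(values, freq))

data = [1, 2, 2, 3, 3, 3, 5, 5, 1, 2]
counts = Counter(data)
values = sorted(counts)
freq = [counts[v] for v in values]

naive = naive_loglik(data, lam=2.5)
compressed = compressed_loglik(values, freq, lam=2.5)
```

The two results agree up to floating-point rounding, but the compressed version evaluates the log-pmf once per distinct value rather than once per observation, so the saving grows with the number of repeated values in the data.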