parallelization simulation based indexes and inference wrappers

pysal / segregation

Segregation Measurement, Inferential Statistics, and Decomposition Analysis

https://pysal.org/segregation/

BSD 3-Clause "New" or "Revised" License

112 stars 26 forks source link

Closed renanxcortes closed 3 years ago

renanxcortes commented 5 years ago

Some indexes such as Modified Dissimilarity (Dct), Modified Gini (Gct) and Bias-Corrected Dissimilarity (Dbc) could be leveraged to work in parallel since they rely on independent draws of probability distributions and recalculating the index.
Also, the inference wrappers (Infer_Segregation and Compare_Segregation) could be leveraged to work in parallel since they rely on independent simulations framework.

knaaptime commented 5 years ago

this might be relevant

which also raises the issue that we might want to allow a seed argument in the current implementation

renanxcortes commented 5 years ago

I was just surfing on the web reading some posts and read something that let me wondering if we could use some kind of just-in-time for these functions: http://numba.pydata.org/numba-doc/0.17.0/user/jit.html or something similar to what it is possible to do in R with for loops with the compiler package: https://www.r-statistics.com/2012/04/speed-up-your-r-code-using-a-just-in-time-jit-compiler/

knaaptime commented 5 years ago

i'm familiar with numba in concept but never been sure what kind of code its able to speed up. maybe we could try some speed tests?

knaaptime commented 3 years ago

solved for inference classes by #174

knaaptime commented 3 years ago

resolved by #183