Error when using -1 with nonparametric inference

0todd0000 commented 1 year ago

From #188

For number of iterations, if I use -1, I get the error :

The total number of iterations (Inf) is very large and may cause computational problems. In the call to "inference"... (1) Set "iterations" to a number between 10 and 10,000, or (2) set "force_iterations" to "true" to enable calculations for iterations > 10,000.

What is the minimum number of iterations necessary and how does it alter the result?

Thanks, Vignesh

0todd0000 commented 1 year ago

If you add 'force_iterations', true to the argument list the error will disappear.

Please see #173, #106 and other issues in this forum for additional details regarding choosing iterations.

vigneshrk29 commented 1 year ago

Hi,

Thanks for the links to to other issues. I have one question pertaining to the number of iterations.

1) From my understanding, increasing the number of iterations leads to convergence and also repeats for random pertubations of the dataset. I have attached an image before, where I have conducted nonparameteric paired ttest for 120,220, 1000 and 10000 iterations. The SPM{t} look qualitatively similar with p value changing based on number of iterations (1/iterations). How does this link with convergence? Have the results still not converged?

ttest2Iterations

I get similar SPM{F} values for one way repeated measure ANOVA with the same variations (but it takes ages to run). I would like to understand further how the iterations may impact the results?

Thanks

0todd0000 commented 1 year ago

In this case the effect size is very large so iterations does not qualitatively affect the results. When effect sizes are smaller you can see very different results for different numbers of iterations and/or random number generator states.

To judge convergence (generally):

start with at least 1000 iterations but preferably 10,000 iterations
run several times (about 10 times) with different random number generator states
if the numerical fluctuations in the critical test statistic value (zc) are small with respect to the difference between the maximum absolute z value (i.e., if zc changes are small with respect to z_max - zc) then there is likely convergence
OR more simply: if the results do not change qualitatively then convergence can be judged to be adequate

vigneshrk29 commented 1 year ago

Thank you. Sorry for the delay but in one of the posts you mentioned that the t-statistic is qualitatively similar to the effect size. I have additionally caculated Cohen's d but would like to know if there should be correalation between SPM{t} and Cohen's d? If the value of SPM{T} is larger, would it mean we should get a larger Cohen's d value?

Thanks

0todd0000 commented 1 year ago

Yes, test statistic values and effect sizes are (nonlinearly) correlated. There are many free resources available for considering test statistics vs. effect sizes, including this one at medium.com

I suggest plotting the Cohen's d trajectory. It will look qualitatively quite similar to the t-statistic trajectory.

vigneshrk29 commented 1 year ago

Thank you. I will do that

vigneshrk29 commented 1 year ago

Hello Todd,

Thank you. Cohen's d value did qualitatively match that of the t-statistic. I would like to clarify two additional questions:

1) If Cohen's d value ranges from 0.05 to 0.5, would you use the maximum value and state the effect size is moderate? In some papers, the calculate Cohen's d values at the point of peak difference but I do have a non-directed analysis. How would you suggest the effect size is reported?

2) In some of the previous tickets, there was a mention of effect size being added to spm code. Has it been added? I am currently using Matlab's meanEffectSize function to calculate Cohen's d.

Thanks

0todd0000 commented 1 year ago

Preliminary effect sizes will be supported in the next major release (version 0.5). I recommend against following standard effect size interpretations like "moderate" because one is much more likely to observe a "moderate" effect when analyzing 1D data.

vigneshrk29 commented 1 year ago

Thank you. What would you recommend is the best way to state effect sizes for 1d data? Would stating the range make more sense ?

0todd0000 commented 1 year ago

Yes, I think the range would be better. An academic paper about this issue, along with interpretation guidelines is currently being prepared; if review goes smoothly these should be available some time in 2024.

vigneshrk29 commented 1 year ago

Hi Todd,

Thanks. I look forward to that paper. I would like to clarify one question about it since stating Cohen's d would allow me to complete this project

1) Since I am performing paired t-test between two groups contained 501 samples each and I am trying to determine the effect size of the significant differences: would stating the range of Cohen's d across the time series be better or would providing a single value of Cohen's d comparing the medians between the two samples be better (in the specific time range)?

Thanks

0todd0000 commented 1 year ago

I can't answer this question because there are no relevant effect size reporting standards of which I am aware.

0todd0000 / spm1dmatlab

Error when using -1 with nonparametric inference #189