jasp-stats / jasp-issues

This repository is solely meant for reporting of bugs, feature requests and other issues in JASP.
59 stars 29 forks source link

Feature request: Bayesian Kruskal Wallis/ "Non-parametric" BANOVAs #284

Closed AugustOlsson closed 9 months ago

AugustOlsson commented 5 years ago

First, I am thankful for the inclusion of the Bayesian Mann-Whitney test in the latest update of JASP.

Enhancement request: A Baysian verison of Kruskal-Wallis H-test and/or other baysian versions of non-parametric tests for data with an independent variable with three or more levels (such as Van der Waerden test) would be very useful. Purpose/Use-case: We often work with an independent variable with more than two levels, making Kruskal-Wallis H-test or similar rank-based tests neccesary for testning data not fullfilling assumptions for parametric testing.

Best, August

EJWagenmakers commented 5 years ago

Great suggestions! @JohnnyDoorn ?

CharlesCousyn commented 3 years ago

Any news from this feature? I'm very interested to perform with JASP a Bayesian non parametric One Way Anova in my current project. (Bayesian equivalent of Kruskal-Wallis H-test)

JohnnyDoorn commented 3 years ago

Hi @CharlesCousyn,

Unfortunately we are still in the research phase with this analysis (there is no comprehensive way for a Bayes factor for this test yet, as far as I know), so it will still take some time before this analysis can be in JASP (perhaps a year or so).

Kind regards, Johnny

CharlesCousyn commented 3 years ago

Thank you for your response I understand! Good luck with that!

mmpeng9 commented 2 years ago

Hi @JohnnyDoorn, I am also very interested in this feature. Any updates for it? Thanks!

JohnnyDoorn commented 2 years ago

Hi @maomiaopeng,

My own research has unfortunately shifted to other topics, and the development of this test had never quite taken off I'm afraid. My hope is that other researchers develop a Bayes factor Kruskal-Wallis test, which we could then implement in JASP. For now, if you are only interested in the one-way case, you could take a look at transforming your dependent variable to ranks, and then performing a Bayesian ANOVA. This is based on the approach by Conover, W. J.; Iman, R. L. (1981). Rank transformations as a bridge between parametric and nonparametric statistics. American Statistician. 35 (3): You can read more about the approach (which is quite rough, but ought to be a decent alternative) here..

Kind regards Johnny

mmpeng9 commented 2 years ago

@JohnnyDoorn, Thank you so much for the help! I'll read them and try. Wish you all the best in the other topics!

andrianarimisa commented 9 months ago

Let's suppose that I found 30%, 25%, 25% , 10%, and 10% the responses of people that I interviewed respectively choosing event A, B, C, D, and E... If I want to use "inductive approach" for the analysis as I want to propose a general theorie on it, without doing inferentiel statistique , could someone help me to find the best way to do it...

JohnnyDoorn commented 9 months ago

@andrianarimisa What sort of test do you want to do? Do you want to test whether these percentages/counts meaningfully differ from uniformly distributed frequencies (i.e., 20%)? In that case, I would suggest conducting a (Bayesian) multinomial test where you specify your observed counts. Also I am not sure this is the best platform to ask your question - you can also use the JASP forum

tomtomme commented 9 months ago

closing as duplicate of https://github.com/jasp-stats/jasp-issues/issues/1531 That one is about classic and bayesian versions of those tests and has more references, so let's discuss there.

andrianarimisa commented 5 months ago

I got a critics from a rewier on a manuscript because of the test I use for a percentage data (Student T.test) : It is the percentage of individuals of Sp1 and Sp2 of Mosquitos captured in seven sites, which means percentage of individuals with Plasmodium in their Salivary gland.

I did t-test as the data are normally distributed - but the critique is Student t.test is far from appropriate to compare proportion data .. but find more appropriate test - which ones ?

Below is the data

Mosquito.sp1<-c(68.42, 51.43,11.27,100,47.06,42, 25.33)

Mosquito.sp2<-c(80,68.54,71.55,84.62,91.43,62)

Can anyone help me ? Why and what other thests sould I use to compare percentage data then ?

Cheers


From: Thomas Langkamp @.> Sent: Sunday, February 18, 2024 11:44 PM To: jasp-stats/jasp-issues @.> Cc: andrianarimisa @.>; Mention @.> Subject: Re: [jasp-stats/jasp-issues] Feature request: Bayesian Kruskal Wallis/ "Non-parametric" BANOVAs (#284)

closing as duplicate of #1531https://github.com/jasp-stats/jasp-issues/issues/1531 That one is about classic and bayesian versions of those tests and has more references, so let's discuss there.

— Reply to this email directly, view it on GitHubhttps://github.com/jasp-stats/jasp-issues/issues/284#issuecomment-1951440130, or unsubscribehttps://github.com/notifications/unsubscribe-auth/ADHHXCMFMJMUWK3YEMULMTLYUJRZPAVCNFSM4GOO5UTKU5DIOJSWCZC7NNSXTN2JONZXKZKDN5WW2ZLOOQ5TCOJVGE2DIMBRGMYA. You are receiving this because you were mentioned.

JohnnyDoorn commented 5 months ago

Hi @andrianarimisa ,

So you want to test whether there is a significant difference in percentages between sp1 and sp2, across these different sites? Do you have the raw data instead of only percentages? Because then you could run a contingency table on the counts of sp1/sp2 and whether there is plasmodium, maybe conditional on the sites where you measured. This would result in a series of 7 chi squared tests. You could combine the counts if you want to assume there is no difference between the 7 sites.

Cheers, Johnny

andrianarimisa commented 5 months ago

Yes, I want to compare the difference if there is a significant difference of the percentage between Sp1 and Sp2... I do not have in total the rough data as some where from a person involved in the study - unfortunately for Sp 2 ---

One idea I had , but I dont know if it is correct : just supposing that all individuals captured and tested were based on 100 individuals so if of them

Mosquito.sp2<-c(80,68.54,71.55,84.62,91.43,62) - something like 80 indiv; 67 indiv; 72 indiv, 85 etc... But I am alittle bit afraid as it might be cheating – so that's why I used proportion to let it be harmonized but I got that "reviewer comments... And I am get stuck !

Regards, Aristide


From: Johnny van Doorn @.> Sent: Monday, June 17, 2024 2:18 PM To: jasp-stats/jasp-issues @.> Cc: andrianarimisa @.>; Mention @.> Subject: Re: [jasp-stats/jasp-issues] Feature request: Bayesian Kruskal Wallis/ "Non-parametric" BANOVAs (#284)

Hi @andrianarimisahttps://github.com/andrianarimisa ,

So you want to test whether there is a significant difference in percentages between sp1 and sp2, across these different sites? Do you have the raw data instead of only percentages? Because then you could run a contingency table on the counts of sp1/sp2 and whether there is plasmodium, maybe conditional on the sites where you measured. This would result in a series of 7 chi squared tests. You could combine the counts if you want to assume there is no difference between the 7 sites.

Cheers, Johnny

— Reply to this email directly, view it on GitHubhttps://github.com/jasp-stats/jasp-issues/issues/284#issuecomment-2173118045, or unsubscribehttps://github.com/notifications/unsubscribe-auth/ADHHXCOOKTTAMQQ2JY3B5TTZH3AYTAVCNFSM4GOO5UTKU5DIOJSWCZC7NNSXTN2JONZXKZKDN5WW2ZLOOQ5TEMJXGMYTCOBQGQ2Q. You are receiving this because you were mentioned.Message ID: @.***>

andrianarimisa commented 5 months ago

By the way, is there a real "reference stated " that t-test is not appropriate for proportion ? I was looking on the web all references but did not find any , that's why I am asking..

Cheers


From: Aristide Andrianarimisa @.> Sent: Monday, June 17, 2024 3:19 PM To: jasp-stats/jasp-issues @.>; jasp-stats/jasp-issues @.> Cc: Mention @.> Subject: Re: [jasp-stats/jasp-issues] Feature request: Bayesian Kruskal Wallis/ "Non-parametric" BANOVAs (#284)

Yes, I want to compare the difference if there is a significant difference of the percentage between Sp1 and Sp2... I do not have in total the rough data as some where from a person involved in the study - unfortunately for Sp 2 ---

One idea I had , but I dont know if it is correct : just supposing that all individuals captured and tested were based on 100 individuals so if of them

Mosquito.sp2<-c(80,68.54,71.55,84.62,91.43,62) - something like 80 indiv; 67 indiv; 72 indiv, 85 etc... But I am alittle bit afraid as it might be cheating – so that's why I used proportion to let it be harmonized but I got that "reviewer comments... And I am get stuck !

Regards, Aristide


From: Johnny van Doorn @.> Sent: Monday, June 17, 2024 2:18 PM To: jasp-stats/jasp-issues @.> Cc: andrianarimisa @.>; Mention @.> Subject: Re: [jasp-stats/jasp-issues] Feature request: Bayesian Kruskal Wallis/ "Non-parametric" BANOVAs (#284)

Hi @andrianarimisahttps://github.com/andrianarimisa ,

So you want to test whether there is a significant difference in percentages between sp1 and sp2, across these different sites? Do you have the raw data instead of only percentages? Because then you could run a contingency table on the counts of sp1/sp2 and whether there is plasmodium, maybe conditional on the sites where you measured. This would result in a series of 7 chi squared tests. You could combine the counts if you want to assume there is no difference between the 7 sites.

Cheers, Johnny

— Reply to this email directly, view it on GitHubhttps://github.com/jasp-stats/jasp-issues/issues/284#issuecomment-2173118045, or unsubscribehttps://github.com/notifications/unsubscribe-auth/ADHHXCOOKTTAMQQ2JY3B5TTZH3AYTAVCNFSM4GOO5UTKU5DIOJSWCZC7NNSXTN2JONZXKZKDN5WW2ZLOOQ5TEMJXGMYTCOBQGQ2Q. You are receiving this because you were mentioned.Message ID: @.***>