ben-domingue / irw

Code related to data for the Item Response Warehouse
https://datapages.github.io/irw/
6 stars 9 forks source link

Do you know the Wooly Bully? Testing era-based knowledge to verify participant age online #41

Closed ben-domingue closed 2 months ago

ben-domingue commented 6 months ago

https://osf.io/bn4xy/?view_only=7252e963f3bd4c0c981eed6ddd085ee8 paper. https://link.springer.com/article/10.3758/s13428-022-01944-y#Sec34

ben-domingue commented 6 months ago

@anniehang is working on this

anniehang commented 6 months ago

I think representing the subject's choice for each item with numerical values is not meaningful in practice. Should I still use the numerical values as the response, or should I use correct(1) or incorrect(0) as the response?

ben-domingue commented 6 months ago

corerct/incorrect would be great!

anniehang commented 6 months ago

ok!

anniehang commented 5 months ago

vertifyage_study1a.csv vertifyage_study1b.csv vertifyage_study2.csv vertifyage_study3a.csv vertifyage_study3b.csv vertifyage_study3c.csv

ben-domingue commented 5 months ago

morning @anniehang. this is great!

i am wondering if we could perhaps collapse studies 1b, 3a, 3b, and 3c. they seem to be based on a common version of the measure (best i can tell, we can check this) and we can add an 'age' column and then also a 'group' column (with values: 1b, 3a, 3b, 3c). what do you think? my general hope is to have relatively few datasets coming from each study and to the extent that we just have different people taking the same measure here i'm ok with that (but it would be good for us to verify that they are taking the same measure; they seem to be from my skim of it but do you have add'l insight?).

ben-domingue commented 5 months ago

also we should make 'i dont know' NA if we haven't already

anniehang commented 5 months ago

Ok, I think that's a good idea. The number of participants reported in the paper reflects the count after missing values were removed. so I directly eliminated participants with missing data from the analysis.

ben-domingue commented 5 months ago

do you want to put together an updated dataset and i can take a look at it? and, just to confirm, the 'don't know' responses were already reported as NA values?

anniehang commented 5 months ago

ok, i got it!

anniehang commented 5 months ago

vertifyage_study1a1b3b3c.csv vertifyage_study2.csv vertifyage_study3a.csv

ben-domingue commented 5 months ago

one question about 1a1b3b3c: there are only 18 items but it looks (from the text) like there should be 19. did one get dropped for some reason?

anniehang commented 5 months ago

Oh, I made a mistake; there are 19 items in 1a. I think I should merge 1a and 2 together. However, what confuses me is that the data for 1b, 3b, and 3c only provide 18 items.

ben-domingue commented 5 months ago

hm. very strange. if you think that something may be wrong in their data (such that it doesn't map onto what they say), i can email them? i'm sure they'd be happy to know that there is a problem if we are confused.

anniehang commented 5 months ago

ok, I can send them an email to inquire about it.

ben-domingue commented 5 months ago

i'm also happy to email them! let me know if you prefer that? no big deal for me to do it :)

anniehang commented 5 months ago

If it's not too much trouble, I would be happy if you could email them. šŸ˜ƒ

ben-domingue commented 5 months ago

happy to :)

ben-domingue commented 2 months ago

second email sent july 2. @ben-domingue

ben-domingue commented 2 months ago

They finally got back to me: Thanks for reaching out and for your interest in our paper! Iā€™m actually seeing 19 in some and 20 in others, with the 20th extra item being ā€œTik Tok,ā€ which didnā€™t make it into our final list of 19 (it was in the original development of the items, which you can see in the supplemental materials). I canā€™t say why it was dropped in some of the studies since data collection was a long time ago (and I wasnā€™t involved yet). If you decide to include the items in the IRW, you should include the 19 that are listed in Table S1. What I would propose is that we just subset to that final list of 19 if that is easy?

ben-domingue commented 2 months ago

@anniehang see above

anniehang commented 2 months ago

ok, i got it !

ben-domingue commented 2 months ago

thanks annie. you are the best :)

anniehang commented 2 months ago

Apologies! I realized my previous mistake. Some sub-datasets did not provide answers for Q8, while others had increased answers for the 'TikTok' column, so I mixed them up. By reviewing the participants' original answers to Q8, I can infer whether their responses were correct. Now each sub-experiment has 19 items.

anniehang commented 2 months ago

AVI-S.csv

ben-domingue commented 2 months ago

great!!