Open ShawnKing98 opened 3 weeks ago
Hello! Thank you for getting in touch. For Q1–3, note that the Ns given in the paper reflect number of human participants rather than number of trials. Additionally:
.npy
file, so keep an eye out for that update!entwisle_norms.csv
) and adults (in adult/
). For each cue, we look at the distribution over all targets, and compare with model-rated similarities between the cue and each target. As in (4), I'm hoping to provide a Python version of the analysis which should make it easier to use.
Hi, I'm trying to test my own model on DevBench and look at the difference between model prediction & human prediction. I'm able to generate the embedding features and prediction logits by running
eval.py
, but I'm having some troubles while parsing the human prediction data. Could you please help me with this? The details are listed below:manifest.csv
that go through the model, but only 108 human trials data in your providedhuman.csv
; neither is consistent with what you wrote in the paper (N=1780). Is there data missing or am I getting something wrong?human.rds
file. I guess it's an R file and tried to parse it using some python package, but failed to do so. Since there are many people who are familiar with python but not with R, is it possible for you to provide an alternative file that is readable by python? I'll really appreciate that!entwisle_norms.csv
file and severalCue_Target_Pairs
files, but I have no clue what they mean. Could you please elaborate on the format of these human performance data file, or provide a code template that is able to parse the human data?Thank you very much for your patience, and I appreciate any help you can provide!