Open mzettersten opened 6 months ago
Go ahead with import; #125 has been resolved.
First-pass IDless import complete. Ready for code review
Checklist for code review v2024
To start:
Common issues to check:
Trials
Trial Types
Stimuli
Subjects
General
exclusions look to only be for participant-level, not trial-level (probably all we have)
full phrase is missing for some of the data (presumably because we don't have it?) (Alvin confirms we don't have it)
[resolved] looks like trial_type info is coming from a trial_info csv, possibly coded by someone (alvin?) off of raw stimuli? I'm wondering why there are trials that are marked vanilla but have condition mispronounced. condition column of trial_info was miscoded (very understandably) and has been corrected.
I think everything is good except I couldn't track down the images.
It sounds from the readme that the unpublished bh2017 but with younger kids should have videos that could be screenshotted somewhere, but I didn't find it in a cursory look through osf repos.
The schott osf is still private (and the corresponding github https://github.com/e-schott/CrossLanguagePhonologicalOverlap) doesn't seem to have stimuli. Idk if it's worth asking for the stimuli.
@vboyce bh2017 + unpub video files can be found here: https://osf.io/htn9j/ (would also be good to update bh2017 if you get the images (ref #97))
@vboyce Should I reach out and ask? Maybe for images for all of the various projects, if it were to make sense?
@mzettersten reaching out for the various projects for image stimuli would be great!
and @alvinwmtan thanks, I can do the screenshotting for bh2017 and unpub and update both places!
hmm, do the bh2017 files open for you @alvinwmtan ? For me the two bouche ones do, but the others seem corrupted or wrong extension or something, and I can't open them.
import not started, refer to https://github.com/langcog/peekbank-data-import/issues/125 first