Closed tjmahr closed 8 years ago
029L is special. I wrote a ReadMe file about them in their raw data folder:
029L was seen for Visit A in April. The child did not return until four months later, so we decided to start all over with this child. I decided to archive the April visit data in a zip file so that it is archived but not readily available. The email history regarding this child is given below. --TJM, 8/30/2013
The child had 34 in their filenames because that was the age at the original, archived visit, but a 38 in the AgeAtvA_1-2
column.
Not every task was redid, so e.g. the child's EVT is at 34 months but the PPVT is at 38 months. This case was part of our motivation to include the date of all tasks in the database and compute age on a task by task basis on the fly.
I created issue #36 to pinpoint more instances like this one...
Fixed all discrepancies. In each case, the filename/stimlog version of the ID was chosed and the database record was updated.
Updated the original spreadsheet sources for 129L, 130L, 608L, 628L, 651L where the discrepancy originated. Did not 029L because of this child's gap between visits.
@patrickreidy found some discrepancies between the ChildStudy.LongResearchID field and the long IDs used in some filenames. He provided a spreadsheet of the discrepancies
The purpose of the LongResearchID field is to document a piece of data (the ID we assigned to a participant) and to make it easier to link a ChildStudy record to raw data files which use that piece of data in the filename.
The database IDs were created by concatenating columns of hand-entered data from Excel spreadsheets. The hand-entered values and the research id used on test day are not created at the same time or by the same person, I think, so it's not surprising that there are a handful of discrepancies over ~600 child-study pairings.