Closed edsu7 closed 9 months ago
Jon will reindex the data.
Multiple issues
submitterSpecimenId
will be addressed and fixed in dictionary release 1.19submitterSpecimenId
as submitterDonorId
, @hknahal will share list of discrepancies with data submitter to verify. Followed up with deletion and resubmission.@lindaxiang @edsu7 Closing this ticket since 156 donors were removed. Followed up with data submitter regarding 1,248 donors with submitter_donor_id discrepancies. Can create a separate ticket for this once data submitter confirms issue and whether they want to have them deleted.
@edsu7 can you add update?
Superseded by https://github.com/icgc-argo/workflow-roadmap/issues/395
Breakdown of 1,571 donors showing in Program Dashboard:
156 samples need to removed at request of program (will be addressed in this ticket: https://github.com/icgc-argo/workflow-roadmap/issues/267): https://docs.google.com/spreadsheets/d/1ggshqQS9c40nPizDjxuhEG7hHdyvDCSZ/edit#gid=425884767
submitter_specimen_id
field which include commas (highlighted in yellow). The regex in the dictionary should have prevented this, so unclear how these got validated. Created a separate ticket to address this: https://github.com/icgc-argo/argo-dictionary/issues/405An additional 1,248 donors have discrepancies that will need to followed up with program: https://docs.google.com/spreadsheets/d/1ggshqQS9c40nPizDjxuhEG7hHdyvDCSZ/edit#gid=902346357
167 donors were imported from ICGC25K: https://docs.google.com/spreadsheets/d/1ggshqQS9c40nPizDjxuhEG7hHdyvDCSZ/edit#gid=60315048