Closed philipwfowler closed 9 months ago
I've done some digging as this sample seems to take ~1h to get to this stage, so its been easy to background The issue is that some variants aren't being assigned any vcf evidence. I don't have time to do another run through today, so I'm dumping the nucleotide indices here so I can pick this up on Monday:
3981284
3981285
3981286
3981437
3981438
3981500
3981536
3981622
3981676
3981739
On a side note, the fact that this takes so long is concerning and leads me to think that multithreading and/or rewriting this will be required in the near future. This took 1h and didn't even complete a list of variants, let alone mutations and effects...
I've got a fix in place for this now, but the process took >1h and produced an output JSON of ~300MB
More difficult to diagnose as I don't know which line in the VCF file is triggering this error.
Estimates affects 20 samples out of 44,139.
site.07.subj.277B68C2-382F-4363-9DAB-22EAECE8BBE2.lab.277B68C2-382F-4363-9DAB-22EAECE8BBE2.iso.1.v0.12.4.per_sample.vcf.gz