genome-in-a-bottle / giab_data_indexes

This repository contains data indexes from NIST's Genome in a Bottle project.
232 stars 71 forks source link

multiple primary records for same read group RMNISTHS_30xdownsample.bam #22

Open nbedelman opened 1 year ago

nbedelman commented 1 year ago

Hello,

I'm trying to test out a variant calling pipeline using the GIAB BAM file downloaded from the ftp server (/ReferenceSamples/giab/data/NA12878/NIST_NA12878_HG001_HiSeq_300x/RMNISTHS_30xdownsample.bam), but am getting the following error:

Fatal error: Assertion failed in ../src/host/dragen_api/bam2dbam_transformer.cpp line 445 -- false -- There are multiple input primary records for read HWI-D00360:5:H814YADXX:2:2215:17273:66909, in the same read group. This is a violation of the BAM standard, which indicates that if two records have matching QNAME, they should be construed as deriving from the same template. Perhaps there was an error in setting up the read groups during BAM creation.

I saw a previous issue for a separate file where bams were merged improperly. Could that be happening here? Thanks!

Nate