DiltheyLab / HLA-LA

Fast HLA type inference from whole-genome data
GNU General Public License v3.0
120 stars 40 forks source link

No contigs for extraction specified #90

Closed kvaldez closed 11 months ago

kvaldez commented 1 year ago

Hello! I created an extraction file for my data based on the documentation, however I'm running into this error:

No contigs for extraction specified in /opt/conda/envs/hla-la/opt/hla-la/src/../graphs/PRG_MHC_GRCh38_withIMGT/knownReferences/hla-la_hg19_Compass.txt? at /opt/conda/envs/hla-la/bin/HLA-LA.pl line 391.

I attached my file here, can you suggest a solution? hla-la_hg19_Compass.txt

Thanks, Kristin

Zoeyoungxy commented 1 year ago

Hello! I met the same question. Have you solved the problem? The error message: Using /home/anaconda3/envs/hlala/opt/hla-la/src/../graphs/PRG_MHC_GRCh38_withIMGT/knownReferences/reference_extraction_UNIX.txt as reference file. No contigs for extraction specified in /home/anaconda3/envs/hlala/opt/hla-la/src/../graphs/PRG_MHC_GRCh38_withIMGT/knownReferences/reference_extraction_UNIX.txt? at /home/anaconda3/envs/hlala/bin/HLA-LA.pl line 391.

The samtools idxstats file and reference_extraction file: reference_extraction_UNIX.txt sample.idxstats.txt

Any suggestions will be highly appreciated.

Thanks, -Zoey

kvaldez commented 1 year ago

Hi Zoey! No unfortunately I haven't resolved the issue either.

Kristin

AlexanderDilthey commented 11 months ago

Hi @kvaldez and @Zoeyoungxy,

The issue is that the file does not specify any genomic locations as targets for the extraction process - i.e. there are no values / only 0 in the columns ExtractCompleteContig, PartialExtraction_Start, and PartialExtraction_Stop.

Check out e.g. https://github.com/DiltheyLab/HLA-LA/blob/master/additionalReferences/PRG_MHC_GRCh38_withIMGT/G1000_Ymasked.txt for an example - the general strategy should be to mark the complete MHC for extraction, plus any alternative MHC or HLA allele contigs.

Best wishes,

Alex