Closed stevekm closed 6 years ago
Hi stevekm,
If aligning your reads using the Ensembl reference genome is not an option, I would suggest the following:
for example: sed 's/(.*)/chr\1/' Conpair/data/markers/GRCh37.autosomes.phase3_shapeit2_mvncall_integrated.20130502.SNV.genotype.sselect_v4_MAF_0.4_LD_0.8.bed > Conpair/data/markers/GRCh37.autosomes.phase3_shapeit2_mvncall_integrated.20130502.SNV.genotype.sselect_v4_MAF_0.4_LD_0.8_chr.bed
Make sure the reference genome you used to map reads have .dict and .fa.fai files in the same directory
Run: $ Conpair/scripts/run_gatk_pileup_for_sample.py -B tumor.bam -O TUMOR_pileup -M Conpair/data/markers/GRCh37.autosomes.phase3_shapeit2_mvncall_integrated.20130502.SNV.genotype.sselect_v4_MAF_0.4_LD_0.8_chr.bed --remove_chr_prefix -R
Please let me know if it worked for you.
Best wishes, Ewa
I'm closing this issue, as there have not been any further questions/comments.
I followed the instructions as specified in the README for setting up the
human_g1k_v37.fa
reference files. However, when I tried to run the program, I got this message:As the error message describes, the contigs in my sample .bam file are all "chr1", "chr2", etc., while those in the reference file are instead labeled as "1", "2", etc. I aligned my .bam file against the UCSC hg19 genome.fa file. Using GATK version 3.8.
Is there a recommended solution for this? Or can I just rename the contigs in the reference file to match the ones in my .bam file?