vlink / marge

4 stars 6 forks source link

error during prepare_files #7

Closed msubirana closed 1 year ago

msubirana commented 1 year ago

I am trying to use mmarge in a human chip-seq using a set of somatic mutations in the same sample to analyse the motif in them. But I got the following error:

(mmarge-1.0-env) bash-4.2$ MMARGE.pl prepare_files -files $vcf -core 12 -genome $genome -force -no-genome -add -ind NET10_TI,NET10_BL -dir $dir -genome_dir $genome_dir
Using multithreading for IO writing
Checking if /gpfs42/robbyfs/scratch/lab_lpasquali/msubirana/marc/insulinomas/mmarge_prove/NET10_TI_BL_sorted.vcf is sorted by chromosome and position.
This make take a while.
If you want to skip this step, restart with -no-check. If the file is not sorted, output files are overwritten.

File /gpfs42/robbyfs/scratch/lab_lpasquali/msubirana/marc/insulinomas/mmarge_prove/NET10_TI_BL_sorted.vcf is sorted.

None of the individuals you specified is in the VCF files!
Here are all individuals specified in the header of your file:
    NET10_TI
    NET10_BL
Processing all individuals in 5 seconds
Print Ctrl + C to abort
.....
Starting to process all individuals
Reading in file /gpfs42/robbyfs/scratch/lab_lpasquali/msubirana/marc/insulinomas/mmarge_prove/NET10_TI_BL_sorted.vcf
Merge files and mutations!
Reading in file and caching!
Status: 100% Completed

Processing NET10_TI
        chromosome chr1 allele 1
Can't open /gpfs42/robbyfs/scratch/lab_lpasquali/msubirana/marc/insulinomas/mmarge_prove/dir/NET10_TI/chrchr1_allele_1.mut: No such file or directory
vlink commented 1 year ago

Hi,

It seems that in your VCF file the chromosomes are not labeled as 1, 2, 3 etc. but chr1, chr2, chr3 etc. which I think might cause this error. Could you reformat your VCF file and try again?

msubirana commented 1 year ago

Ok thanks! I will take a look.

Another question. I already called the h3k27ac peaks of my chip data. Can it be possible to use mmarge in these peaks and using a vcf for analyzing differential motif between mutated and wildtype?

msubirana commented 1 year ago

Ok my vcf contained TUMOR and CTRL samples, I removed CTRL column (that was empty and now it works)

What do you thing about:

I already called the h3k27ac peaks of my chip data. Can it be possible to use mmarge in these peaks and using a vcf for analyzing differential motif between mutated and wildtype?

vlink commented 1 year ago

yes that is the main usage for MMARGE. After you prepared the genomes and mutation files, you can run mutation_analysis for this. If you have any more questions regarding how to run the pipeline, you can email me directly: verena.m.link@gmail.com