Closed alhafidzhamdan closed 3 years ago
Hello,
This error is very likely being thrown because the length of one of your regions is zero in the BED file ("tot" = 0). From the head of your input file, I see one region that is only 1 bp in length (ORF29 region in line 2), thus I would check if you have any regions in your input that are zero length. I would also consider removing very short regions from your input (e.g. < 10-50 bp) as the statistical methods may be unreliable for such short windows.
Let me know if this helps. Feel free to send your input file to me as well if you are having trouble identifying the source of the issue.
Hi there, Thanks for getting back! Yes it seems to work- i've excluded regions <50bp. I have encountered another error while trying to use a blacklist bed file. Here are my commands and the error:
python ../mutEnricher.py coding ../../Elements/Annotations/cds_mutenricher_hg38.gtf vcf_files.txt --use-local -c covariates/gene_covariates.txt --anno-type SnpEff -p 12 --gene-field gene_name -o coding --blacklist ../../Elements/Annotations/Blacklisted_SSMs.tsv
--------------------------MUTENRICHER CODING--------------------------
MutEnricher version: 1.3.2
----------------------------INITIALIZATION----------------------------
Output directory for results: coding
Analysis prefix: mutation_enrichment_
Statistical testing type: nsamples
Considering all variants in background rate calculations.
Annotation type: SnpEff
Considering both SNPs and indels in analysis.
--use-local selected with covariates provided. Local backgrounds will be considered in covariate cluster rate calculations.
Set pool with 12 processors
-----------------------------LOADING GENES----------------------------
Loading GTF...
Deleting 31 genes annotated to multiple chromosomes.
GTF loaded.
Loading genes...
Loaded 20265 genes from input GTF file.
Loading blacklist variants file...
Traceback (most recent call last):
File "../mutEnricher.py", line 222, in <module>
if __name__ == '__main__': main()
File "../mutEnricher.py", line 46, in main
run(parser, args, version)
File "/Non-coding/MutEnricher/coding_enrichment.py", line 258, in run
blacklist = load_blacklist(bl_fn)
NameError: name 'load_blacklist' is not defined
I could not find a def line for load_blacklist in the coding_enrichment.py.
A
Glad to hear the first issue is resolved.
Thank you for pointing out the issue with the blacklist file parsing - I had not included this function in the coding enrichment code as I rarely use this option. I have now included it in the updated version of the overall code (version 1.3.3) and verified that it was working on a test set. Please give the update a try and let me know if you encounter any other issues.
That works! Thanks for your support! Very much appreciate it! A
Hi there,
I tried to create a covariate file using a processed bed file from GencodeV38, using the command below and I got this error. Could you please help me troubleshoot?
I'm not sure what's causing this error- happy to provide you with the bed file if you think it'd be useful.
A