File "/home/rmonti/miniconda3/envs/janggu/lib/python3.6/site-packages/janggu/utils.py", line 308, in _str_to_iv
raise Exception('Unable to parse {} into genomic interval:\n{}'.format(givstr, e))
Exception: Unable to parse chr17:-200-800 into genomic interval:
invalid literal for int() with base 10: ''
The original error message only contained the last line (invalid literal...), I wrote the exception handling to see what was going on.
Arguably trying to parse sequence beyond the chromosome boundaries should lead to an error (?), but I would prefer there being a warning and the sequence to be zero-padded instead.
Thanks @remomomo for pointing that out.
I've fixed this issue. Now, subintervals that stretch beyond the chromosome start or end will be zero-padded automatically.
I've run into an issue when trying to load fasta sequences from a reference.
I have a sequence with the following coordinates in my ROI .BED-file:
I try to load the sequences from an hg19 reference:
This gives me an error:
The original error message only contained the last line (
invalid literal...
), I wrote the exception handling to see what was going on.Arguably trying to parse sequence beyond the chromosome boundaries should lead to an error (?), but I would prefer there being a warning and the sequence to be zero-padded instead.