ablab / stringdecomposer

Tool for decomposition centromeric assemblies and long reads into monomers
Other
32 stars 2 forks source link

[Error Report] terminate called after throwing an instance of 'std::length_error' what(): basic_string::_M_create #22

Open 865699871 opened 2 years ago

865699871 commented 2 years ago

We used stringdecomposer to transform DNA sequence into block. Most sequence can work well, but one of DNA sequence failed.

The error report is here.

The length of DNA sequence is 675,518bp and the templet is 450bp. Why this error occur?

Prepared reads 100%: Aligned chr1:250789205-251464723 terminate called after throwing an instance of 'std::length_error' what(): basic_string::_M_create Traceback (most recent call last): File "/home/soft/stringdecomposer-master/bin/stringdecomposer", line 10, in sys.exit(main()) File "/home/soft/stringdecomposer-master/bin/../stringdecomposer/main.py", line 228, in main raw_decomposition = run(args.sequences, args.monomers, args.threads, args.scoring, args.batch_size, raw_decomp_fn, args.ed_thr, logger) File "/home/soft/stringdecomposer-master/bin/../stringdecomposer/main.py", line 194, in run subprocess.run([SD_BIN, sequences, monomers, num_threads, batch_size, ins, dels, mm, match, str(ed_thr)], stdout = f, check = True) File "/home/shgao/.conda/envs/SCAT/lib/python3.9/subprocess.py", line 528, in run raise CalledProcessError(retcode, process.args, subprocess.CalledProcessError: Command '['/home/soft/stringdecomposer-master/stringdecomposer/build/bin/dp', './chr1_250789205_251464723.fa', './chr1_251090002_251199773_450.fa', '1', '5000', '-1', '-1', '-1', '1', '-1']' died with <Signals.SIGABRT: 6>.

TanyaDvorkina commented 2 years ago

Hi,

Thank you for using StringDecomposer!

Is it possible for you to provide dna sequence and template string? If yes, you can send it or link to it to me via email tanunia@gmail.com.

Without seeing input data itself, I can only say that StringDecomposer tuned to work with human monomeric sequences that tend to be shorter (~170bp). Chunks, that StringDecomposer processes, overlap by 500bp (see code line). These overlaps should be more than twice longer than expected monomer sequence, so for monomer sequence of length 450bp overlap size should be at least 900bp (instead of 500bp). Unfortunately, now this constant can be modified only in code.