Open nsmt89 opened 4 years ago
The start/stops of putative regions suggest that the script parameters used we not optimal. Instead of using 0, you can try other values like 1 or 2. You can also use another parameter (entered after the 0, 1 or 2), which define the minimum size for a region to be considered sc or ir. The default it 10,000. Try 2000 and see what your results are.
Syntax: perl sequence_based_ir_id.pl sequence_file_to_split name_base skip_parameter min_region_size
I see you have been struggling with your assembly across GetOrganelle and NOVOplasty as well. Send me the file you are putting into sequence_based_ir_id.pl and I will troubleshoot. I can get a sense of what is going on more quickly than a back and forth.
Thank you very much for noticing. Can I email my fasta file directly?
Np. mrmckain at ua dot edu
I have emailed my fasta file. Thank you.
Hi, I tried using sequence_based_ir_id.pl for my assembly that I assembled from different software, but it only give me 3 regions (ir, ir, sc). The IRs are not even the same length (however only differ about ~1k bp) but the lengths of those regions seem logical for my genome (for LSC and IR) as it is near to related available chloroplast genome. Can I still locate the SSC and produce the full plastome with the right orientation? Do you have any suggestion? CN_NOVO_MITO_1_regions_split0.txt