Open greatfireball opened 6 years ago
@PfaffS can you link/post the default order here within the issue once more?
Sure, here we go. The default order should be:
<-------------LSC-------------><-------IRB-------><--------SSC--------><-------IRA------->
<psbA(-)--------------rpl22(-)><------rrn23(+)---><ndhF(-)---ndhD(-)--><------rrn23(-)--->
Please name and/or link the source @PfaffS
Sorry, my bad. Michael R. McKain uses this in Fast-Plast (https://github.com/mrmckain/Fast-Plast), which says that it "identifies regions from the quadripartite structure of the chloroplast genome, assigns identity, and orders them according to standard convention". Here is the link: https://github.com/mrmckain/Fast-Plast/issues/22
thx!
Let me quote a little more:
Orientation is determined by looking at the relative orientation of the rpl and rps genes in the LSC, all genes in the SSC, and the rrn rRNAs in the IR. The code orientates the LSC so there are more "-" strand rps and rpl genes than "+", more "-" strand than "+" strand genes in the SSC, and with rrn genes on the "-" strand for the IRA. This works for most lineages (that we know of) in angiosperms. In reality, the SSC is probably in both directions across copies of the plastome in a plant. This is more for convention than anything else.
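To make the quoted rule concrete, here is a rough Python sketch of how the flip decision could look. This is not Fast-Plast's code, only an illustration of the convention as described in the quote. The names `needs_flip` and `reverse_complement` and the `(gene name, strand)` input format are assumptions made up for this example. Ties (equal numbers of "+" and "-" genes) are left unflipped, which is also an assumption. The IRB case is not in the quote; it is taken from the diagram above, where rrn23 is on "+" in the IRB.

```python
from typing import List, Optional, Tuple

# Hypothetical input format: (gene name, strand), strand is "+" or "-".
Gene = Tuple[str, str]

_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Reverse-complement a plain A/C/G/T sequence (other symbols stay unchanged)."""
    return seq.translate(_COMPLEMENT)[::-1]


def _plus_minus(genes: List[Gene], prefixes: Optional[Tuple[str, ...]]) -> Tuple[int, int]:
    """Count '+' and '-' strand genes, optionally restricted to name prefixes."""
    strands = [s for name, s in genes if prefixes is None or name.startswith(prefixes)]
    return strands.count("+"), strands.count("-")


def needs_flip(region: str, genes: List[Gene]) -> bool:
    """Return True if the region should be reverse-complemented.

    LSC: more '-' than '+' among rpl/rps genes.
    SSC: more '-' than '+' among all genes.
    IRA: rrn genes on '-'.
    IRB: rrn genes on '+' (mirror of IRA, taken from the diagram).
    Ties are left as they are (assumption).
    """
    if region == "LSC":
        plus, minus = _plus_minus(genes, ("rpl", "rps"))
        return plus > minus
    if region == "SSC":
        plus, minus = _plus_minus(genes, None)
        return plus > minus
    if region == "IRA":
        plus, minus = _plus_minus(genes, ("rrn",))
        return plus > minus
    if region == "IRB":
        plus, minus = _plus_minus(genes, ("rrn",))
        return minus > plus
    raise ValueError(f"unknown region: {region}")


# Example: an LSC where most rpl/rps genes sit on '+', so it would be flipped.
lsc_genes = [("psbA", "+"), ("rpl22", "+"), ("rps19", "+"), ("rpl2", "-")]
lsc_seq = "AACG"  # placeholder sequence
if needs_flip("LSC", lsc_genes):
    lsc_seq = reverse_complement(lsc_seq)
```

Only the rpl/rps genes are counted for the LSC, so psbA in the example does not influence the decision.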
A default orientation for the three sequence parts would be good to ensure reproducibility.