This PR adds support for the items in https://github.com/lazear/sage/issues/37
- An optional `c_terminal` field is added to the enzyme struct, which allows N-terminal digestion
- `cleave_at` can be set to the special character `"$"` (i.e. the end-of-line anchor in standard regex) to turn off digestion and match the entire FASTA entry
- `cleave_at` can be set to the empty string `""` to denote non-specific digestion

Notably, `min_len`, `max_len` and `missed_cleavages` still apply to all of the above (however, `restrict` and `c_terminal` are ignored for non-specific digestion or no digestion). The following configuration will produce all possible peptides between 5 and 7 amino acids long:
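
To make the three `cleave_at` modes concrete, here is a small Python sketch. It is a toy model written for this description, not Sage's implementation: the `digest` function is hypothetical, `cleave_at` is treated as a plain set of residues, `restrict` is not modelled, and the `enzyme` dict only mirrors the field names mentioned above (the actual config file layout is not shown here). How `missed_cleavages` interacts with non-specific digestion is not modelled either.

```python
def digest(sequence, cleave_at, min_len, max_len,
           missed_cleavages=0, c_terminal=True):
    """Toy model of the three cleave_at modes (hypothetical, not Sage's code)."""
    n = len(sequence)

    if cleave_at == "":
        # Non-specific digestion: every window whose length is within bounds.
        # c_terminal is ignored here; missed_cleavages is not modelled.
        return [sequence[i:i + k]
                for i in range(n)
                for k in range(min_len, max_len + 1)
                if i + k <= n]

    if cleave_at == "$":
        # No digestion: the whole entry is the only fragment.
        boundaries = [0, n]
    else:
        # Enzymatic digestion: cut after each matching residue (C-terminal)
        # or before it (N-terminal, c_terminal=False).
        cuts = {i + 1 if c_terminal else i
                for i, aa in enumerate(sequence) if aa in cleave_at}
        boundaries = sorted(cuts | {0, n})

    peptides = []
    last = len(boundaries) - 1
    for a in range(last):
        # Join up to missed_cleavages + 1 adjacent fragments.
        for b in range(a + 1, min(a + 1 + missed_cleavages, last) + 1):
            peptide = sequence[boundaries[a]:boundaries[b]]
            if min_len <= len(peptide) <= max_len:
                peptides.append(peptide)
    return peptides


# Field names taken from the description above; values chosen for illustration.
enzyme = {"cleave_at": "", "min_len": 5, "max_len": 7, "missed_cleavages": 0}
print(digest("PEPTIDEK", **enzyme))
```

With `cleave_at` set to `""`, `min_len` 5 and `max_len` 7, the sketch returns the nine windows of `PEPTIDEK` that are 5 to 7 residues long. Switching `cleave_at` to `"$"` with the same bounds returns nothing, because the 8-residue entry exceeds `max_len`.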