epam / Indigo

Universal cheminformatics toolkit, utilities and database search tools
http://lifescience.opensource.epam.com
Apache License 2.0

Import/Export of variant monomers from Fasta/Sequence #2015

Closed olganaz closed 2 months ago

olganaz commented 4 months ago

Background: Sometimes users need to register oligonucleotides containing randomized or "mixed" bases. This means that a variant monomer can occur at a defined position. A variant monomer is a monomer that stands in for any one of a listed set of monomers (its variants).

Requirements: In addition to the requirements for import/export of Sequences (#1426) and Fasta (#1755), the following symbols should be supported for:

ljubica-milovic commented 3 months ago

In addition to X, the symbol that represents "any amino acid", the following symbols should be supported for peptides:

| Symbol | Amino acids it represents |
| ------ | ------------------------- |
| B      | D, N                      |
| J      | L, I                      |
| Z      | E, Q                      |
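
To make the table concrete, here is a minimal Python sketch of how the three symbols could be expanded into the concrete peptide sequences they stand for. The names `PEPTIDE_VARIANTS` and `expand_peptide` are hypothetical and chosen for illustration; this is not Indigo's API or its actual import logic.

```python
from itertools import product

# Variant symbols from the table above, mapped to the amino acids they represent.
# Hypothetical helper data for illustration only, not part of Indigo.
PEPTIDE_VARIANTS = {
    "B": ("D", "N"),
    "J": ("L", "I"),
    "Z": ("E", "Q"),
}


def expand_peptide(sequence):
    """Return every concrete sequence that a variant-containing peptide can denote."""
    # A symbol that is not a variant symbol stands only for itself.
    options = [PEPTIDE_VARIANTS.get(symbol, (symbol,)) for symbol in sequence]
    return ["".join(choice) for choice in product(*options)]


print(expand_peptide("ABZ"))  # ['ADE', 'ADQ', 'ANE', 'ANQ']
```

The number of results is the product of the variant counts at each position, so a real importer would more likely store the variant monomer itself than enumerate sequences; the enumeration here only shows what the symbols mean.
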
ljubica-milovic commented 3 months ago

For X (any amino acid), two more amino acids are to be added to the list in addition to the 20 amino acids listed above: X = [A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W, Y, O, U]
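
As a quick check of the list, the set represented by X can be written out in Python as below. This is only a sketch; the constant and function names are hypothetical and not taken from Indigo.

```python
# The 20 amino acids plus the two additions (O, U) named in the comment above.
# Names are illustrative assumptions, not Indigo identifiers.
STANDARD_AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
ADDITIONAL_AMINO_ACIDS = "OU"
ANY_AMINO_ACID = tuple(STANDARD_AMINO_ACIDS + ADDITIONAL_AMINO_ACIDS)


def is_covered_by_x(residue):
    """Return True if the one-letter residue is one of the variants of X."""
    return residue in ANY_AMINO_ACID


print(len(ANY_AMINO_ACID))   # 22
print(is_covered_by_x("U"))  # True
```
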

AlexeyGirin commented 2 months ago

Verified with issues.

Versions