Reclone-org / Open-DNA-Collections

2 stars 1 forks source link

[OEC] KOD pol cysteine #6

Open fxbuson opened 3 months ago

fxbuson commented 3 months ago

Describe the Problem

Protein has a cysteine residue inserted.

Sequence Data

KOD DNA Polymerase (BBF10K_003252):

ATGATCCTGGACACCGATTACATCACCGAGGATGGCAAACCGGTGATCCGTATCTTCAAAAAAGAAAATGGCGAGTTCAAAATTGAATACGACCGCACCTTTGAACCGTATTTCTATGCTCTGCTGAAAGATGATTCCGCCATTGAAGAGGTTAAAAAAATTACTGCTGAACGCCATGGCACAGTTGTTACCGTTAAACGCGTCGAAAAGGTCCAAAAAAAGTTCCTGGGTCGCCCGGTAGAAGTTTGGAAACTGTATTTTACCCACCCGCAAGACGTACCGGCTATCCGCGACAAAATCCGCGAGCACCCAGCGGTGATCGATATCTACGAGTATGATATCCCGTTTGCGAAACGCTACCTGATCGACAAGGGTCTGGTGCCGATGGAAGGCGACGAGGAGCTGAAAATGCTGGCATTCGACATCGAAACACTGTATCACGAAGGCGAAGAGTTCGCTGAAGGCCCGATCCTGATGATCTCGTACGCTGACGAAGAAGGCGCCCGTGTTATCACATGGAAAAACGTGGATTTGCCGTATGTTGACGTGGTATCCACCGAACGTGAAATGATCAAGCGTTTCCTGCGAGTGGTTAAAGAGAAAGACCCGGACGTTCTGATCACCTACAACGGCGACAACTTCGATTTTGCCTACCTGAAAAAACGCTGCGAAAAACTCGGTATCAACTTCGCCCTGGGTCGTGACGGTTCGGAACCTAAAATCCAGCGTATGGGTGACCGTTTCGCTGTTGAAGTAAAGGGCCGCATCCATTTCGACCTGTATCCAGTTATCCGTCGTACAATCAACCTGCCAACCTACACTCTGGAAGCGGTATACGAGGCGGTTTTCGGCCAACCGAAAGAAAAAGTTTACGCTGAAGAAATCACCACTGCATGGGAAACCGGCGAAAATCTGGAACGTGTAGCGCGCTACAGCATGGAGGACGCGAAAGTTACTTACGAACTGGGAAAAGAGTTCCTCCCGATGGAAGCACAGCTGAGCCGTCTTATTGGCCAGTCTCTGTGGGACGTTTCCCGTTCTTCTACCGGTAACCTGGTTGAGTGGTTCCTGCTGCGCAAAGCTTATGAGCGTAACGAACTGGCACCGAATAAACCAGATGAAAAAGAACTGGCACGTCGTCGCCAATCTTATGAGGGTGGGTATGTGAAAGAACCGGAACGTGGTCTGTGGGAAAACATCGTCTATCTGGACTTCCGTTGCAGCCTGTACCCAAGCATTATCATCACTCACAACGTGTCACCGGACACTCTGAACCGTGAAGGCTGTAAGGAATATGATGTTGCGCCGCAGGTAGGTCACCGTTTTTGCAAAGACTTCCCGGGTTTTATCCCGAGCCTGCTGGGTGATCTTCTTGAAGAACGCCAGAAAATTAAAAAGAAAATGAAGGCCACCATCGACCCGATTGAACGCAAACTGCTGGATTATCGCCAGCGCGCTATTAAGATTCTGGCTAATTCTTACTATGGCTACTACGGCTACGCTCGCGCACGCTGGTACTGCAAAGAGTGTGCTGAATCCGTAACCGCTTGGGGCCGTGAATATATCACAATGACCATTAAAGAAATCGAGGAGAAATACGGTTTCAAGGTTATTTATAGCGATACTGACGGCTTCTTTGCGACCATCCCAGGCGCGGACGCAGAAACCGTAAAGAAGAAAGCAATGGAGTTTCTTAAATATATTAACGCTAAATTGCCGGGCGCGCTGGAGCTGGAATACGAGGGTTTCTACAAGCGTGGGTTCTTCGTGACGAAGAAGAAGTACGCAGTAATCGACGAAGAAGGCAAAATTACCACTCGCGGCTTGGAAATCGTTCGCCGTGACTGGTCCGAAATTGCTAAAGAAACCCAGGCTCGTGTACTGGAGGCCTTGCTGAAAGATGGCGACGTAGAAAAAGCGGTTCGTATCGTGAAAGAAGTAACCGAAAAGCTGTCAAAATACGAAGTTCCGCCAGAGAAACTGGTTATCCACGAACAGATCACTCGTGATCTGAAAGACTACAAGGCGACGGGTCCGCATGTTGCAGTAGCCAAACGTCTGGCGGCACGTGGTGTGAAAATCCGCCCGGGCACCGTTATCAGTTACATCGTTCTGAAAGGTTCTGGTCGCATCGGTGATCGTGCGATCCCGTTCGATGAGTTCGACCCGACCAAGCACAAATACGACGCAGAATACTACATTGAGAACCAGGTGCTTCCGGCGGTCGAACGTATCCTGCGCGCGTTCGGTTACCGTAAGGAGGACCTGCGTTACCAGAAAACTCGCCAGGTAGGCCTGTCCGCATGGCTGAAACCGAAAGGCACCTGA

MILDTDYITEDGKPVIRIFKKENGEFKIEYDRTFEPYFYALLKDDSAIEEVKKITAERHGTVVTVKRVEKVQKKFLGRPVEVWKLYFTHPQDVPAIRDKIREHPAVIDIYEYDIPFAKRYLIDKGLVPMEGDEELKMLAFDIETLYHEGEEFAEGPILMISYADEEGARVITWKNVDLPYVDVVSTEREMIKRFLRVVKEKDPDVLITYNGDNFDFAYLKKRCEKLGINFALGRDGSEPKIQRMGDRFAVEVKGRIHFDLYPVIRRTINLPTYTLEAVYEAVFGQPKEKVYAEEITTAWETGENLERVARYSMEDAKVTYELGKEFLPMEAQLSRLIGQSLWDVSRSSTGNLVEWFLLRKAYERNELAPNKPDEKELARRRQSYEGGYVKEPERGLWENIVYLDFRCSLYPSIIITHNVSPDTLNREGCKEYDVAPQVGHRFCKDFPGFIPSLLGDLLEERQKIKKKMKATIDPIERKLLDYRQRAIKILANSYYGYYGYARARWYCKECAESVTAWGREYITMTIKEIEEKYGFKVIYSDTDGFFATIPGADAETVKKKAMEFLKYINAKLPGALELEYEGFYKRGFFVTKKKYAVIDEEGKITTRGLEIVRRDWSEIAKETQARVLEALLKDGDVEKAVRIVKEVTEKLSKYEVPPEKLVIHEQITRDLKDYKATGPHVAVAKRLAARGVKIRPGTVISYIVLKGSGRIGDRAIPFDEFDPTKHKYDAEYYIENQVLPAVERILRAFGYRKEDLRYQKTRQVGLSAWLKPKGT*

Follow-Up Actions (Assignees, Labels)

Locate cysteine residue and update wishlist for future correction

racalzadilla commented 2 months ago

Could you be a little more specific? There are 6 cysteines residues in the polypeptide you've posted, and the cistron above it translates identically to the polypeptide. Can't do much else, but I fished out their coordinates if that's what you needed.

For the polypeptide, the cysteines are at [222, 406, 428, 442, 506, 509]. The nucleotides corresponding to these codons in the cistron sequence are at: [(666, 669), (1218, 1221), (1284, 1287), (1326, 1329), (1518, 1521), (1527, 1530)], respectively.**

** Just in case, the coordinates are in Python notation, so the origin is zero-based and the intervals are half open on their right.

Hope that's helpful! Lmk if you need anything else sequence related. ✌️