molevol-ub / DOMINO

Development of molecular markers in non-model organisms
GNU General Public License v3.0

Issue with getting taxa to display when using a custom run and MSA file #6

Open plumbwj01 opened 6 years ago

plumbwj01 commented 6 years ago

Dear developers, I am trying to use DOMINO to identify mapping regions that can be used as unique identifiers for several different species of trees. Someone in my group has already generated multiple sequence alignments (MSAs) for each of the genes of interest, so I have been using the custom run option to bypass the initial stages and start at "Mapping/Alignment". However, when I get to the "marker discovery/selection" stage, no selectable taxa are displayed and I cannot proceed past that point. Do you know of a solution to this problem?

I am using MSA files in PHYLIP format, and I have attached some example data below in case the error lies there. Thanks in advance.

3 8807
Taxon1 ---------- -----AGAGA AATGGAGCCA AGAAAAGCAA TAGATGAATT
Taxon2 ---------- ---------- -ATGGA---C GAAAAATCGG TGGACGATAA
Taxon3 AAACAGATAG CCTGA----- -ATGGAGCCA GGGCTATCAA TCGAAGATCA

       GTTCTTAAAA TTTCATCCTT GTTTTGATAC AAACACAAAG ATAGGAATAA
       ATTCTCGAAG ATGCACCCTT GCTTTCCTCC ACGAACAAGG ATTGCAATAA
       ATTCTCAAAG CTGCATCCAT GCTTTCCTCC GGACTCGAGG ATTGCAATCG

       TAGGAGGTGG ACCTAGTGGA ACATCAGCTG CTTATGCATT GGTAAAACTT
       TCGGAGCGGG CCCCAGTGGA CTCTCGGCTG CTTATGCACT ATGCAAATTA
       TAGGAGCTGG CCCGAGTGGA TTATCTGCTG CTTATGCTCT TTGTAAACTC

       GGTTATACAA ATATCACTAT TCTTGAGAAG TATCATTCTG TTGGTGGCAT
       GGCTACTCCG ATGTAACTGT GTTCGAAAAA CATCATTCAC CTGGTGGAAT
       GGTTATTCCG ATGTAACCGT GTTTGAGAAA GAACGATCGC CTGGGGGCAT

       GTGTGAATCA GATGATATTG AAGGTATCTA T--------- ----------
       GTGTGAATCA GTTCAAATCG AAGGTAATAA TAATCTGCCG GAAAGAAAAA
       GTGTGAATCA GTTGATATTG AAGGTACAAA T--------- --------AG

       ---------- --GCTATTTA C--------- ---TTTTAGT CTGTTCGTAG
       AAATATCCAC TGAATATTAG ATTTTTTACA TAGATTTAAT T---------
       AAATAT---- --AGTATGAA A------AGA AAGTCTCAAT GTATACTAAG

       AAAGATG--- ---------- ---TCTTGCT TTCTTATTTA GAAACAATTT
       AAAAAAGTTC GA-------- -ATTAATGTG TTCTAACAGG AAGGATTTAC
       AAAAAGGCTC AAGCACGTCC CGTTTATGTC TTCTTACAGG AAGGATATAT

       AACTTTAAAA TTCTTATTTT TAATGACATA TTTGTGTATA GCTTGTTTTA
       GATTTGGGAG GTCAAGTTCT TGCAGCAAAC AGTGCTCCGT CTATATTTCA
       GATTTGGGAG GTCAAGTCCT TGCTGCAAAT AGTGCTCCTA GTGTATTTCA

       GACTACAAGT TTGAAT---- ---------- ---------- -------ATT
       CTTGGCTAAA GAAGTCGGTG CAGAAACTGA GGAAATGGAC ACACATAAAT
       CCTGGCAAGA GAGATTGGAG CCGAAACAGA AGAAATGGAC ACTCATAAAT

       TTTTATTCTT GAACTCCGTG CCTGG----- ---------- -----TTATA
       TTGCACTTAT TAACAGTTCA AATGGAGCTC TCACCGAAAT GAAGTTAGTC
       TTGCACTCAT AGACAACTCG ACTGGAGATC TCAGAGAAAT GAAACTTGTA

       -CACGCTCAC ATAT------ ---------- GTTGAATTAT CCCGAGTCGG
       -GAAGATTAT GTATCCATGA TCTCTCTTAC ACTCAA--AC TCCAGGTCAG
       -GAAGATTAT GTATCTGTGA TCTCTCTCAC ATTGAA--AA TCCAGGTCAG

       GG-GCGTGAG GGAACTTGTC TCC------- -CCTGTCCAA TCATCAGCTC
       -ACATCTTCA GACACACACA GACAAAAAAA AAAACTGTGA TTTTCAGCT-
       TACGCCTTAC AAAGTATATA T--------- -AGTTTTTCA CCGTAAGCTC

       ---------- ---AGTTAGT GACATTGAGA CGGACCTAAG AT----ATCC
       ----ATTTGT AAA------- --CTGCAGGA TAAGGCTAAA GAATCGATTG
       GACCATTTCT GATAGCTCCT --CTACAGGA TAAGGCAAAG GTGTCTATTG

       ATCTTCTTAT CATG------ ---GTATCAG AGTCGTGACT GTCCAATTCT
       GAGTACACGC AGTGAGTGAA ATTGCCTCCG ATT--TAACT CCCGATTATC
       GGGTACACGC CGTGAGTGAA ATTGCCCCAG ATT--TAACT CCCGAATACC

       T--------- ---ATTTCAC TCGAT----- -----GTCGT TTATGGGC--
       TAAAAAATCA AGGATTTCCT TCGGTGCCTA AATCAGTAAT ATACGGATAC
       TCAAGGGTCA AGGATTTCAA TCTGTTCCTA AATCAGTAAT TTACGGATAT

       ---GTGTGAG GGTGTTGAAT TGTCCCAAAT AGG------- ------TGCG
       ACAGCATCTG GATATGGATA CGTGCAAGAC ATGCCTTACG CATACATACA
       ACTGCATCAG GCTATGGATA TGTTCAAGAT ATGCCTTACG CTTACATTCA

       TGGAGGCAGT TTGTCCTC-- ---------- ---------- ----------
       CGAGTTCACA AGAACATCCA TGGCTGGAAA AATCCAACGG TTCGAAGGCG
       TGAGTTCACC AGGACTTCAA TGGCTGGAAA GATTCGCCGG TTTAAAGGAG

       -TTATATGAT TCTTT----- ---------- ---------- AATTTTT---
       GTTACACGAG CCTATGGAAT AAGCTGAGTC AAAGGCTGCC AAATTCTGTT
       GTTACATGAG TGTTTGGAAG AAACTGAGTG AATACATGCC AAGTTTTTTC

       ---------- --TATTATCA ACATAA---- ----ATTGAG GATGGGTGGC
       CCAACACAGA AGTACTATCA GTTAAACGCA ATTCATCCGA AATCAAAGTT
       CCAACACCGA AGTACAATCA ATAAAACGCA GTTCATCTAG CATCAGAGTC

       GG-------- --GGGTGGGG GGGT----AA TTTATA---- ----------
       GAATTCAAAA CTGAGAATGG CGGTGTAAAA TCGAGAGAAT TCGATAAAAT
       GATGCTAAAA ATGAAAGGGG TGATGAACAA GTTATGGAGT TCGATAAAAT

       ---TATGTAG TATGTTGTC- ------ACAC TAAAAGATTT TACGGA----
       TATTATCTCC GGTGCTTTTC CGTTTATCAG TGGAAATACT TACAGATCAC
       CATTATATCG GGTGCTTTCC CTTTCAACAA CGGAAAATTT TATAGATCAC

       ---------- ----ATTC-- TGTACACTAA CAGTATAAAG AAT----TTT
       CTTCACCGAA CACAT----- ---ACATAGA T-ATATCATA TATGTTAATT
       CTTCTTCGAA CATAATTCAT TTCACACTAA T-GTACAGAA AAT----ATT

       CCACTAT--C AATTT----- ATTTGAACG- TGTTGTA--- ----------
       TTATTATAAC ACAAAAAGTT AGCTAATTGC TTTTCTCTGT GCAGATACTG
       CTAATATTTC CGTCTGGATG ATTTGATTG- TTTTCTGTCT GCAGAATTCT

       ---------- -CATACGTAC ATAA------ ---------- ----------
       CGAATAATAA CCGTATTGAC TTAAGCGAGC TGGAAAAAGA GTTGTTCAGT
       CGAGCAAT-- -CGTATGGAC ATGAGTGAGC TCGAAATGGA ATTGTTTAGT

       CAAGTACAAG TAGTCG---- ------TTCA TGAAG----- -------CTA
       AAAGTCAAAA CCATTGACTA TTACACGTCT TGAATAGGGA TTTGAGCATA
       AAAGTACAGA CTATTGACTA CTATACGTCT TGAAGAGGGA TTGAAACATA

       GTCCCA---- TTTTTCTTAT GT-----AGG TTAC-----A TGTAGCTACT
       TTCCAAAGGG TTTTTATTAC TTCGACGAAT TTATGAACGA TCCATCTACT
       TTCCAAAGGG TTTCTATTAC TTTGACAAGT TTATGGATGA TCCGGCAGCT

       T--------- ---------- ---------- ---------- -TATCGTCGT
       AAATCCTGTT GCAATGCAGA GATTTTACGG AGACACGGAT ATTTTCCTGT
       AAACCCTGTT GCAATGCAGA AATTCTATGA TGACACTGAC ATATTTTTGT

       AACATTCTTC TA-------- -TCAGTGTAT ATAAGTTAA- ----------
       TCTGGTCTTC TCGGCTAATA TTCAGGAACA CGAGGTTGCT GAGCTGGCAA
       TTTGGTCTTC TCGGCTGATA TTCAGGGATC AAAAGTCGAG GAGATGGCGA

       -------ACT CTTAAACATA TGTGTTACGA GTCCAGGAAA TA--TTTACG
       TTTCTGCGGT CAAAAGAATG GGAGGT-GAA GTCGAAAGAG TGATTTTACA
       TGAATGCAGT CATAAGTATG GGAGGC-CAA ATCGAAGAAG TGGTTTTACA

       ACTTGACAAG TTCTTGCTGC ----AATAGT GC-------- ----------
       GC--TTCAAG TACTTCCCAC TGTGAAAAGT GAAGGTAATA TCTTGTAATA
       AC--TTCAAG TACTTTCCTC TGTCAACAGT GAAGGTATG- ----------

       -------CTA GCA-AAAAT- ---------- ---------- ---------A
       AGAAA--CGT TCGGA----- ---TGTACTA ATTTCGGGAA AAA------A
       --CGTA-CTT GCAGAATATT CCATGTAATT ATTACTAATC CAAACTCCAA

       GAGCTGAAGT TGAGGAAATG GACAACCATA AGTTTGCCTT CGATCT----
       AAACAGATAT GAAGGAGGGG TACTATGATA AGCTGGAATT TCTACTACAA
       AAGCAGAAAT GAAGGACGGG TTTTACAATA AGCTCGAGTT TCTACTACAG

       --TGATTGAA AATCATA--T CGATGGCTAC TGTTACTGGT CTTTCTGTT-
       GGTCAAAGAA ACACCTACTT TGTTGG---- ---TAATGGC ATTTGAGCTG
       GGTCAACAGA ATACATACTT CGTCAG---- ---TCATGGC ATTTGAGCTG

       ---------- -CCCGAATTA TAATATATGA ---------- -----TGTTC
       ACTGAGAGGA ATGCTAGTTA CGCATTTGAC CTCGTTCGAA AGCATTTCTC
       ACAGAGAGGA ACTCCAGCTA TGCCATGGAA CTCGTCCGAA AGCATTTTGC

       TATTAATGAT TTTATTCATA GTACTTCATG GAATTTGGCG TAAAGCC---
       TAACGATAAT CAAGAACCGA GCTATCCCTA TG--TCAAGG T------ATA
       AAATAATAAT CCTGTACCAA GTTATCCATA TG--TCAAGG TGAGGTCATA

       ---------- ---------- ---------- ---------- ----------
       A--------- ---------- ---------- ---------- ----------
       GATAAG-AAA AAGA------ ---------- ---------- ----------

       ---------- ---------- ---------- ---------- ----------
       ---------- ---------- ---------- ---------- ----------
       ---------- ---------- ---------- ---------- ----------

       ---------- ---------- ---------- ---------- ----------
       ---------- ---------- ---------- ---------- ----------
       ---------- ---------- ---------- ---------- ----------

       ---------- ---------- ---------- ---------- ----------
       ---------- ---------- ---------- ---------- ----------
       ---------- ---------- ---------- ---------- ----------

       ---------- ---------- ---------- ---------- ----------
       ---------- ---------- ---------- ---------- ----------
       ---------- ---------- ---------- ---------- ----------

       ---------- ---------- ---------- ---------- ----------
       ---------- ---------- ---------- ---------- ----------
       ---------- ---------- ---------- ---------- ----------

       ---------- ---------- ---------- ---------- ----------
       ---------- ---------- ---------- ---------- ----------
       ---------- ---------- ---------- ---------- ----------

       ---------- ---------- ---------- ---------- ----------
       ---------- ---------- ---------- ---------- ----------
       ---------- ---------- ---------- ---------- ----------

       ---------- ---------- ---------- ---------- ----------
       ---------- ---------- ---------- ---------- ----------
       ---------- ---------- ---------- ---------- ----------

       ---------- ---------- ---------- ---------- ----------
       ---------- ---------- ---------- ---------- ----------
       ---------- ---------- ---------- ---------- ----------

       ---------- ---------- ---------- ---------- ----------
       ---------- ---------- ---------- ----CATATT TTATTACAGC
       ---------- ---------- ---------- ----CTAATC TTATTTTAGA

       ---------- -----TTCCT AAGATA---- ---TGTTTCG GACTCTGCAA
       ATGGATAT-- -----GTTGG CATTGATTTT AACAACGCAT GCTCATGCAG
       ACCAATGG-- -----CTT-- ---------- -----CTTCT GGTATTGCAG

       AA-------- ---TGCCGTC AGGT------ -GCATATTAG ATCCT---CC
       AGATTACTGA CATTGAAACC GAACAACGGA GGCTCAGTCG TTCCCAGACA
       AGACTTCTGA TGTTGAAGCC AAATC----- -GTACATTAG TACCTAAGCA

       AATAG----- ---------- ---------- ---------- -----TAGCA
       ATTGGACGAA TCGCCAGGTG TCCAGTTTCC AGAACTTTCC ACCCTCGACG
       ATTAGATGAA TCATCTGGTG TCCAGTTTCC CGATCTTTCT TCCCTTGATG

       CATTTTTAGA GAATCGGACA CG----GGTG CAGC------ ----------
       CATACTTGAA ACATTGGGGA ACCCATAGTG TAACTCAAAG CAAGACACTG
       CTTACTTGCA ATACTGGGCA AGTCACAGTG CAACACAAAG CAAGACACTG

       ---------- ---------- --AACATTTT GCAGAGTCCA AGCAACATAT
       TACACATGGA TCAACGAGAA GGGGCAAATC GTAGCA--CA AAGAACGTAC
       TATACTTGGA TCAACGAAAA GGGACGCATT GCAGCG--CA AAGAACATAC

       ---------- ---------C TCCTTAGTAG TCCAAATAAT TTCCGAGTCA
       AAAGAACTTA ACTCAAACGC TTCAAAGATC TCCGAAAAAC TAACGAGCTG
       AAAGAACTTC ACAATAATGC TTCTGAGATT TCTTACAAAC TCTTGACAAG

       CGAGTAGTAA ACCTTCAAAT AATATTTAAA AGTACT---- ----------
       TAACAACCCG ACAATCAAAA CAGGCGATAG GGTTCTCTTG GTTTATATAC
       CCAAAAACCA ACCATCAAGC CTGGTGACAG AGTGCTCTTG GTCTACATCC

       ---AACTTTA TTACTGTGAA TTTTCCT--- ---------- -----AAATA
       CTGGCCTAGA TTTCATCGAT GCCTTCTTCG GTTGTTTGAG AGCCCGTGTA
       CGGGTCTAGA TTTTGTTGAC GCCTTCTTCG GGTGTCTGAG GGCTGGAGTC

       TTACCTGTTA C--TGATCCC TTTCTGCAGT AAGCAACAGC AGAAGAATCG
       ATACCCGTGC CCGCAATTCC ACCC------ GATCCGCTGC AGAAAACCGG
       ATACCTGTTC CAACAATTCC TCCC------ GAACCATCAC AGAAGGGTGG

       ---ACATGAA TAGCCTCGAG GAGGAGTTGT TCGGTAAAGT ACA-------
       GCAACAAGCA CTTCTCCACA TTTCCAACAT T-TCCAAAGC ATGCAATGCA
       ---ACAAGCA TTACTTCATG TGGTTAATAT T-GCTAAATC ATGCAATGCA

       ---------- ----AACTAT TGACTATTAC ACCATAGT-- ----------
       GTTGCCATAT TATCAACTGT TAGTTACCAC ATCGCGGTCA AGGCTTCTTC
       GTGGCAATTC TGTCAACACT TAGTTATCAC ATAACTGTTA TGGCAGTTTC

       ---------- ---------C TTGAAGATAA C--------- ---------T
       GGCAAGAAAC ATGCTGTCCC TTAAGGGAAA AGACAAAAAT ACCCCTCGCT
       TGCAAAAAAC ATGGTATTTC TTAAAGGAAA CAACAAATGC TCCCTACGCT

       GGATT----- ----TGATCA TATACCGATG GGT------- ----------
       GGCCCGATCT GCCGTGGCTC CACACCGATT CTTGGGTGAA GAAATCAAGA
       GGCCTGATCT GCCATGGTTG TATACAGACT CTTATGTCAA GAAAGCCAAA

       TTTTACTACT TT---GG--T CACCAAAAGC CATCGGGAAT CCAGTTGCAA
       ATCACCATGC AAAACGGCAG T---ACGAGT CTTTGCCTCG TGATTTGTGC
       TTTTCCTCGT CA---GGTGG TACTATGAAC CTTTATCAGA TGACTTGTGC

       TGCAGAGATT TTACA----- --------AT GACACGAACG TCTTCTTGTT
       TTCCTGCAGT TCACATCGGG GTCCACAGGA GAGCCGAAGG GAGTTATGAT
       TTTCTCCAAT TTACATCAGG CTCTACAGGT GACGCCAAAG GAGTCATTAT

       TTGGTCTTAT GGTAACTCGG TCGACATAGT AGGATTAA-- ---AGGAGAC
       ---AACTCAC GGTGGAATAA TCCACAATGT GAAGACGATG AGAAGCCAAT
       ---AACTCAT GGTGGGCTCA TTCATAATGT AAAGTTAATG CGAAGAAGAT

       C---GAGCTC CTCATAAAAG GTGTT-GAAA GT-------- ----------
       ACAAGAGCAC CTCGAATACG GTACTTGTGA GCTGGCTGCC ACAGTACCAC
       ACAGGAGCAC ATCAAACACA GTTTTAGTAA GTTGGCTGCC TCAATATCAT

       ---------- --ATGGGAGG CTTTGTTGAA AA-------- --AGTCGTTT
       GATATGGGAC TCATCGGTGG CCTTTTCACT AGCATGGTCA GTGGCGGTTC
       GACATGGGAC TCATTGGAGG ACTCTTTACA AGTATGGTGT CTGGTGGATC

       TGCAA----- -------CGA AAGC-----T CAAATATTTC C-CTCATGTT
       CGCAATTTTG TTCTCTCCGA CGGTGTTTAT CCGGAACCCT TTATTATGGC
       TGCAATTTTG TTTTCTCCAC TGACATTTAT CAGGAATCCC CTCTTATGGC

       AACA------ --------AC CAAGGTACTC ---------- ----------
       TGCAGACTAT AACCACGTAT CGCGCCACAC ACAGTGCGGG CCCCAATTTC
       TGCAGACTAT CAGCACATAC CATGCGACTC ATAGTGCTGG CCCCAATTTT

       ----CTGAAA ---------- ---------- ---------A GACTGCGGAA
       GCGTTCGAGC TGCTTATTCG TAGGCTAGAA GCCGATGAAA AAGGGCAGAA
       GCCTTTGAAC TCCTCGTTCG CAGATTAGAG TCAAAC---A AATCACAAAG

       TTTTCATTT- CAGTTTATGT TTTATGCACT CACAATCA-- --TGAATCA-
       GTTCGATTTG TCTTCGATGG TTTTT---CT CATGGTTGCT GCAGAACCAA
       TTTTGATTTG TCCTCCCTAA TTTTT---CT CATGATTGCT GCTGAACCAA

       ---------- ----CTTATA AGT---CT-- -----ATGGC TAAACTACTT
       TCAGAGCAGA AACTATGAGA AGG---TTTC TGCTGCTGAC TCAGCCTTTC
       TAAGAGCGAC AACCCTTAGA AAA---TTCC TTAAGCTGAC TCAATCTTTT

       TGAATCTTTT ---------- ---------- ---------- ----ACATTG
       GGGCTCGATC GGGGGGCCAT GGCCCCCGGC TATGGATTAG CAGAGAACTG
       GGACTCTCTC AAGAGGTATT ATCTCCTGGT TATGGATTAG CAGAAAACTG

       TCGGTGTGTA TAACTAATAT ATTTTCATTT TAAGATTTGA ACTATTATCG
       C--GTTTACG TGTGCAGTGC GT----ATGG AGAAGGTGAA GAGATGTTGG
       T--GTGTATG TAAGTAGTGC AT----ATGG AGAGGGTAAA GAGATTATGG

       TTCTATT--- --GATTGTGT GCTTCGAAAT GACAGATATG AAG-----GA
       TT-GATTGGA ATGAACGAGT TTGCTGTGGC TACATTGATA ACAATGATGA
       TT-GATTGGC AAGATAGGGT GTGTTGTGGT TACATAAATT CAGATGATGA

       AGGCTTTTAT G-AGAAAGTG GAAAATC--- ---------- --AACT----
       TGAAGTTCAA A-TCAAGATT GTTGATCAAG AAACGGGCGA AGAGTGTGAA
       AGATGTTGAT A-TAAAAATA GTTGATCCCG AGACTGGAAT AGAGCT----

       -------ACA AGGTCAACAA AACACGTATT AT-------- GTTGGCGGGT
       AAATCGGAAA AAGAAGGGGA GATTTGGATT AGTAGTCCAA GTAGTGGAGT
       --ATCTGAAA AAGAAGGGGA GGTATGGATT AGAAGTCCAA GTTCAGGAGT

       TG------AT GGCATT--CG AGTTGA---- ---------- ----------
       TGGATACTGG GACAATGAGG AGTTGACCAA AACGACCTTC GAAAACAAAC
       TGGGTACTGG GGCATGGAGG AGTTAAGCGA AAAGACTTAC AGAAATGAGA

       ---------- ---CAGAAAG GAATGCAT-- ---------- CTTACGCCAT
       TGAATAGCGC CCATGGCAGG AAGTACATTC GAACGGGGGA TTTGGGAAGA
       TTTATGACCA TCCAGGGAAG AAATACATGC GAACTGGAGA CTTGGGACGA

       GGCTCTAATG CACAAGCATT --TCGCTAGT AA-------- ---TAATCCT
       ATCGTGGACG GAAAGTTGTT CGTCACGGGC CGTATAAAGG ATCTCATAAT
       ATAATTGATG GAAAATTATT CATCACTGGG AGAATAAAAG ATCTAATCAT

       ---------- ----ATACCA GCCTTCCCAT ATGTTAAGGT AGGCTACTTC
       TGTTTCTGGA AGGAATA-TA TATTCGTCAG ATATCGAAAA A---------
       TGTTGCTGGA AGGAATA-TA TACTCCTCGG ATATTGAAAA A---------

       AGAGTTCAGA TATCTTCTTT TTTTTGATGT TATGGAT--C GTGTTACAGT
       ACCGTCGAGA ATTCGTGCCA GCTAGTACGC CCTGGATGCT GTGCTGCAAT
       ACAGTTGAGA TCTCATCTGA ACTTATACGA CCTGGCTGCT GTGCTGCTGT

       TTCTCGATTT CATTATTTAA TACCCTATCA AAA-GGGATA T--CATTTCC
       CGGT--GTTC CGAAGGAGAT ACTTCTATCG AAAGGGAATA TTCCATTTCC
       TGGT--GTTC CAAAAGAGAT TCTTATGTCT AAA-GGAATA T--CAGTTCA

       TAAAGGTTCT GATCAACTTG GATTGGTTGT AGTCGCAGAA GCTTAT----
       GGAAACTTCC GATCATGTTG GATTGGTGGT GATTGCCGAG GTTCGAGAAG
       AGAAATTACT GATGAAGTTG GATTGGTAGT AATTGCGGAG GTTCGAGAAG

       -----AAATC ACTTCCTTGT GAAGTTATTA AACGTATCTG TACTCGTGTT
       CAGTCAAGTC CGGTTTTAAG GAAGCTGTTA AACGAATTCA GGCATGCGTT
       TA---AAGCC GGCTCTGAAA GAAATCATGA GACATATTCA GACATGTGTT

       GTAGAAGAAC ATGGTGTATC TATTGCCTCC ATTATGACTA TCAAACCGAG
       GCGGAAGAAC ATGGGATCAT TGTCTCCTCC GTTATTCTGA TCAAACCGAG
       GCAGAAGAAC ATGGCATTGT TGCTGCTTCG ATTGTGTTGA TAAAACCAAG

       GTCCATTAGT AAGACCACGT CAGGGAAAAT CAGGAGATTT GAAGTAGCTA
       GACCATTAGC AAGACGACAT CCGGCAAAAT TAAACGGTAC GAATGTCTTC
       ATCCATTAGT AAGACAACAT CAGGAAAAAT CAAGAGATTT GAATGTCTCA

       AAAGATTTGT TGATGGAACA TTGAGTGTGG TGGAAGACGC AACCGATGGA
       AAAAGTTTAA AAACGGAACT TTCGATGTCG TTCATCAG-- ----------
       AAAAGTTTGC TGATGGAGCT TTGAATATTG TCGACCAGCT TGTTGCAGGA

       GAAAAATCAT CACGGGAGTC TA---AAGAT GGTTCCGTAC CATAAATCAC
       ---------- ----TCCA-- -ACGGAAC-T GTATCGGAAA ATCGAGCCGA
       GAAAAATTGC CAGATCCATC CACGGAACAT ATATTGCAAT CTCAAATCAC

       AAGAAATTTA GCTCTGGG-- -AAATGATAA AATTAGCAAG GGACAGATCA
       GAAAAATAAT CATCCAATTC CGAATTCTGA TATAACAAAG AGTGACATAG
       GCCAGATCAC CCTTCAGT-- -AAATTTCAG TATTAACAAG AGGGATATAA

       TAGAATTTCT GAAGGAGCTG CTCTATGAGA TGACGGAGGT TCTCAAGTCT
       TGAACTTTCT GATCGAGCTT CTCTCTCAAA TGACCGGAAT TTCTACAGCA
       TCAATTTTTT GATGGAGCTG CTCTCTGAAA TGACTGGAAC TTCTACTGTT

       AAAATCTCCA TCACTGAGAG TCTGGTGTCT TATGGTGTCG ACTCAATTGG
       AAAATCTCCA CCAATGAAAG CCTTGTATCT TATGGGGTGG ATTCCATAGG
       GAAATGTCGG CAACTGAAAG CCTTGTATCT TATGGGATTG ATTCTATTGG

       CGTTGTTCGA GCAGCTCAGA AAATTTCAAA CTTTCTCGGA GTTCTAGTTG
       TGTCGTTCGA GCCGCTCAAA AGCTCTCCGA TTTTCTTGGA GTCCCAGTTG
       TGTGGTTCGA GCAGCTCAAA AACTTTCAGA TTTCCTCGGA GTTTCAGTTG

       GAGCGATAGA CATCTTCTCA GCAACCTGCA TTGAAGACTT AGCAGACTTT
       GCGCAATCGA CATATTCACA GCGACATGCA TCGATGACCT GGCGAAATTT
       GTGCAATTGA TATTTTTACA GCAACCTGCA TCGATGACTT GGCAAATTTT

       GCTGAAGATC TTTTGAGAAA ATCCCACCCC GAGTTGGTCA CAACTACCTC
       TCAGACAACC TATTAAAGAA GTCTCGTCCT AAATCAG--- ----------
       GCAGAGAATC TTCTAAAGAA GTCCCGTCCT CAATCAGTAG CCAGTTCAAC

       CGGTTCTT-- ---------- -G----TGAA AGCAAGATGA GC----TCAA
       --------CA AAAAC----- -GAAGGCGAA AGTACCATCA ACAAAAACAA
       CCATTCAACA GAGAC----- -GAAATCAAA ATCAAAATCA AC---TTCAG

       CCA------- ---------- ---------- --------TG GTTTTTCCAA
       CCTTTTCCGA AGCTTCTTCG TCAAGAAAGC TGCAAATCTG GTTCATGCAA
       CTATTCTCAA GGCTTCTCCA ACTCATAAGC TGGGTATTTG GCTGTTTCAG

       CGTTTAGCTC TTGCTTATGT CTCTTTCATG CTGATATTTC CTGCGTGCAT
       ATCTTAGCAC TTGCTTACGT TTGCTTCTTG CTGATTTTTC CCGCTTACCT
       CTCATAGCAC TTGCTTATGT TTCCTTCTTG CTGATATTTC CTGCATATCT

       ATCGGTTTCC ACTTTCAAAT TTTTCATTTC TGCATGCCAA ATGCTGATAC
       CTCGATTTCC GCCTTTACGT ATT-----CT TACAAGCCAC AGAAAAATAC
       ATCAGTTTCT ACATTTACAC ATTGGATATA TGCTGGCCAC AATCCAATGC

       AATCAGCCCC TTGGTTTGGT TACATTGTTT CCTTAGTATG TGCTCCACTT
       ATGCA----- -----CTGCT TACTTGATTA GCCTATTTTG TGCTCCGCTT
       ACACAAGTCT CTACTTCGGT TATTTGATTT CTTTAGCGTG TGCTCCTCTT

       TCTTGGATAT TATGTG---- ---------- -----CATAG GTTTTCTTGG
       TCTTGGATGC TGTGCCTTTT CTCGACATGC ATTAGCATAT CCTTTTTGGG
       GCTTGGATAC TATGCATTTT CTTCACTAGC ATTGCAATAG GATTCTTTGG

       AAAGTCATTT CTACAACCAA ACTATATTTT GAACCCTGAG GTGTCCATCT
       GACACCATTT CTTCAGACAA ACTACGCCTT ATTTCCCGAG GTTTCGATTT
       GAGTCCATAT CTGCAACCAA ACTATGCATT GTTCCCTGAA ATGTCGATCT

       GGTCTGCTGA TTTTGTCAAG TGGTGGACAC TTTATAAGGC TCAAGAAATA
       GGTCTACCGA ATTCGTGAAG TGGTGGGCCC TGCACAAGGC TGAGGAGGTT
       GGTCAGTTGA ATTTGCCAAG TGGTGGGCAC TTCACAAGGT TCAAGAAGTT

       TCTTCAAAAG TCTTAGCAGT GCATTTGAGA GGAACGGTAT TCATCAATTA
       TCTTCGAAAG TTCTTGCAGT TCACTTAAGA GGCACTGTGT TCCTCAACTA
       TCTTCGAAAG TTCTTGCAGT TCACCTGAGA GGCACTGTGT TCATCAACTA

       TTGGTTCGAG ATGCTCGGGG AAAATATTAC ATCATCTCCT ATTCTTGATA
       CTGGTTCAGA ATGCTGGGAG CTAAAGTCGC CTCGTCAGCA TTAATCGACA
       CTGGTTTAGG ATGCTAGGAG CTAAGATTCC ATCTTCTGCA GTGCTCGATA

       CTGTCGACAT CACAGATCCA TCACTAGTTT CTATTGGAGA GCAAGCTGTG
       CCATCGACAT CACGGATCCA TTTCTAGTTT CGATCGGAGA AGATGCTGTG
       CAATTGACAT CTCTGATCCA TTTTTAGTTT CGATTGGAGA AGAAGCTGTA

       ATCAATGAAG GAGTATTGCT TCAAAGTCAT GAAGTGAAAA ATGGTGTCTT
       CTGGCAGAGG GAGCGTTAAT CCAAAGCCAT GAAGTGAAAA ATGGAGTTTT
       CTTGCAGAAG GAGTACTGAT TCAAAGCCAT GAAGTGAAAA ATGGAATCTT

       GAGTTTCAAC CCGATAAGAA TCGGCCAAAA ATCGTCGATT GGTCCTTATG
       GAGTTTCTAT CCAATGAGAA TCGGATCAAG ATCTTCGGTG GGGCCCTATG
       GAATCTATAT CCTACCAGAA TTGGTTCAAG ATCTTCAGTG GGACCGTATG

       CTGTTGTTCA AAGAGGAAGC ATTGTGGAAG ATGGTACTCA TGTGCTTGCT
       CTTTGCTTCA GAAAGGCGTC GAGGTACATG ATGGAGACGA AGTACTTGCC
       CTCTAATTCA GAAAGGCGCT GTGGTTGAAG ATGGAGCTGA AATTCTCGCC

       CTCAACACCA GC-------- -AAGACTGCA GTTAA---TG TTAAGGCTAC
       CTCA------ ---------- CAA--CTGAG GTTAGTACTT CTGATGTGGA
       CTGAATTCTG GTGAAGGAGG CAACTCTGAG GCCAAGGTTT TTAATGCTGA

       AAGCACCAAA AAGGTCAGT- --AGAATTAG AA-------- ----AAAGAA
       TCATTTCCAA A--------- ---------- ---------- ----------
       GAGTTTTCAA AAGGTCAGTC --ATAACGAA CATCTATATG CTCCACAAAA

       ACTTTCATCA ACT------- -TAGTTGCTG GACTCTTGAA AAAAG-CTTT
       ---------- ---------- ---------- ---------- ----------
       G-TTTTGATT ACT--TCTCT CTAGAATATC TACTAATAAA ATGCA-TTTT

       AGACTGA--- ---------- --CACATTTC TGT------- ----------
       ---------- ---------- ---------- ---------- ----------
       TTATTGAATT CACACCCATT G-CACTTTGA TATGTAAGAC AGAATTGCCT

       ---------- ---------- ---------- ---------- ----------
       ---------- ---------- ---------- ---------- ----------
       AAAACGGAGC ACAATTTACG AATAATGAGT AGGTTAAGGA TCAAATGGCA

       ---------- ---------- ---TTTCTAT CATGATTACA GGACAAATCG
       ---------- ---------- ---------- ---------A GGGTAAAATG
       TTGACTAACC ATTGATTT-- -CATTTCTAT CACAACCACA GGGAAAAACG

       ACACGAAAGA GCATGAAGGG GAGTCATGGA AAT--CATGA ACAT-ATTAA
       GGAAAACAAA TCATCAGCAA AAACTGTGGG AATTATGCCA TGGTAATT--
       GTACAGCATA GCCTCCACAA AAATAGTGGA AACTCTGTAA CCATTATTCA

       ATATTCTC-- -CATATACAT GGTCGGCTTT CTCAGCAGCT TATCGGCAGC
       -ACTTTCTCG GTATATATAC AATCGGTTTT CTTGGCAGTT TATCGGCGGC
       GAGTTTTTTG GCATCTACAT GATTGGTTTT CTCAGCAGCT TATCAGCAGC

       TGTTTCTTAC TTCATTTACC TTTCGCTATG GCAAAACCCG CCTTCACTAA
       GGTTTCTTAT TTCGCCTACC TCTATATAAT GCAAAAGCCT CCTATTATGC
       GATGTCTTAT TTAATCTACC TCCAGATAAT GCAAAAACCT CCCTCACTTC

       AACATTTCAG GTTCTTTTAC TTAGCTGGAG CCTTTCATTG GTTTCCATAC
       ATCACTTTGC ATTCATATCT GTTTCTGGAG CATTTCATTG GCTACCTTAC
       ATCATTTTTC TTTCTTCTGT TTATCTGGAG CCTTCCATTG GTTACCGTAC

       ACGATCGCCA CATATGCTAC ATTGTTTGAC AATTCCTATT CAGGTTCATT
       ACCATAGTTG CATATATAGT AATACTCGAT TCTGCCCCTT CAAATCCGAT
       ACGATTGTTG CATGTGCAGT TATGTTTACG AGCATCCCGT CAAATCCGAT

       CACTTTTGCT ACCACTATTG CCATTTTTTA CATATGTC-- --ATGGTCTT
       CGCCTTTGCA ATCTCCATAG CCA----CAG CTTACACAAT TTACGGCCTT
       CGATTTTGCT GTAGCCATAG CAA----TGG CATATGCCAC TTATGGCCTT

       ATCCTTAGCT TATTCACTTG TTTGGTAAAC CATGTTGTCC ATCAAAAAAG
       GTCCTCAGCT TTTTCACTTG CTTCCTCAA- -ATCATACTT AGAAAAAACA
       CTCCTCTGTT TCTTCACTTG CCTCTTGAAC TACTGTATTT CGGGAAAACA

       AAATGGACAT GATG------ --AAGACTTG TCTTGTCCAT CGTGTCAACG
       AGACTTGTCC GAAAAACCCT TCCGAAAATG GCTAATACAC AGAATTTTGA
       AGATCTGTAC AAAGCACCGT TCAGAACTTT TCTTCTTCGT CAAATTATGA

       TTGCATGTCA CATAAGGTTC ACAAAATTCA TGTCTGGAAC AGAGTTGTTC
       CATCATGCCA CGTCAGATTC GCGAAATTCA TCTCTGGAAC AGAAGCTTTC
       TCGCCTGCCA TGTAAGATTT ACCAAGTTTC TTACTGGAAC AGAAGCTTTC

       TGCATTTATT TACGTCGACT AGGTGCAAAA ATTGGCCAAC ATTGTTCCAT
       TGTTTTTATC TACGCCAAAT GGGAGCCAAA ATTGGTAATC ATTGTTCGAT
       TGCGTGTACT TGCGCTGTCT TGGAACAAAA GTTGGGAGGC ATTGTTCAAT

       CAGAGCCATT AACCCCATTT CAGAACCAAA CCTCATCTCA ATAGGCAATG
       CAGAGCTATT AATCCCATTT TAGACCCGGA GCTCGTTTCA CTAGCCGACG
       TCGAGCCATC AATCCGGTGA CTAACCCCAA ATTGATATCA GTTGCAGATG

       GTGTGCATTT AGGGGACTTC AGTAGAATCG TTCCCAGATT ATATACCTC-
       GTGTTGATTT AGGGGACTTC AGCCGTTTAA TCCCAGGTTA TTACACCTCA
       GAGTACATTT GGGGGACTTC AGCAGAATTA TTCCTGGCTA TTACACATC-

       --TAGTGACT ATGTGTCTAG TAAAATTGAG ATTCAAGATA ACTCGGTTAT
       AAAAACGGCT ACTTTTCAGG CGGAATCGAG GTACAAGACA ATGCTGTAAT
       --AAATGGCT ATGTTCAAGG GAAGATTGAG ATACAAGATA ATTCAGTTGC

       CGGTAGTCAG GGTCTTGTTC TCCCAGGTTC TGTCATTGAG AAGGATGTTA
       AGGGAGCCAA GCACTAATCC TCCCTGGTTC GGTTCTTGAA AAATACGTAA
       AGGGTGTCAA AGTTTAGTCC TCCCTGGTTC TGTCCTTGAG AAAGACGTCA

       TTCTTGGTGC AATCTCAGTT GCTCCCATGA ACTCAGTACT TCAGCATGGT
       TTCTCGGTGC ACTTTCGGTT GCTCCGGCGA ACAGAATCCT CCACAGTGGC
       TTCTTGGGGC ACTCTCGGTG GCTCCCATGA GTTCAGTTCT TCAGAGCGGT

       GGTGTCTTTG TTGGCTCCAA AAATCCAGTT TTGGTCAAAA GCAAGTCTTA
       GGTGTTTATG TTGGTTCTCC ATTTCCTTCG ATGGTGAAGA ACACTACCCA
       GGTGTTTTTG TTGGGTCTCC AACTCCAGTT ATGGTTAAGA ATGCCATTCA

       CTCGTTGGAT GATCGAATCG AGGAGATGGA CTTGAAATAC AAAAAGGTGC
       AAGTTTCGAC GACAGAATCG AAGAAATGGA CACGAAATAT CGAAAGGTGT
       TTCTTTTGAT GAGAGGTTAG AGGAGATGGA CATGAAGTAC AAGAAGGTTC

       TTGGAAATCT AGCTGCAAAT TTTGCTGCTT CAACTCTTAA AGTGAAATCG
       TAGGAAATCT CGCAGCAAAT TTAGCAGCCA CAACCTTAAA AGTAAAATCG
       TTGGGAATCT TGCTGCAAAT TTGGCGGGCA CAACTCTCAA AGTAAACTCT

       AGATTCTTCC ATCGAATCGG TGCTGCAGGA AAGGGATGCT TAAGCCTCTA
       AGGTACTTCC ATAGAATCGG TGCAGCTGGC AAGGGATTTT TAAAGCTTTA
       AGATACTTTC ACCGGATTGG CGCTGCTGGT AAGGGATTTC TAAGGCTCTA

       CAATGACATC CC---AGGTT TTGCAGATCA CAAGATTTTC AGTCCTGGCA
       CGATCACCTT CCGAAGAATC TCCCACATCA TAGAATATTC TCAACGGGGA
       TAACAACATA CC---AACCT TGCCAGACCA TGAAATCTTC TCTCCTGGAA

       TGACATACCG AGTTATTATG AGGCACAGTA ATTGTTTGAG TTCTGATGAT
       AGGAATATTC CATCATCCTT CGGCACAGCA ACTGCTTGAG CTCTGACGAC
       AGCAATACTC CATTGTGATC CGGCACAGTA ATTGCCTAAG TTCTGATGAT

       GATGCACGAC TTGATCCGCG TGGTGCAGCA ATAAGGATCT TATCAAATGG
       GACGCACGCC TCGATCCACG TGGTGCAGCC ATACGAAT-- ----AACCA-
       GATGCAAGAC TCGATCCACG TGGCGCAGCA ATAAGGATCC TATCAAGCAA

       AACTGATGAG AACAGTTCAA TTCTTGATCT GACACTAAAG ACTGGTAAGG
       --CTGAGGAT AATTCCCCAT TGCTCGATTT AACCTTAAAA ACTGGCAATG
       TGCTGAACAA ACATCTCCAC TGCTGGATCT GACCCTAAAG ACAGGCAATG

       CATTTCATAC TCGGACGATA GGTGACTTTG CAACATGGCT TGTTTGTGGA
       CATTTCATGC ACGGACAATA GGGGACTTCG CCACGTGGCT AGTCTGTGGG
       CATTCCATGC GCGAACTATA GGTGACTTTG CAACATGGCT CGTCTGCGGA

       GCAGCAGCAC GGGAAGAACA CGTGAAGCAT GCTCCACATG TCCGGGATGC
       GCAGCTGCGC GAGAGGAGCA TGTGAAGCAC TATCCGCACA TTCGTGACGC
       GCTGCTGCAC GAGAAGAGCA TGTGAAGCAT GCTCCACATA TTCGTGATGC

       AATGTGGGGT TCTCTTCGAC AGGCATACTC CTACACCGAG CTGCATTACT
       AATGTGGGGA TCTTTGAGGA GAGCCGATTC TTACACCGAG CTTCACTATT
       AATGTGGAGC TCCCTGAGAA CAGCTGACTC TTACACTGAG CTCCATTATT

       ATTCAAACAT TTGCAGGCTT TTCAGATTCA AAAATGATCA AGAGATGTAT
       ACTCGAACAT CACACGGCTT TTCCGGTTCG AAGATGGCCA AGAAATGTAC
       ATTCAAACAT TTGCAGGCTG TTTAGGTTCA CGGATGGACA AGAAATGTAT

       GTCAAATTCA AGTTAAGGCC TTTTGACAAC AAC------- ----------
       GTCAGATTCA AATTGAGGCC GTTCGATAAG AAT------- ----------
       GTCAAATTTA AGCTGAGGCC TTTTGATAGG AAG------- ----------

       ---------- ---------- ---------- ---------- ----------
       ---------- ---------- ---------- ---------- ----------
       ---------- ---------- ---------- ---------- ----------

       ---------- ---------- -----ATCGG TGAGGACTCT GGTGAGGTGA
       ---------- ---------- -----ATATC AGAGGAAAAT GGAAAAGTGG
       ---------- ---------- -----TTCGA TGAGGACACT GGCAAGGTAG

       AGCCTAGAGG CGTACTTCCA CCAGAGACCG GTGCAATTCC AAGAGATGAG
       AGCCCACAGG TGTACTTCCG CCAGAAACTG GGGCTATTCC GAGAGATGAA
       AACCAAGGGG TGTACTTCCA CCAGAAACAG GTGCAATCCC AAGAGACGAG

       AATGACAACC GTCCATTACG CTTCCTTGAT GAAGATTTCC AACATCGTCT
       AACGACAAAC GCCCTCTGCT TTTCCTGGCC GATGATTTTA AGAAACG---
       AATGATAAGC GCCCTTTACT TTTCCTCGAG GATGATTTTC AGCGTCG---

       GCACACACCC GAGAAGGTGA AACAATGAAT TTTACTAAAT ACATTCGAGT
       ---------- ---------- ---GGTGGAT TCTTC----- ----------
       ---------- ---------- ---AGTAAAT TCTCC----- ----------

       CTTTTGGATT TTCAAAGTGG TTCGTAAATC TTGAAAATGA ACAGGTCCGT
       ---------- ---------- ---------- -------GGA TAAAGTTCGT
       ---------- ---------- ---------- -------AGA GAAGGTACGT

       TATGTTCTAC AACTACAAAT GCGATCAATA CCGGAGGATG AAGTGAATCG
       TATGTTCTTC AGCTGCAAAT TCGACCAATC ACAAACGACG AAAGGGAAAG
       TATGTTCTTC AATTGCACAC CCGATCAGTC CCAGAGGATG AAAACACACG

       GGAGGCAGCA CTTGATTGCA CCAAGCCATG GGATGAGATT GAATTTCCTT
       GGAGATTGCA CTCGACTGTA CAAGGCCATG GAACGAAACG GAGTTCCCAC
       AGAAAATGCA CTTGACATCA CCAAGCCATG GGATGAAACT GAGTTTCGAG

       ATATTGATAT AGGAGACATA ACTATTGATG CAACACTAAC CAAAGAAGAA
       ATTTCGATGT GGGAGAGATT ACAATCGAAC GAATCCTGAC CAAAGAGGAA
       TCATTGATGT GGGAGAGGTA ACCATTGATG AAGTGCTCAC CAAGGAAGAA

       TCAGAAGATC TAGAGTTCAA TCCATTTCTC AAATGTCATG AAGTGGATGT
       TCCGAAGAAC TCGAGTTCAA CCCTTTCCTT AGATGCCACG AAATCGATGT
       TCTGAAAAGC TAGAGTTTAA TCCGTTTCTC AGATGTCGTG AAGTGGATGT

       CATTCGAGCA ACGTCATGTA ATCAAAGCGC GTCCATGGAC CACGGACGTT
       CATAAGGGCT TCGTCTTGTA GTGAAAGTGC ATCAATGGAT CACGGCCGTT
       CATCAGGGCA ACATCGTGCA ATCAAAGTGC ATCGATGGAT CATGGCCGTT

       CAGTCGTTTA CGACATATGC CAGCACCTGA GAAACAGAAA ACCTCTCCCA
       CATTAGTGTA CTCGATCTGC CAACATCTTC GAAACAAGAA ACCACTTCCT
       CAATAGTATA TGCAATATGC CAACATCTGC GCAACAAAAA GCCTCTTCCT

       GAAGCTTGGA GAATCTTCTT AGATCAATCT GATGTTAAAA TGGACTTCAC
       CAAGCATGGA GGGCTTTCTT GAATCAATCC GACGTGAAAA TCGACCTCTC
       GAGGCTTGGA AGATGTTCTT GGATAAATCT GATGTCAAAG TCGACCTCTC

       CGGCTGTCCC CTTGCATCG- --ACATTGGA GAAAAAGTAC TACCAAAGTC
       GGGCTGTCCA ATGGCAGCTC AAAAACTCGA GAAAAGTCGA TACAGAAGTG
       TGGCTGTCCA ATGGCAGCT- --GCATTCGA GAACAAG--- TGCAAGAGAA

       GTCATGCTAG CAAGATCTTG GTACGTGACA ATGTGGCTAA TATCCGTTCA
       GTGACATTAG CGAGGCCATG GTACGTGACA TTGTGGATGA TGTCAGCTCA
       GTAACACTGG CAAGACCTTG GTATGTAACC TTGTGGCTGA TGTCGGGTCA

       GCCGTTCTTG CAAACATTTC TTCCATATTT TCTCATGGGA TTCATAATCT
       GCCGTTTCTG CAGATTTTTC TACCATACTT CCTCATGGGG CTAGTGATTT
       ACCATTTCTA CAAACTTTCC TACCTTATTT TCTGATGGGA TTAGTAATAT

       TTGCTCCATT GAAGTTTCTT TTCTACACAT CGGATACCAT AAAAGTCCAA
       TTCCACCGAT GAAATTCATC TTCCACCAGA ACAAGATCAG TAAAACAGAG
       TTGCTCCTCT GAACTGCATC TTCTACGTAA ATAAGATCAC AATGACTAAG

       ---AAGCATT GGCTGCTACC TTTGTTTTGG GTTGTGACAG GACTTTTAGG
       CACATGCATT TCTTGCTGCC ACTTTTCTGG ATTTGCTCGG GCGTTTTATC
       ---ATGTATT GGTTGCTGCC AATCTTTTGG CTTTCTTCTG GAATTTTAGG

       TGGAGTTCTA TGTGCTTTTG CAAAATGGAT TCTTGTAGGA ATCAAGAAAG
       AGGGCTCGTT TGCGCTATAT GTAAATGGAT TCTCGTGGGT AAGAAGAAAG
       AGGTCTTGTT TGTGCTGTGA GTAAATGGAT TCTCGTTGGA AGAAAGAAAG

       ACGGCGAATG TGAGCTAATT TGGAGTAAAG GGATCTTCTG CGATACGATT
       AAGGCGAAAA CGAACCGATA TGGAGTGTGG GGATATTCAT GGACACTATT
       ACGGTAAAGT AGAGCCAATT TGGAGTGCAG GGATATTTAT GGACACTACT

       TGGCAAGCAA TAAGGACACT AGTAGGAGAC TATTTCATGG AAATGAAAAG
       TGGCAGGCTG TAAGAACACT GGCTGGCGAC TATTTCATGG AAATGACAGG
       TGGCAGGCTA TAAGGACACT AGTTGAAGAA TATTTCATGG AAATGACAGG

       TGGATCGTTC TTATTCGGAA TTTGGATGAA GCTTATGGGA TCAGAAGTTG
       CGGCACGTTT ATATTCAACG TGTGGATGAA TCTAATGGGA TCGGAGGTTG
       CGGATCATTT CTGTTTGGAA TTTGGATGAA ACTCATGGGG GCAGAAATCG

       CTTGGGACGA AGGCACATAC ATCAACAGCA TGGGAGCAGT TTTGAATCCT
       ------ACAG AGGCGCTTAC GTGGACACCA TGGGAGCTGT GTTAAATCCT
       CCTGGAAAGA GGGAGTTTAT GTGGACAGCA TGGGTGCAAT TTTGAATCCT

       GAAATGGTGA GAATCGACAA GTATGGATCG ATCGAGAGAG AAGCTTTGCT
       GAACTGGTGA AGATAGAAGA GAATGGCTCT GTGGAGAGAG AAGCTTTGCT
       GAACTGATTA GGATTGAAGA ATATGGCTGT CTGGAGCGAG AGGCTTTGCT

       CTTTGGACAC ATATATGAAG GTGAAGGAGG CAAAGTGAAG TATGGCAAAA
       GTTTGGACAT ATATATGAAG GCGAAGGGGG TAAAGTTAAG TACGGAAAAA
       CTTCGGGCAC ATATATGAAG GTGAGGAAGG GAAGGTGAAG TACGGGAAGA

       TCAAGATTGG AGAAAGTGGC TATGTGGGAA GCAGAGCTGT GGCAATGCCA
       TCGTCGTTAA GAAAGGCGGA TTTGTCGGGA GCAGAGCAGT GGCAATGCCG
       TTGTGGTTAG AAAAGGCGCG TTTGTGGGAA GCCGAGCAGT GGCCATGCCC

       AGGGTAACTA TTGACACTAA ----AGGTAC TCTTGGAGAT CTCTCTCTTG
       GGAGTTACTG TGGGTGGTGG TGGTGGTTGC -TTAGGTGCT CTGTCTTTAG
       GGAGTGACTC TGGACAACGG ----AGCTAC ACTCGGGGCC CTCTCGTTAG

       CCATGAAGGG AGAGCTCGTA AAATAG---- ---------- ----------
       CTATGAAGGA AGAGTTTGTT AACTAA---- ---------- ----------
       CGATGAAGGG AGAGTTTGTC AAGTGATGCA TGGTAAAATA AAGCATAAAG

       ---------- ---------- ---------- ---------- ----------
       ---------- ---------- ---------- ---------- ----------
       GATTTTCAAG TCATTTTCTG ATGATAAAAT GCTCCAAGCA AATTCATTGC

       ---------- ---------- ---------- ---------- ----------
       ---------- ---------- ---------- ---------- ----------
       ATACTTATCA GAAAACATAT TTGTGATTTA GAAACTCACT CGTCTTCTGA

       ---------- ---------- ---------- ---------- ----------
       ---------- ---------- ---------- ---------- ----------
       AATTATTATA CGTTAGCACG TTTGACTTGG AACTTCTACC ACAATAGTTA

       ---------- ---------- ---------- ---------- ----------
       ---------- ---------- ---------- ---------- ----------
       TACCAAAGAT GATCCTATTC TTCAGGTACA CCGATTACAA TTTTCAGTGC

       ---------- ---------- ---------- ---------- ----------
       ---------- ---------- ---------- ---------- ----------
       CCAATTATCT CCTTTTAATC TATCTTTAGT TTTCTTTCAT TATACTCTCC

       ---------- ---------- ---------- ---------- ----------
       ---------- ---------- ---------- ---------- ----------
       TTATAATCTA AAGGCAAAAG TCAAACTTAC TTTTCGTCGG ATCCAGGACT

       ---------- ---------- ---------- ---------- ----------
       ---------- ---------- ---------- ---------- ----------
       ACTTTTCACC CATCTACCAC TTATTTTTGT CACATTGTAA TTTAAACAAC

       ---------- ---------- ---------- ---------- ----------
       ---------- ---------- ---------- ---------- ----------
       TTACTATGAA AATCAAACAA CTTGTCCAAT TTCCTTCAAT AAATATACGA

       -------
       -------
       ATTTCAG
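
Before loading an alignment like the one above, it can help to confirm independently which taxon names and sequence lengths the file actually declares. The sketch below is not part of DOMINO. It assumes a relaxed interleaved PHYLIP layout (name and sequence separated by whitespace, names present only in the first block), and the function names are hypothetical.

```python
def read_interleaved_phylip(text):
    """Parse relaxed interleaved PHYLIP text.

    Returns (n_taxa, n_sites, names, sequences), where n_taxa and n_sites
    come from the header line and the rest from the alignment blocks.
    """
    lines = [ln for ln in text.splitlines() if ln.strip()]
    header = lines[0].split()
    n_taxa, n_sites = int(header[0]), int(header[1])

    names, seqs = [], []
    for row, line in enumerate(lines[1:]):
        parts = line.split()
        if len(names) < n_taxa:
            # First block: the first token is the taxon name.
            names.append(parts[0])
            seqs.append("".join(parts[1:]))
        else:
            # Later blocks: sequence only, rows cycle through the taxa.
            seqs[row % n_taxa] += "".join(parts)
    return n_taxa, n_sites, names, seqs


def check_alignment(text):
    """Return (names, problems) for a relaxed interleaved PHYLIP alignment."""
    n_taxa, n_sites, names, seqs = read_interleaved_phylip(text)
    problems = []
    if len(names) != n_taxa:
        problems.append(f"header declares {n_taxa} taxa, found {len(names)}")
    for name, seq in zip(names, seqs):
        if len(seq) != n_sites:
            problems.append(
                f"{name}: {len(seq)} sites, header declares {n_sites}"
            )
    return names, problems


if __name__ == "__main__":
    # Tiny made-up alignment, only to show the call.
    sample = "2 15\nTaxonA ACGTA CGTAC\nTaxonB AC-TA CGTAC\n\n       GGGTT\n       GG-TT\n"
    names, problems = check_alignment(sample)
    print("taxa:", names)
    print("problems:", problems or "none")
```

If the names and lengths come out as expected here, the file itself is probably well-formed and the cause is more likely to lie elsewhere.
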
JFsanchezherrero commented 6 years ago

Dear user, could you send us the command-line log file and any other error files that DOMINO generated in the output folder? They will give us a better understanding of what is going on.

Thank you very much, Jose F

plumbwj01 commented 6 years ago

Hi Jose, I have attached the generated command-line log file, both MSA files (PHYLIP and FASTA formats of the same sequence data), and images of the stages I am stuck at. The second and third images show the no-taxa issue I am having with both of the suggested file formats (image 2 is the FASTA file and image 3 is the PHYLIP file). I don't know whether the taxa selection stage is actually essential for DOMINO to function, or whether I am just interrupting the scan prematurely when I select the Files tab to obtain the dm command line file. Sorry for the hassle and thanks again in advance, Will

[image: Inline image 1] [image: Inline image 2] [image: Inline image 3]

######################################################
Project Directory: /Users/user/Desktop/TREES
Command line Instruction: /usr/bin/perl /Applications/DOMINO-1.0.1/bin/scripts/DM_Clean_v1.0.0.pl -input_file /Applications/DOMINO-1.0.1/example/I_brauni098.fastq -type_file 5 -l 70 -s 20 -m 100 -p 2 -outputFolder /Users/user/Desktop/TREES -TempFiles
######################################################

######################################################
Project Directory: /Users/user/Desktop/TREES
Command line Instruction: /usr/bin/perl /Applications/DOMINO-1.0.1/bin/scripts/DM_Assembly_v1.0.0.pl -o /Users/user/Desktop/TREES -mrs 80 -type_file 5 -p 2 -TempFiles
######################################################

######################################################
Project Directory: /Users/user/Desktop/TREES
Command line Instruction: /usr/bin/perl /Applications/DOMINO-1.0.1/bin/scripts/DM_Clean_v1.0.0.pl -input_file /Users/user/Desktop/TREES/untitled folder/OG70_full_length_guidance_results_full_length_MUSCLE.MSA.MUSCLE.Without_low_SP_Col.With_Names.fasta -type_file 4 -l 70 -s 20 -m 100 -p 2 -outputFolder /Users/user/Desktop/TREES -b /Users/user/Desktop/TREES/untitled folder 2/OG70_CDS_guidance_results_CDS_MUSCLE.MSA.MUSCLE.Without_low_SP_Col.With_Names.fasta
######################################################

######################################################
Project Directory: /Users/user/Desktop/TREES
Command line Instruction: /usr/bin/perl /Applications/DOMINO-1.0.1/bin/scripts/DM_Assembly_v1.0.0.pl -o /Users/user/Desktop/TREES -mrs 80 -type_file 5 -p 2
######################################################

######################################################
Project Directory: /Users/user/Desktop/TREES
Command line Instruction: /usr/bin/perl //scripts/DM_MarkerScan_v1.0.0.pl -o /Users/user/Desktop/TREES -VD 0.01 -p 2 -option msa_alignment -msa_file /Users/user/Desktop/output/test.fa -DM selection
######################################################

######################################################
Project Directory: /Users/user/Desktop/TREES
Command line Instruction: /usr/bin/perl //scripts/DM_MarkerScan_v1.0.0.pl -o /Users/user/Desktop/TREES -VD 0.01 -p 2 -option msa_alignment -msa_file /Users/user/Desktop/output/test.fa -option msa_alignment -msa_file /Users/user/Desktop/sample.phylip -DM selection
######################################################
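
The log above records two different script locations: `/Applications/DOMINO-1.0.1/bin/scripts/` for the cleaning and assembly steps, and `//scripts/` for `DM_MarkerScan_v1.0.0.pl`. Whether that difference is related to the missing taxa is not established in this thread. The sketch below only shows one way to tabulate the script directory and `-msa_file` arguments of each logged command so they can be compared side by side. It is not part of DOMINO, the function name is hypothetical, and it assumes each `Command line Instruction:` entry sits on a single line, as in the log above.

```python
import posixpath
import re

# Interpreter, then a script path ending in .pl, then the remaining arguments.
ENTRY = re.compile(r"Command line Instruction:\s*(\S+)\s+(\S+\.pl)\b(.*)")
# Note: \S+ stops at the first space, so unquoted paths that contain
# spaces (e.g. "untitled folder") would be reported truncated.
MSA_ARG = re.compile(r"-msa_file\s+(\S+)")


def summarize_log(log_text):
    """Return one dict per logged command: script name, directory, MSA files."""
    steps = []
    for match in ENTRY.finditer(log_text):
        _interpreter, script, args = match.groups()
        steps.append(
            {
                "script": posixpath.basename(script),
                "script_dir": posixpath.dirname(script),
                "msa_files": MSA_ARG.findall(args),
            }
        )
    return steps


if __name__ == "__main__":
    # Shortened, made-up log text, only to show the call.
    demo = (
        "Command line Instruction: /usr/bin/perl /opt/app/bin/scripts/DM_Clean_v1.0.0.pl -l 70\n"
        "Command line Instruction: /usr/bin/perl //scripts/DM_MarkerScan_v1.0.0.pl"
        " -o /tmp/p -msa_file /tmp/a.fa -DM selection\n"
    )
    for step in summarize_log(demo):
        print(step["script"], step["script_dir"], step["msa_files"])
```
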

CristinaFriasLopez commented 6 years ago

Hi Will,

We are looking into why you got this error.

In the meantime, could you please repeat the analysis with our test MSA file (example_phylip.txt)? You can find it at the following link: https://github.com/molevol-ub/DOMINO/blob/master/example/phylip_msa/example_phylip.txt

Thanks in advance, Cristina

plumbwj01 commented 6 years ago

Hello again,

I am experiencing the same problem with the provided example file. In response, I uninstalled and reinstalled both DOMINO and Xcode several times to make sure the issue did not lie there; however, this made no difference, and I am still unable to select taxa for the analysis with either the example file or my own data set. Could the problem be due to an error on my side, such as missing a vital step when setting up the analysis? Thanks again
