Closed by mikecormier 1 year ago
@mikecormier can you post (the top of) your input file, AG.fa? Have you tried other inputs? If you still have them, can you send me the gimmemotifs.log files of your failed runs?
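For anyone gathering the same information, here is a minimal sketch in plain Python for summarizing the top of a FASTA file such as AG.fa. It is not part of gimmemotifs, and the function name `fasta_summary` is hypothetical. It reports each header with its total sequence length, which is often easier to share than the raw sequence:

```python
import sys


def fasta_summary(lines, max_records=5):
    """Yield (header, sequence_length) for the first max_records FASTA records.

    `lines` is any iterable of text lines (an open file handle works).
    Sequence lines wrapped over several rows are summed into one length.
    """
    if max_records <= 0:
        return
    header, length, seen = None, 0, 0
    for line in lines:
        line = line.strip()
        if not line:
            continue  # ignore blank lines
        if line.startswith(">"):
            # A new header closes the previous record, if there was one.
            if header is not None:
                yield header, length
                seen += 1
                if seen >= max_records:
                    return
            header, length = line[1:], 0
        elif header is not None:
            length += len(line)
    # Emit the last record read (only reached if the limit was not hit).
    if header is not None:
        yield header, length


if __name__ == "__main__" and len(sys.argv) > 1:
    # Usage (assumed invocation): python fasta_summary.py AG.fa
    with open(sys.argv[1]) as handle:
        for name, n in fasta_summary(handle):
            print(f"{n}\t{name}")
```

Running it on the input prints one line per record (length, then header), so unusually long sequences or unexpected header formats are visible at a glance.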
Hey @simonvh, I have tried other input files. AG.fa is the smallest of the input files I am using. Each input file contains a different set of regions we are interested in, and for each set we know at least a few of the motifs that should be present. Every file produces the same errors.
Head of AG.fa
>chr1 pos:879077 gene_name:SAMD11 strand:+
CCAAAAGCTTTTTATTCTCCTCTAGGGGGATGAGAGGGGGGCTCGTTAACTTGCACAAGAGGCTAGATGGCGGGTGGGGCAGCTGGGTGCCTGCTGTGGATCTCTTCTGCACACACGCACCAGGGCCAGTGTCAGAGCTCCCCTGTGCCCCTGTCCCGCCACAGCCAGGCGTGATGTCCTCTGCGCTGAAGGCTGGGGCTGCCAGGGCTGGGCAAGGCCTGTACTCACCAGGACCAAGGGCCCCCTGAGAGATGGTGGGTGCGGTCCAGGCTGAGCTGGAGCAGGGGCTGGGTTCCCCTTCCATTCCTTGAGATGCAGGTGGGCACTCACTACCCTCCCGCAGGTGACCTGTTGGGCAAGAGGCTGGGCCGCTCCCCCCGTATCAGCAGCGACTGCTTTTCAGAGAAGAGGGCACGAAGCGAATCGCCTCAAGGTAAGAGCGTGGCTGGGACGAGAGACAGGTCACCAGGGGAGGGGGCAGTCCCTGAGGGTCCCCTGGACCTCGAGCAGGCACTCTAGAGGGGCGTGGTCCTCGGCAGTGCCTGGAGAAACCTCTCACCCCGGGTCCTCCCCAGCAGAGGCGCTGCTGCTGCCGCGGGAGCTGGGGCCCAGCATGGCCCCGGAGGACCATTACCGCCGGCTTGTGTCAGCACTGAGCGAGGCCAGCACCTTTGAGGACCCTCAGCGCCTCTACCACCTGGGCCTCCCCAGCCACGGTGAGGACCCACCCTGGCATGATCCCCCTCATCACCTCCCCAGCCACGGTGAGGACCCACCCTGGCATGATCTCCCCTCATCACCTCCCCAGCCACATGTACTCGGCCATTCCTGTTGCTGAGGCCCTGCTGACACCAAGGCCAGGCTGGATGCAGGTCCCTCTGCCACACGTCCTGCCCCATGCCCCCTGGGGCGGGCCACACCTCCATGTCCCCTAGGTCCCCAGGGTCATGACTAGCTCACATTTTATATAGAGAGAAATGGAGTCTGGGGTGGACCCAGGTGAGGGTGGGCAGTGGGCATGTCAGCAGCACCCCCCGAGGAGAGCAAGCTCCTGGACCCTGTGGTCTGTGAGTCGTCTATGCAGCCAGTGGACGCCGACCTGCCAGACGCCTGCCCCAGGAGCCTGGGGAGGGGCAGTGAGCAGAAAGGCCGGGCTGGGTGCAGTGGGCACTTGGCCACCAGGACTCCCCAGGTGCTGAAGAGACGCCAGCTGGAGGGGCTGCCCCTTCCCCCGGGTCGGCCCTGACCCTGTCCACCCCACCTCAGGACGTTCTCCAGGGGTCCCTCCGGGATGCACTCGGACCCCCTGCCCGCTGCACTCAGCCTCCCAGGCCCCAGCCGCCCGCCTGGCAGGGGAGCTTGGCTTTTCGGGCTAGAGGTGGGTGGGGGCGCCGGGAAAGGAGGCAGGATTCCTCACACCAGGCACCGTCCCCCAGGGCAGCTCAGGCACCAAGAGCCTGAATAATTCACCAAATGTTAATAATGTAAAAATCCTCCTTTTTAATTGCTTTCCCTGCTCTGCCTGGGGCCGCTCTGCTGGCCGCGCGGGGGAGGGGCGCCGGCCGCCGGGGAGCGCGCTGTCAATCAGGCCGCGCCGCCGCCCCCCCCCCCCGCCCCGCCGCGGAGCCGGCCGTAAATAACCCTGTAACTAACCCGGCCGCTAGCGCGGGGGCGCTGGGCCCCGCTGGGATCGATGCGGGCGGCCGCGCCGGCTGGGCTCTGCGGGCTGGCACCCGGCCCGGGGCGGGACCCACCTCCGCTTTCGGGTAATTAATTTATAAACAGAGGCGGCGGTGGAGCTGGCGGAGCCTGCATAGTGGGGGCTGCGGGGACTCGGGAGGCCCGGGCGGGAGGGAGAGGCCGAGAGACCTGGGACGCGGCGCCTTAGACGCGGGCGCTGCGTGCGCATTGGGGCGAGTGTGGCCACGCGGGACAGTGACCCTGCGCAGCCGGGACTGGGCGACCCCTGTGCTAGTGTGGCGTGCGTGCGCGGGCGCTGC
CTTGCCTTTGTGACAAGCTTTGGCCAGCCGCGTCTACTATGGGGACCTCAGATTTTCTTGCCTCCCACCGAAGAGGGGGTCCCCTGGGCGGTCAGCCCCTGGCTGGCACTTCTGGACTCTCTCGCTGCCCCGCAGGCTCTGTGGCCTCGGGACGTCTGCACAGCCCCCTCCCCGCAAGGCTCAGCCGCCTCTCAGGCCGGAAGCCTCCAGGCACCCGGCTCCCCTTCGGGGAAGAGCTTTTCCCGACACTTCCTCGCCCAGCATCTTGTCTGCCGTCTCGGCCCTGTGGCCGCCCATCCTCCTGCCCCGTGCCCGAGACCAGCCCAGGGGCCGAGCACGGCCGAGTGGTGTGGTCAGTTCCCCACCTCAGTGTTCTACGCCAGGACGCGGGCTGGGGAGGATGAGGGCGCATAGCCGGGGGGATCACTGCTGTTGTCCCCCACCCAGATCTCCTGAGGGTCCGGCAGGAGGTGGCGGCTGCAGCTCTGAGGGGCCCCAGTGGCCTGGAAGCCCACCTGCCCTCCTCCACGGCAGGTCAGCGTCGGAAGCAGGGCCTGGCTCAGCACCGGGAGGGCGCCGCCCCAGCTGCCGCCCCGTCCTTCTCGGAGAGGTACTGGGGTGGCTGCCGTTCTCTGCTTGTTTCTGGGGTGCCGCCCGCACCCCCGCGCTCTCAGCCACCAGCACGCGCCCCGAGAGTGCCAAGCACTGTGTTCAGCTCTAGGTTCGGGTCCGGGCAGAGCGTTTCGGGGGTGACACCGATCTGGGCTGCAGTGTTGAGGGCGCCACTGGGGTGCGTGAGGGAGGCTGAGGCCCATCAGGGGGTTCCCTGGAGGAGAAGCCAGAGAAGGGGAGAGCTCCAAGTCTGGAACCCCGGGGTCAGTCGGGAGGGGTCGGCCAGAGGACTCAGAGCTGGAGGCGGAGGGGGGGTCCTGGCTGGCGCTCAAATGTAGACGCCGGCGCCGGATCTGTTCCCGGCACAGACAAGGCCTCCGGCACAGACCCGGGTTTCTCGGGTCCAGGACACGAGGCGGGGCGGGGCGCCTGGAGAAGGGAGGGGCCGCCTGAGGCCCGAGTCCCTGCCCGGCCGCTGAGCCCGGCGTCTGCAGCTGCCTCCACCGCCGCCCGGATTGCGGCTAATGACGCCCCCGCTTCCCCCGCCGCTCGGGTCCGCAGGGGAGGGGAGCAGGCGGGGCCGGCGCCCCGCGCAGTAATTACCGCTGCAGCCGTCGCCGCCCGCCGGGTCAGCGCCTCCGCGCCGCCGCCGAGATTAATTGGCGCCGCCGGCGGGGGCGGGGATGGCGCGCGACCTGGGGCCGTAACGAGCTGCGCATCGACCGCCCGCGGGGCCGGCAATTAGCGGAGGCGGCGGGGGAGGGGCGCCGGGGCCTTTACGGGAACGGGGGCGGGGGGGACGCCGCTCATTGCGCTGCCGTCCACAGGGAGCTGCCTCAGCCGCCCCCCTTGCTGTCGCCGCAGAATGCCCCTCACGTCGCCCTGGGCCCCCATCTCAGGCCCCCCTTCCTGGGGGTGCCCTCGGCTCTGTGCCAGACCCCAGGTGAGGAGGCGGGTGCGCATCCCCTGGGAGCCCGCGTGGAGGCTCGCGGACCCGGCCCTGCCCCTGTCGGAGCCGAGACGGACCGGGTAGGGGATTGCAAAGGGCCGGCTCGGACCGCCTCGGACCCCCCGACCCCGCGTTGTCCCCCTCCCCACCAGGCTACGGCTTCCTGCCCCCCGCGCAGGCGGAGATGTTCGCCTGGCAGCAGGAGCTCCTGCGGAAGCAGAACCTGGCCCGGTAGGTGCGGGGAGGCGGGCGGGGCCGCGCGGCCCGGGAGGCGGCTGACCCGCGTCTGCCCCCGGCCCAGGCTGGAGCTGCCCGCCGACCTCCTGCGGCAGAAGGAGCTGGAGAGCGCGCGCCCACAGCTGCTGGCGCCCGAGACCGCCCTGCGCCCCAACGACGGCGCCGAGGAGCTGCAGCGGCGCGGGGCCCTGCTGGTGCTGAA
CCACGGCGCGGCGCCACTGCTGGCCCTGCCCCCCCAGGGGCCCCCGGGCTCCGGACCCCCCACCCCGTCCCGGGACTCTGCCCGGCGAGCCCCCCGGAAGGGGGGTCCCGGCCCTGCCTCAGCGCGGCCCAGCGAGTCCAAGGAGATGACGGGGGCTAGGCTCTGGGCACAAGATGGCTCGGAAGACGAGCCCCCCAAAGACTCGGACGGAGAGGACCCCGAGACGGCAGCTGTTGGGTGCAGGGGGCCCACTCCGGGCCAAGCTCCAGCTGGAGGGGCCGGCGCCGAGGGGAAGGGGCTTTTCCCAGGGTCCACACTGCCCCTGGGCTTCCCTTATGCCGTCAGCCCCTACTTCCACACAGGTGGGCACCCCCACACTCTAGATCCTTCCAGAGGGCACAGGACTGGCAGGCCGCCTGTGGAAGGGTCTTGGGGGGAGGAAAAATTCCCCTTAGGCACCCATCCCCCACCTCAGCAATTGGGGCACACGACGGTCAGGAGACGGGCGGGTATGGGAAAGCCAGCCAGAGCCCTAGTAACACGCCCCACAACTCAGGCGCGGTAGGGGGACTCTCCATGGATGGGGAGGAGGCCCCAGCCCCTGAGGACGTCACCAAGTGGACCGTGGATGACGTCTGCAGCTTCGTGGGGGGCCTGTCTGGCTGTGGAGAGTACACTCGGGTAAGGGGGGGCCCCAGTTCCTGGGGCGGGGCTGGAGCTGGCTGGCAGTCACTACCTCCCTGGAAAGGATGGTGGGGTAGGGCCATTCCCCAACGCCCTCTCCCTCCCCAAAAGCAGTGCGCAGCAGGGACTGGACTGTGCACCCCACCTTTTTTTTTTTTTTTTTTTTTTGCCAGGTGTTTTCTGCCTGACACTCAAACCCAACAGATCACTGTTTTTAAAAAATTTCCGTGAGCTGCACAAACAGCTCCTCTTGGCTCTGCTGGGCTGGAGGATGGAGCAGCACCCGGGTCCTGACCCTCCCTCCCTCCCCCTTCCAAGTCTTCAGGGAGCAGGGGATCGACGGGGAGACCCTGCCACTGCTGACGGAGGAGCACCTGCTGACCAACATGGGGCTGAAGCTGGGGCCCGCCCTCAAGATCCGGGCCCAGGTGAGACGCTGGGGAGTGAGGTCAGGGTCTCCAGACCACAGCTGGGCAGAAAGCTCTGGGTGGGTGTGCGACAGCCCCCACCAGGCCATCTCTCTGCAGGTGGCCAGGCGCCTGGGCCGAGTTTTCTACGTGGCCAGCTTCCCCGTGGCTCTGCCACTGCAGCCACCAACCCTGCGGGCCCCGGAGCGAGAACTCGGCACAGGAGAGCAGCCCTTGTCCCCCACGACGGCCACGTCCCCCTATGGAGGGGGCCACGCCCTTGCCGGTCAAACTTCACCCAAGCAGGAGAATGGGACCTTGGCTCTACTTCCAGGGGCCCCCGACCCTTCCCAGCCTCTGTGTTGAGGTTGCCGGGGGTAGGGGTGGGGCCACACAAATCTCCAGGAGCCACCACTCAACACAATGGCCCTGCCTCCCACCGCTTTATTTCTTTCGGTTTCGGATGCAAAACAAAAAATTTTAAAAGAAAATGTGACTTCAAAGGAAAGGAACAAATTTTCAAAGACTTGGGGGAGTGAAGGCAGAGCCTGGTGCAGATGGACGAGGTCTGCAGACGGAGGGCAGAGGTGGTGGAAGGGGCCAGGGGCCTGCAGGCCTCCCCCTGGAACTGGGACTGGTCTCGGTCTGCTGACGTCAGGGTCAGCTCCCCCGCGGAGCTGACTTCAGCAGCCCACAGCTGTGGGGCTTCAGCAGCCACACCAGCCCAGCCCAGCCCAGCTCTCGATACGTTTGGTCTTTCATGCTGAAAAATAAATAATAAAGCCTGTCCCGTGTCTACTGCCTCCCCCAACTGCACAGACGCCAGCCTCTAGGCCTGACTGCCAGGGAGGTGGAAACACTGGCCACCAGCCCGGCAGCCCCTACAGGCCCCCCAGATGGGCTGCCTCA
GTCGTCCTCTGAGAGCTGCAGATCCTCCAGCTCGTCCTCCGGCCCCTGGGCCAGCTGCTGCAGCTCCCCAGGGGCCAGCCCCGCCTCTGCGTCTGGGTCTCCATCTGCGGGGAGAGATGGAGGCTACATAAATTTTGCTTTATCAGGAAGAAGCCAGCCTTAGAGGTTACTCATCACTAATTAATCACGGCACTAATTAATTTATCCCTGTTGCTGGCTGCCAGAGAACAGAGCATTTGGCCTGGCCTTCCCAGGGAGGGAAAAGCCTGGCCCAGAGCCCCACGCCCCCCGCCCACGTGGCTCTGCCCTCCCGCCAGATGGGCTCACAGGGCCACACCCTCTCACCCCAAGACCATTCACCCTCCGAGTTGCTGCTGTCCTCCTCGCCCTCCTCCTCGTCCTCTTCATCGTCTTCCACCCCATGCCGAGTGCTCAGGGGCCTCAGTATCCCTGAGGAACAAGAAGCAGAGTCCATATGACTCCCACCCACAGGGTCCACCAGCAAAGTCACAGTGGGGGCAGGAGGGTGGCCAGGCTCCCAACACCCTTCCCTCCGCTGACTTCCAGCAGGTGGAGAGGAGCCCTGGGGAGGAACTGGGAGGTCACAGGCCTGGGGACAGAGTTACCAATCCCAGTAGGCCTTCACTTCAAGGAGGGAAGGCGCTGGCACCAGAAGCCTGGCAACACTGAGGTTGGCCCCAGCTGGGCCAGAGACTGGTGAGCCCCCTGCAGGATGGGTACAGGTGGCCCTCGTGGCTCTGGGAAGTCCAGCAGAGCCCTCCAGGCCCACCCTTCCCCTGGGAGCACCACGCAGGCCCCACCTCTCTCCGAGAATCCCTCGGTGTCGTCCTCTTCAGAGCTGTTCAGGTCAAAGAGGTCTTTAAATTGCTTCCTGTCCTCATCCTTCCTGTCAGCCATCTTCCTTCGTTTGATCTCAGGGAAGTTCAGGTCTTCCAGCTGGAAGGCCAAAGAACCAGGGGCTCAGGTGAGAGAGGGCAGGGGCTGGCGGCCACAGCAGGGCCAGGCATCGCCAGACCCACCACCAGGGCCCCATGTGGCCAATTTCTAGTCCCCTCTGTTCCCAAATCACAAAGCCATCCTCCAAGTTGTCCATCCCATGTCCAAGGTCAAAGGCAGAGCCCTTCCTGCTTCTCCTCACGGGCCCCTGGTGCCCACATACTGGCCTGGGTGACGAGGTCAGTCCAGCCACTCCACCTGCCCGGAGCCTCCAGCCCATCAGGCCTGAGGGGGCATGGCCTCCCCAACCTAGTGCAGCCTGGGGCTTCCCCTCCCTGGAAACGCCTGGTTCTGGCCAGTTCTCCAACACCTACCCCCTCTCCAAGTCGAATCATCCGGGCACGGCCCTGGCCGCCTGGCACTGTTTCCAAACCCTCGCCCTGGTCTCAAGTCATAGTGCGCTAGATCTGAAACCCAGGAAGTCACAACACACCCCCAGGTCCCCTCGCCGAGCCGCACCCGCTCTTTGCCACTGATCTCCAGCTGGATCTCCCGGTCACGCAGCTTGCGCCAGTGGCTGTAGTACAAGGTCAGGGGTGTCCCCTCTTCCCGGGTCAGCTTCTCCCAGGCTTCCTGGGGGGTTGGGGGAGTTCAGGGTCATGCCTCACCCTGGGCAAACCCCCACATGTAGCTGGGGCTATACCCTGGCAGGTGCCCTCAGGTGGCACTACCCCCAGGGCCCACTAACCACTGCCTGCTGCTCAGAGACGCCGAAGGAAACCCTCTGGCGGCGGCTGCAGATGTATGCCGAGTTCTCCTGAACCTTCCCAAGCAGCTGCTGCACCTGCCGGCAGTAGTTGGCCACCTTGCACTCCCGGAGGAACGACTTCAGCTGCGGAAGGGAGGGGTCAGCCACTGAAGCCCAGGACCGCTCCATGTGCACAGCTGGCCCAGGTCCTGTGCAAAACCACGCGTGGTGGCCACGGGGATACCCCAGGAGGGGACATGGATCCCATCTCAGGGCTCAAGTGCATAGCTGTTGC
AGCTGGGATGGCAGAGGCAGAATCAGCCCACCCTCTGGGCCAACCCTGCCCACTACTCACCTCTGGAAATAAAGTTTTATGCCAGGCGTGGTGGCTCTCGCCTGTAATCCCAGCACTTTGGGAGGCTGAGGCGGGTGGATCACCTGAGGTCAGGAGTTCAAGACCAGCCTGGCCAACGTGGCAAAACCCCGTCTCTACTAAAATACCAAAATTAGCTGGGTGTGGTGGTGGGCGCCTGTAAACCCAGCTCCTTGGGAGGCTGAGGCTGGAGAATCGCTTGAGCCCAGGAGATGGAGATTGCAGTGAGCCGAGATCGCACCATTGCACTCCAGCCTGGGCAACACAGCGAGACTCCATCTCAAACAATACTACTACTACTAATAAAATACAGTCTCGCTGACGTGCAGCCACACGTGTGCATTGCATGGTTCTGCAGTTGCCTGTGCAGCTGAACATCCGTAGGAAGCCACGTTTACCATTTGGCCCCTCACCAAAAACATTTTCTGACCCCTACCCCAGACCCCGACCCTGGGCCCTTGAGTCCAGAAGCAGAGATGCCCCAATGCCAGGTATCACCACCCAAGAGGACATGGGAGGAACAGAGGCTGTGGCCCCTGCTGTGAGTGCCCCCCAGAAAGGGGGTCCCGGCTCTGTGCATGTGACATGTGTGGCCGTGTGTGAGTACACACACATGCACACACCTCTATCTGGATAAGCCTCTGACCAATTGTGGCTCATGTGAGCAGATCCCTCCTCCCCACACTGCACAGACCTATAGTCGGCACATCTGATTCCAGCCACCAGGGCCAGACAGCAGGGCCCCCACCCCTTCCACTAGGCACAGGCCTCCCTGAGGCTGGAAACATCACGGCTGAGAGCAAACAGACCTCCCGTGGGGGCCCAGAAGGACCTTCTGAGGATAAGGAGAACCCCCTCCTCCACCCCACTCCTGCCTAAGATGAGGCTGACGTGGGGTATTTAGCGGGGCAGGCTGGGCCTTCCTATGAGGCTGATGTGGGGTATTTGGCGGGGCAGGCCAGGCCTTCCTGCCACCTGAGAAGCCACTCCACCCACTCCCCACACCCGGGATGGCCTGGGGAAGTGTGCGATCAGCGTACCAGCCTGAGCCAGGGTCGGCATGCTCAGTCCCAACCCCGAAGCAAAGATCAGCCTTGTGGTTCCCACCTGGGGAGGAGGCTGTTGTGCTCCCAGGGTCCTCAGCCCACTGCCCAGGCCTGCCCCCAAACCTCCTGAATGGCTTAGAACCCCTCATCAGCCCCTCCAAGGGGGCCTCACGGGGCGCGTTGCCAGCAGTCAGGTTCCACCCCAGTCCCAGGTACCCGGGACAAGGGCACCTCCTACCAGCCTGGGGCAGCCAAGCCCGTTATAAGACAGTCTGAGTCGGCCACGAGCCGGTGTGGGCAGGACACACACCTGCAGGACCACAGGCAGCACCAGCTCCGGGAAGCCGATGCAGTGTGCCTGGCTGTGCAGGTACTCCAGGGTGAGGTCGTACAGCTGCTCCACCAGGCCGTCCTGAAGAGCAGGAGAGAGGGCCGAGTGCATCAGGGAGAGGCTGGGGCTGGGCACTCAGGCCCCTTCCCCTCAGGCTGTCAGGGCAGCGCCATCTCCAGGGCACGGACTGCAAGGAAGGGGCTCCTGGGCCCCAGCCCTGGGAGACCATGAAGGTCCATGCTTGAACTTGGAGGATGCCAGCCCCCTCCCATCCACCTCAGCACCCCCAACCCCACCCTGGGAACTGCCCAGGCCCCTCCCCAGGAGGCCAGCCTCACCCGGTACGCCTTCTCCTGCAGGTTGACATTGGACAGCTTCAGGATCACGGAGAAGTTGATGGGCTTGGAGCTCATGCGCCCTGGCTTCCTGTTGAAGTCGACCTGCTGGAACATCTGCCCCAAGGGCCGTGTCAGGCTCTCTCGGCCCCATGCCTGGTCACCCTGGCTTCACCCTGGCTGCACCCTGGTCCCCCTGGTCCCTTTGGC
C
>chr1 pos:880527 gene_name:NOC2L strand:-
GTGCACCTGTGTGCATGTACGTGCGTCTACGTGTGTGCCTGTGCATGTGTGCGCCTGTGTGTACCTGTGTGTATGCATCTTTGCACGTGCACATGCCACTCAGGTAGGGAAAGATTGGAGTCCGAAGTTTCAACTTTCAGTGGGTGGGTTAGGCCCCACCCCGCTGTAAATTTAGGAATTCACGATCTCCACCCTGTTTATCTAATGAGTTCTCAGCCTCATGAAGGCCCAGAGTCGTGTCACAGCTGTCCTTGGGGCTGGGTCCCCAGGTTGCTGGGTCCAGAAGGTATGGAAGCCCCAGGCACGTTCTGATTCCCCTTCCACTGAGGCAGGGATGCTGAACATCTTTAGGAAGCCATGTTCACTCCCATGGCAGCCAGCAGTGGTCTCTGATTGCCCAGCCCTTGGCCTGGCCCCTGTGTCTGTGGGCCTCCAGCTCTGCTGCCCAGCTCCAGGCATGCTTTGTGTCTGTTTCCCTTGTCCAATCTCCTTGGCTACGTGCTTTCTTACTCTCTTGCAGTGTCTGTTTCTTCACTTGTGCACTGCCCTGGTTCACTGCAGCCGCACCCTGTAGGCCCCTCTCACGCAGGGATGCAGGCCTCTCCTCTCCGGAAAAAGCAAACCCTAAAAGCTAAAACAAAGCCCTCAGCTGTAGGCCGTGCCTGCCCTTCCCCGGTGCCTGGACAGGAAGCCAGTCGCCTGCCCATACTTTTGGCCCAGGCTAGAGAAGGGCAGTGTCCTCCCAGAGGTTCATCAGTACCAGGGCGTTTTCCCATCTGGACCTGAGCTCAGCTGTCTGGCAGCCACCCCTGCTGAGTGGGGTGTCTTGCTGGGGCCTCCACCCTTGGGCCCCCCATAATCTGCTTCTGTCCTCTGGTGCCCCAGCATGTACCCTGGATCTCTCTGGTTCACAGCCTGAGGGCTCCTAGTGGTTGGGGAGGGGTCACAAGACTGAGAGGCCAGGCTGACTCTTTCTCTGCTCCTCCTGGCATGTCCTACGGAGGTGCATGGCCTGTGGCTTCTGTGGAGGGTGTGGGAGGGGCCCCCCAGGCCTCCCGTGACCTCCATCTGTCCCGTCCTGTGTCTGGCACTCTTTGCTGTTGCTGCTGCGTCTTCTGGTTGCTCGGGACGGAGCCCCATGTGGCATTGCTGTGCTGAGGGCCAGGATGGGCCTCAGTGCCATGTTGTCAGGAATGGGGGCTGTCCTGGTACTCTGTGTGGCAGGGACCTCTAGGTCTCCAGACGTGGGTCCTTAGTGCTTCCCAGGATTTTGGGAGAGGGCCCGTGTTCCTGATCCTTCCCTGCTGATCAGAGCCCCACTCGGGGACACGCCAGGCTGTGTGGGGCCATGGGGCTGGGACCGTGCCTAGCTGCTTATCTCTTGTTTCGGGTTGGGTCTCCTCGTGCTGAAGCCTGAGGACCAGGGTGACCAGGGTGCAGCCAGGTGCAGGGCCAAAGGGACCAGGGGGACCAGGGTGCAGCCAGGGTGAAGCCAGGGTGACCAGGCATGGGGCCGAGAGAGCCTGACACGGCCCTTGGGGCAGATGTTCCAGCAGGTCGACTTCAACAGGAAGCCAGGGCGCATGAGCTCCAAGCCCATCAACTTCTCCGTGATCCTGAAGCTGTCCAATGTCAACCTGCAGGAGAAGGCGTACCGGGTGAGGCTGGCCTCCTGGGGAGGGGCCTGGGCAGTTCCCAGGGTGGGGTTGGGGGTGCTGAGGTGGATGGGAGGGGGCTGGCATCCTCCAAGTTCAAGCATGGACCTTCATGGTCTCCCAGGGCTGGGGCCCAGGAGCCCCTTCCTTGCAGTCCGTGCCCTGGAGATGGCGCTGCCCTGACAGCCTGAGGGGAAGGGGCCTGAGTGCCCAGCCCCAGCCTCTCCCTGATGCACTCGGCCCTCTCTCCTGCTCTTCAGGACGGCCTGGTGGAGCAGCTGTACGACCTCACCCTGGAGTACCTGCACAGCCAGGCACACTGCATCGGCTTCCCGGAGCTGGTGC
TGCCTGTGGTCCTGCAGGTGTGTGTCCTGCCCACACCGGCTCGTGGCCGACTCAGACTGTCTTATAACGGGCTTGGCTGCCCCAGGCTGGTAGGAGGTGCCCTTGTCCCGGGTACCTGGGACTGGGGTGGAACCTGACTGCTGGCAACGCGCCCCGTGAGGCCCCCTTGGAGGGGCTGATGAGGGGTTCTAAGCCATTCAGGAGGTTTGGGGGCAGGCCTGGGCAGTGGGCTGAGGACCCTGGGAGCACAACAGCCTCCTCCCCAGGTGGGAACCACAAGGCTGATCTTTGCTTCGGGGTTGGGACTGAGCATGCCGACCCTGGCTCAGGCTGGTACGCTGATCGCACACTTCCCCAGGCCATCCCGGGTGTGGGGAGTGGGTGGAGTGGCTTCTCAGGTGGCAGGAAGGCCTGGCCTGCCCCGCCAAATACCCCACATCAGCCTCATAGGAAGGCCCAGCCTGCCCCGCTAAATACCCCACGTCAGCCTCATCTTAGGCAGGAGTGGGGTGGAGGAGGGGGTTCTCCTTATCCTCAGAAGGTCCTTCTGGGCCCCCACGGGAGGTCTGTTTGCTCTCAGCCGTGATGTTTCCAGCCTCAGGGAGGCCTGTGCCTAGTGGAAGGGGTGGGGGCCCTGCTGTCTGGCCCTGGTGGCTGGAATCAGATGTGCCGACTATAGGTCTGTGCAGTGTGGGGAGGAGGGATCTGCTCACATGAGCCACAATTGGTCAGAGGCTTATCCAGATAGAGGTGTGTGCATGTGTGTGTACTCACACACGGCCACACATGTCACATGCACAGAGCCGGGACCCCCTTTCTGGGGGGCACTCACAGCAGGGGCCACAGCCTCTGTTCCTCCCATGTCCTCTTGGGTGGTGATACCTGGCATTGGGGCATCTCTGCTTCTGGACTCAAGGGCCCAGGGTCGGGGTCTGGGGTAGGGGTCAGAAAATGTTTTTGGTGAGGGGCCAAATGGTAAACGTGGCTTCCTACGGATGTTCAGCTGCACAGGCAACTGCAGAACCATGCAATGCACACGTGTGGCTGCACGTCAGCGAGACTGTATTTTATTAGTAGTAGTAGTATTGTTTGAGATGGAGTCTCGCTGTGTTGCCCAGGCTGGAGTGCAATGGTGCGATCTCGGCTCACTGCAATCTCCATCTCCTGGGCTCAAGCGATTCTCCAGCCTCAGCCTCCCAAGGAGCTGGGTTTACAGGCGCCCACCACCACACCCAGCTAATTTTGGTATTTTAGTAGAGACGGGGTTTTGCCACGTTGGCCAGGCTGGTCTTGAACTCCTGACCTCAGGTGATCCACCCGCCTCAGCCTCCCAAAGTGCTGGGATTACAGGCGAGAGCCACCACGCCTGGCATAAAACTTTATTTCCAGAGGTGAGTAGTGGGCAGGGTTGGCCCAGAGGGTGGGCTGATTCTGCCTCTGCCATCCCAGCTGCAACAGCTATGCACTTGAGCCCTGAGATGGGATCCATGTCCCCTCCTGGGGTATCCCCGTGGCCACCACGCGTGGTTTTGCACAGGACCTGGGCCAGCTGTGCACATGGAGCGGTCCTGGGCTTCAGTGGCTGACCCCTCCCTTCCGCAGCTGAAGTCGTTCCTCCGGGAGTGCAAGGTGGCCAACTACTGCCGGCAGGTGCAGCAGCTGCTTGGGAAGGTTCAGGAGAACTCGGCATACATCTGCAGCCGCCGCCAGAGGGTTTCCTTCGGCGTCTCTGAGCAGCAGGCAGTGGTTAGTGGGCCCTGGGGGTAGTGCCACCTGAGGGCACCTGCCAGGGTATAGCCCCAGCTACATGTGGGGGTTTGCCCAGGGTGAGGCATGACCCTGAACTCCCCCAACCCCCCAGGAAGCCTGGGAGAAGCTGACCCGGGAAGAGGGGACACCCCTGACCTTGTACTACAGCCACTGGCGCAAGCTGCGTGACCGGGAGATCCAGCTGGAGATCAGTGGCAAAGAGCGGGTGCGGCTCGGCGAGGGGACCTGGG
GGTGTGTTGTGACTTCCTGGGTTTCAGATCTAGCGCACTATGACTTGAGACCAGGGCGAGGGTTTGGAAACAGTGCCAGGCGGCCAGGGCCGTGCCCGGATGATTCGACTTGGAGAGGGGGTAGGTGTTGGAGAACTGGCCAGAACCAGGCGTTTCCAGGGAGGGGAAGCCCCAGGCTGCACTAGGTTGGGGAGGCCATGCCCCCTCAGGCCTGATGGGCTGGAGGCTCCGGGCAGGTGGAGTGGCTGGACTGACCTCGTCACCCAGGCCAGTATGTGGGCACCAGGGGCCCGTGAGGAGAAGCAGGAAGGGCTCTGCCTTTGACCTTGGACATGGGATGGACAACTTGGAGGATGGCTTTGTGATTTGGGAACAGAGGGGACTAGAAATTGGCCACATGGGGCCCTGGTGGTGGGTCTGGCGATGCCTGGCCCTGCTGTGGCCGCCAGCCCCTGCCCTCTCTCACCTGAGCCCCTGGTTCTTTGGCCTTCCAGCTGGAAGACCTGAACTTCCCTGAGATCAAACGAAGGAAGATGGCTGACAGGAAGGATGAGGACAGGAAGCAATTTAAAGACCTCTTTGACCTGAACAGCTCTGAAGAGGACGACACCGAGGGATTCTCGGAGAGAGGTGGGGCCTGCGTGGTGCTCCCAGGGGAAGGGTGGGCCTGGAGGGCTCTGCTGGACTTCCCAGAGCCACGAGGGCCACCTGTACCCATCCTGCAGGGGGCTCACCAGTCTCTGGCCCAGCTGGGGCCAACCTCAGTGTTGCCAGGCTTCTGGTGCCAGCGCCTTCCCTCCTTGAAGTGAAGGCCTACTGGGATTGGTAACTCTGTCCCCAGGCCTGTGACCTCCCAGTTCCTCCCCAGGGCTCCTCTCCACCTGCTGGAAGTCAGCGGAGGGAAGGGTGTTGGGAGCCTGGCCACCCTCCTGCCCCCACTGTGACTTTGCTGGTGGACCCTGTGGGTGGGAGTCATATGGACTCTGCTTCTTGTTCCTCAAGGATACTGAGGCCCCTGAGCACTCGGCATGGGGTGGAAGACGATGAAGAGGACGAGGAGGAGGGCGAGGAGGACAGCAGCAACTCGGAGGGTGAATGGTCTTGGGGTGAGAGGGTGTGGCCCTGTGAGCCCATCTGGCGGGAGGGCAGAGCCACGTGGGCGGGGGGCGTGGGGCTCTGGGCCAGGCTTTTCCCTCCCTGGGAAGGCCAGGCCAAATGCTCTGTTCTCTGGCAGCCAGCAACAGGGATAAATTAATTAGTGCCGTGATTAATTAGTGATGAGTAACCTCTAAGGCTGGCTTCTTCCTGATAAAGCAAAATTTATGTAGCCTCCATCTCTCCCCGCAGATGGAGACCCAGACGCAGAGGCGGGGCTGGCCCCTGGGGAGCTGCAGCAGCTGGCCCAGGGGCCGGAGGACGAGCTGGAGGATCTGCAGCTCTCAGAGGACGACTGAGGCAGCCCATCTGGGGGGCCTGTAGGGGCTGCCGGGCTGGTGGCCAGTGTTTCCACCTCCCTGGCAGTCAGGCCTAGAGGCTGGCGTCTGTGCAGTTGGGGGAGGCAGTAGACACGGGACAGGCTTTATTATTTATTTTTCAGCATGAAAGACCAAACGTATCGAGAGCTGGGCTGGGCTGGGCTGGTGTGGCTGCTGAAGCCCCACAGCTGTGGGCTGCTGAAGTCAGCTCCGCGGGGGAGCTGACCCTGACGTCAGCAGACCGAGACCAGTCCCAGTTCCAGGGGGAGGCCTGCAGGCCCCTGGCCCCTTCCACCACCTCTGCCCTCCGTCTGCAGACCTCGTCCATCTGCACCAGGCTCTGCCTTCACTCCCCCAAGTCTTTGAAAATTTGTTCCTTTCCTTTGAAGTCACATTTTCTTTTAAAATTTTTTGTTTTGCATCCGAAACCGAAAGAAATAAAGCGGTGGGAGGCAGGGCCATTGTGTTGAGTGGTGGCTCCTGGAGATTTGTGTGGCCCCACCCCTACCCCCGGCAACCTCAACA
CAGAGGCTGGGAAGGGTCGGGGGCCCCTGGAAGTAGAGCCAAGGTCCCATTCTCCTGCTTGGGTGAAGTTTGACCGGCAAGGGCGTGGCCCCCTCCATAGGGGGACGTGGCCGTCGTGGGGGACAAGGGCTGCTCTCCTGTGCCGAGTTCTCGCTCCGGGGCCCGCAGGGTTGGTGGCTGCAGTGGCAGAGCCACGGGGAAGCTGGCCACGTAGAAAACTCGGCCCAGGCGCCTGGCCACCTGCAGAGAGATGGCCTGGTGGGGGCTGTCGCACACCCACCCAGAGCTTTCTGCCCAGCTGTGGTCTGGAGACCCTGACCTCACTCCCCAGCGTCTCACCTGGGCCCGGATCTTGAGGGCGGGCCCCAGCTTCAGCCCCATGTTGGTCAGCAGGTGCTCCTCCGTCAGCAGTGGCAGGGTCTCCCCGTCGATCCCCTGCTCCCTGAAGACCTGGAAGGGGGAGGGAGGGAGGGTCAGGACCCGGGTGCTGCTCCATCCTCCAGCCCAGCAGAGCCAAGAGGAGCTGTTTGTGCAGCTCACGGAAATTTTTTAAAAACAGTGATCTGTTGGGTTTGAGTGTCAGGCAGAAAACACCTGGCAAAAAAAAAAAAAAAAAAAAAAGGTGGGGTGCACAGTCCAGTCCCTGCTGCGCACTGCTTTTGGGGAGGGAGAGGGCGTTGGGGAATGGCCCTACCCCACCATCCTTTCCAGGGAGGTAGTGACTGCCAGCCAGCTCCAGCCCCGCCCCAGGAACTGGGGCCCCCCCTTACCCGAGTGTACTCTCCACAGCCAGACAGGCCCCCCACGAAGCTGCAGACGTCATCCACGGTCCACTTGGTGACGTCCTCAGGGGCTGGGGCCTCCTCCCCATCCATGGAGAGTCCCCCTACCGCGCCTGAGTTGTGGGGCGTGTTACTAGGGCTCTGGCTGGCTTTCCCATACCCGCCCGTCTCCTGACCGTCGTGTGCCCCAATTGCTGAGGTGGGGGATGGGTGCCTAAGGGGAATTTTTCCTCCCCCCAAGACCCTTCCACAGGCGGCCTGCCAGTCCTGTGCCCTCTGGAAGGATCTAGAGTGTGGGGGTGCCCACCTGTGTGGAAGTAGGGGCTGACGGCATAAGGGAAGCCCAGGGGCAGTGTGGACCCTGGGAAAAGCCCCTTCCCCTCGGCGCCGGCCCCTCCAGCTGGAGCTTGGCCCGGAGTGGGCCCCCTGCACCCAACAGCTGCCGTCTCGGGGTCCTCTCCGTCCGAGTCTTTGGGGGGCTCGTCTTCCGAGCCATCTTGTGCCCAGAGCCTAGCCCCCGTCATCTCCTTGGACTCGCTGGGCCGCGCTGAGGCAGGGCCGGGACCCCCCTTCCGGGGGGCTCGCCGGGCAGAGTCCCGGGACGGGGTGGGGGGTCCGGAGCCCGGGGGCCCCTGGGGGGGCAGGGCCAGCAGTGGCGCCGCGCCGTGGTTCAGCACCAGCAGGGCCCCGCGCCGCTGCAGCTCCTCGGCGCCGTCGTTGGGGCGCAGGGCGGTCTCGGGCGCCAGCAGCTGTGGGCGCGCGCTCTCCAGCTCCTTCTGCCGCAGGAGGTCGGCGGGCAGCTCCAGCCTGGGCCGGGGGCAGACGCGGGTCAGCCGCCTCCCGGGCCGCGCGGCCCCGCCCGCCTCCCCGCACCTACCGGGCCAGGTTCTGCTTCCGCAGGAGCTCCTGCTGCCAGGCGAACATCTCCGCCTGCGCGGGGGGCAGGAAGCCGTAGCCTGGTGGGGAGGGGGACAACGCGGGGTCGGGGGGTCCGAGGCGGTCCGAGCCGGCCCTTTGCAATCCCCTACCCGGTCCGTCTCGGCTCCGACAGGGGCAGGGCCGGGTCCGCGAGCCTCCACGCGGGCTCCCAGGGGATGCGCACCCGCCTCCTCACCTGGGGTCTGGCACAGAGCCGAGGGCACCCCCAGGAAGGGGGGCCTGAGATGGGGGCCCAGGGCGACGTGAGGGGCATTCTGCGGCGACAGCAAGGGGGGCGGC
TGAGGCAGCTCCCTGTGGACGGCAGCGCAATGAGCGGCGTCCCCCCCGCCCCCGTTCCCGTAAAGGCCCCGGCGCCCCTCCCCCGCCGCCTCCGCTAATTGCCGGCCCCGCGGGCGGTCGATGCGCAGCTCGTTACGGCCCCAGGTCGCGCGCCATCCCCGCCCCCGCCGGCGGCGCCAATTAATCTCGGCGGCGGCGCGGAGGCGCTGACCCGGCGGGCGGCGACGGCTGCAGCGGTAATTACTGCGCGGGGCGCCGGCCCCGCCTGCTCCCCTCCCCTGCGGACCCGAGCGGCGGGGGAAGCGGGGGCGTCATTAGCCGCAATCCGGGCGGCGGTGGAGGCAGCTGCAGACGCCGGGCTCAGCGGCCGGGCAGGGACTCGGGCCTCAGGCGGCCCCTCCCTTCTCCAGGCGCCCCGCCCCGCCTCGTGTCCTGGACCCGAGAAACCCGGGTCTGTGCCGGAGGCCTTGTCTGTGCCGGGAACAGATCCGGCGCCGGCGTCTACATTTGAGCGCCAGCCAGGACCCCCCCTCCGCCTCCAGCTCTGAGTCCTCTGGCCGACCCCTCCCGACTGACCCCGGGGTTCCAGACTTGGAGCTCTCCCCTTCTCTGGCTTCTCCTCCAGGGAACCCCCTGATGGGCCTCAGCCTCCCTCACGCACCCCAGTGGCGCCCTCAACACTGCAGCCCAGATCGGTGTCACCCCCGAAACGCTCTGCCCGGACCCGAACCTAGAGCTGAACACAGTGCTTGGCACTCTCGGGGCGCGTGCTGGTGGCTGAGAGCGCGGGGGTGCGGGCGGCACCCCAGAAACAAGCAGAGAACGGCAGCCACCCCAGTACCTCTCCGAGAAGGACGGGGCGGCAGCTGGGGCGGCGCCCTCCCGGTGCTGAGCCAGGCCCTGCTTCCGACGCTGACCTGCCGTGGAGGAGGGCAGGTGGGCTTCCAGGCCACTGGGGCCCCTCAGAGCTGCAGCCGCCACCTCCTGCCGGACCCTCAGGAGATCTGGGTGGGGGACAACAGCAGTGATCCCCCCGGCTATGCGCCCTCATCCTCCCCAGCCCGCGTCCTGGCGTAGAACACTGAGGTGGGGAACTGACCACACCACTCGGCCGTGCTCGGCCCCTGGGCTGGTCTCGGGCACGGGGCAGGAGGATGGGCGGCCACAGGGCCGAGACGGCAGACAAGATGCTGGGCGAGGAAGTGTCGGGAAAAGCTCTTCCCCGAAGGGGAGCCGGGTGCCTGGAGGCTTCCGGCCTGAGAGGCGGCTGAGCCTTGCGGGGAGGGGGCTGTGCAGACGTCCCGAGGCCACAGAGCCTGCGGGGCAGCGAGAGAGTCCAGAAGTGCCAGCCAGGGGCTGACCGCCCAGGGGACCCCCTCTTCGGTGGGAGGCAAGAAAATCTGAGGTCCCCATAGTAGACGCGGCTGGCCAAAGCTTGTCACAAAGGCAAGGCAGCGCCCGCGCACGCACGCCACACTAGCACAGGGGTCGCCCAGTCCCGGCTGCGCAGGGTCACTGTCCCGCGTGGCCACACTCGCCCCAATGCGCACGCAGCGCCCGCGTCTAAGGCGCCGCGTCCCAGGTCTCTCGGCCTCTCCCTCCCGCCCGGGCCTCCCGAGTCCCCGCAGCCCCCACTATGCAGGCTCCGCCAGCTCCACCGCCGCCTCTGTTTATAAATTAATTACCCGAAAGCGGAGGTGGGTCCCGCCCCGGGCCGGGTGCCAGCCCGCAGAGCCCAGCCGGCGCGGCCGCCCGCATCGATCCCAGCGGGGCCCAGCGCCCCCGCGCTAGCGGCCGGGTTAGTTACAGGGTTATTTACGGCCGGCTCCGCGGCGGGGCGGGGGGGGGGGGCGGCGGCGCGGCCTGATTGACAGCGCGCTCCCCGGCGGCCGGCGCCCCTCCCCCGCGCGGCCAGCAGAGCGGCCCCAGGCAGAGCAGGGAAAGCAATTAAAAAGGAGGATTTTTACATTATTAACATTTGGTGAATTATTCAGGCTCTT
G
>chr1 pos:880536 gene_name:NOC2L strand:-
TGTGCGTGTGTGCACCTGTGTGCATGTACGTGCGTCTACGTGTGTGCCTGTGCATGTGTGCGCCTGTGTGTACCTGTGTGTATGCATCTTTGCACGTGCACATGCCACTCAGGTAGGGAAAGATTGGAGTCCGAAGTTTCAACTTTCAGTGGGTGGGTTAGGCCCCACCCCGCTGTAAATTTAGGAATTCACGATCTCCACCCTGTTTATCTAATGAGTTCTCAGCCTCATGAAGGCCCAGAGTCGTGTCACAGCTGTCCTTGGGGCTGGGTCCCCAGGTTGCTGGGTCCAGAAGGTATGGAAGCCCCAGGCACGTTCTGATTCCCCTTCCACTGAGGCAGGGATGCTGAACATCTTTAGGAAGCCATGTTCACTCCCATGGCAGCCAGCAGTGGTCTCTGATTGCCCAGCCCTTGGCCTGGCCCCTGTGTCTGTGGGCCTCCAGCTCTGCTGCCCAGCTCCAGGCATGCTTTGTGTCTGTTTCCCTTGTCCAATCTCCTTGGCTACGTGCTTTCTTACTCTCTTGCAGTGTCTGTTTCTTCACTTGTGCACTGCCCTGGTTCACTGCAGCCGCACCCTGTAGGCCCCTCTCACGCAGGGATGCAGGCCTCTCCTCTCCGGAAAAAGCAAACCCTAAAAGCTAAAACAAAGCCCTCAGCTGTAGGCCGTGCCTGCCCTTCCCCGGTGCCTGGACAGGAAGCCAGTCGCCTGCCCATACTTTTGGCCCAGGCTAGAGAAGGGCAGTGTCCTCCCAGAGGTTCATCAGTACCAGGGCGTTTTCCCATCTGGACCTGAGCTCAGCTGTCTGGCAGCCACCCCTGCTGAGTGGGGTGTCTTGCTGGGGCCTCCACCCTTGGGCCCCCCATAATCTGCTTCTGTCCTCTGGTGCCCCAGCATGTACCCTGGATCTCTCTGGTTCACAGCCTGAGGGCTCCTAGTGGTTGGGGAGGGGTCACAAGACTGAGAGGCCAGGCTGACTCTTTCTCTGCTCCTCCTGGCATGTCCTACGGAGGTGCATGGCCTGTGGCTTCTGTGGAGGGTGTGGGAGGGGCCCCCCAGGCCTCCCGTGACCTCCATCTGTCCCGTCCTGTGTCTGGCACTCTTTGCTGTTGCTGCTGCGTCTTCTGGTTGCTCGGGACGGAGCCCCATGTGGCATTGCTGTGCTGAGGGCCAGGATGGGCCTCAGTGCCATGTTGTCAGGAATGGGGGCTGTCCTGGTACTCTGTGTGGCAGGGACCTCTAGGTCTCCAGACGTGGGTCCTTAGTGCTTCCCAGGATTTTGGGAGAGGGCCCGTGTTCCTGATCCTTCCCTGCTGATCAGAGCCCCACTCGGGGACACGCCAGGCTGTGTGGGGCCATGGGGCTGGGACCGTGCCTAGCTGCTTATCTCTTGTTTCGGGTTGGGTCTCCTCGTGCTGAAGCCTGAGGACCAGGGTGACCAGGGTGCAGCCAGGTGCAGGGCCAAAGGGACCAGGGGGACCAGGGTGCAGCCAGGGTGAAGCCAGGGTGACCAGGCATGGGGCCGAGAGAGCCTGACACGGCCCTTGGGGCAGATGTTCCAGCAGGTCGACTTCAACAGGAAGCCAGGGCGCATGAGCTCCAAGCCCATCAACTTCTCCGTGATCCTGAAGCTGTCCAATGTCAACCTGCAGGAGAAGGCGTACCGGGTGAGGCTGGCCTCCTGGGGAGGGGCCTGGGCAGTTCCCAGGGTGGGGTTGGGGGTGCTGAGGTGGATGGGAGGGGGCTGGCATCCTCCAAGTTCAAGCATGGACCTTCATGGTCTCCCAGGGCTGGGGCCCAGGAGCCCCTTCCTTGCAGTCCGTGCCCTGGAGATGGCGCTGCCCTGACAGCCTGAGGGGAAGGGGCCTGAGTGCCCAGCCCCAGCCTCTCCCTGATGCACTCGGCCCTCTCTCCTGCTCTTCAGGACGGCCTGGTGGAGCAGCTGTACGACCTCACCCTGGAGTACCTGCACAGCCAGGCACACTGCATCGGCTTCCCGG
AGCTGGTGCTGCCTGTGGTCCTGCAGGTGTGTGTCCTGCCCACACCGGCTCGTGGCCGACTCAGACTGTCTTATAACGGGCTTGGCTGCCCCAGGCTGGTAGGAGGTGCCCTTGTCCCGGGTACCTGGGACTGGGGTGGAACCTGACTGCTGGCAACGCGCCCCGTGAGGCCCCCTTGGAGGGGCTGATGAGGGGTTCTAAGCCATTCAGGAGGTTTGGGGGCAGGCCTGGGCAGTGGGCTGAGGACCCTGGGAGCACAACAGCCTCCTCCCCAGGTGGGAACCACAAGGCTGATCTTTGCTTCGGGGTTGGGACTGAGCATGCCGACCCTGGCTCAGGCTGGTACGCTGATCGCACACTTCCCCAGGCCATCCCGGGTGTGGGGAGTGGGTGGAGTGGCTTCTCAGGTGGCAGGAAGGCCTGGCCTGCCCCGCCAAATACCCCACATCAGCCTCATAGGAAGGCCCAGCCTGCCCCGCTAAATACCCCACGTCAGCCTCATCTTAGGCAGGAGTGGGGTGGAGGAGGGGGTTCTCCTTATCCTCAGAAGGTCCTTCTGGGCCCCCACGGGAGGTCTGTTTGCTCTCAGCCGTGATGTTTCCAGCCTCAGGGAGGCCTGTGCCTAGTGGAAGGGGTGGGGGCCCTGCTGTCTGGCCCTGGTGGCTGGAATCAGATGTGCCGACTATAGGTCTGTGCAGTGTGGGGAGGAGGGATCTGCTCACATGAGCCACAATTGGTCAGAGGCTTATCCAGATAGAGGTGTGTGCATGTGTGTGTACTCACACACGGCCACACATGTCACATGCACAGAGCCGGGACCCCCTTTCTGGGGGGCACTCACAGCAGGGGCCACAGCCTCTGTTCCTCCCATGTCCTCTTGGGTGGTGATACCTGGCATTGGGGCATCTCTGCTTCTGGACTCAAGGGCCCAGGGTCGGGGTCTGGGGTAGGGGTCAGAAAATGTTTTTGGTGAGGGGCCAAATGGTAAACGTGGCTTCCTACGGATGTTCAGCTGCACAGGCAACTGCAGAACCATGCAATGCACACGTGTGGCTGCACGTCAGCGAGACTGTATTTTATTAGTAGTAGTAGTATTGTTTGAGATGGAGTCTCGCTGTGTTGCCCAGGCTGGAGTGCAATGGTGCGATCTCGGCTCACTGCAATCTCCATCTCCTGGGCTCAAGCGATTCTCCAGCCTCAGCCTCCCAAGGAGCTGGGTTTACAGGCGCCCACCACCACACCCAGCTAATTTTGGTATTTTAGTAGAGACGGGGTTTTGCCACGTTGGCCAGGCTGGTCTTGAACTCCTGACCTCAGGTGATCCACCCGCCTCAGCCTCCCAAAGTGCTGGGATTACAGGCGAGAGCCACCACGCCTGGCATAAAACTTTATTTCCAGAGGTGAGTAGTGGGCAGGGTTGGCCCAGAGGGTGGGCTGATTCTGCCTCTGCCATCCCAGCTGCAACAGCTATGCACTTGAGCCCTGAGATGGGATCCATGTCCCCTCCTGGGGTATCCCCGTGGCCACCACGCGTGGTTTTGCACAGGACCTGGGCCAGCTGTGCACATGGAGCGGTCCTGGGCTTCAGTGGCTGACCCCTCCCTTCCGCAGCTGAAGTCGTTCCTCCGGGAGTGCAAGGTGGCCAACTACTGCCGGCAGGTGCAGCAGCTGCTTGGGAAGGTTCAGGAGAACTCGGCATACATCTGCAGCCGCCGCCAGAGGGTTTCCTTCGGCGTCTCTGAGCAGCAGGCAGTGGTTAGTGGGCCCTGGGGGTAGTGCCACCTGAGGGCACCTGCCAGGGTATAGCCCCAGCTACATGTGGGGGTTTGCCCAGGGTGAGGCATGACCCTGAACTCCCCCAACCCCCCAGGAAGCCTGGGAGAAGCTGACCCGGGAAGAGGGGACACCCCTGACCTTGTACTACAGCCACTGGCGCAAGCTGCGTGACCGGGAGATCCAGCTGGAGATCAGTGGCAAAGAGCGGGTGCGGCTCGGCGAGG
GGACCTGGGGGTGTGTTGTGACTTCCTGGGTTTCAGATCTAGCGCACTATGACTTGAGACCAGGGCGAGGGTTTGGAAACAGTGCCAGGCGGCCAGGGCCGTGCCCGGATGATTCGACTTGGAGAGGGGGTAGGTGTTGGAGAACTGGCCAGAACCAGGCGTTTCCAGGGAGGGGAAGCCCCAGGCTGCACTAGGTTGGGGAGGCCATGCCCCCTCAGGCCTGATGGGCTGGAGGCTCCGGGCAGGTGGAGTGGCTGGACTGACCTCGTCACCCAGGCCAGTATGTGGGCACCAGGGGCCCGTGAGGAGAAGCAGGAAGGGCTCTGCCTTTGACCTTGGACATGGGATGGACAACTTGGAGGATGGCTTTGTGATTTGGGAACAGAGGGGACTAGAAATTGGCCACATGGGGCCCTGGTGGTGGGTCTGGCGATGCCTGGCCCTGCTGTGGCCGCCAGCCCCTGCCCTCTCTCACCTGAGCCCCTGGTTCTTTGGCCTTCCAGCTGGAAGACCTGAACTTCCCTGAGATCAAACGAAGGAAGATGGCTGACAGGAAGGATGAGGACAGGAAGCAATTTAAAGACCTCTTTGACCTGAACAGCTCTGAAGAGGACGACACCGAGGGATTCTCGGAGAGAGGTGGGGCCTGCGTGGTGCTCCCAGGGGAAGGGTGGGCCTGGAGGGCTCTGCTGGACTTCCCAGAGCCACGAGGGCCACCTGTACCCATCCTGCAGGGGGCTCACCAGTCTCTGGCCCAGCTGGGGCCAACCTCAGTGTTGCCAGGCTTCTGGTGCCAGCGCCTTCCCTCCTTGAAGTGAAGGCCTACTGGGATTGGTAACTCTGTCCCCAGGCCTGTGACCTCCCAGTTCCTCCCCAGGGCTCCTCTCCACCTGCTGGAAGTCAGCGGAGGGAAGGGTGTTGGGAGCCTGGCCACCCTCCTGCCCCCACTGTGACTTTGCTGGTGGACCCTGTGGGTGGGAGTCATATGGACTCTGCTTCTAGTTCCTCAGGGATACTGAGGCCCCTGAGCACTCGGCATGGGGTGGAAGACGATGAAGAGGACGAGGAGGAGGGCGAGGAGGACAGCAGCAACTCGGAGGGTGAATGGTCTTGGGGTGAGAGGGTGTGGCCCTGTGAGCCCATCTGGCGGGAGGGCAGAGCCACGTGGGCGGGGGGCGTGGGGCTCTGGGCCAGGCTTTTCCCTCCCTGGGAAGGCCAGGCCAAATGCTCTGTTCTCTGGCAGCCAGCAACAGGGATAAATTAATTAGTGCCGTGATTAATTAGTGATGAGTAACCTCTAAGGCTGGCTTCTTCCTGATAAAGCAAAATTTATGTAGCCTCCATCTCTCCCCGCAGATGGAGACCCAGACGCAGAGGCGGGGCTGGCCCCTGGGGAGCTGCAGCAGCTGGCCCAGGGGCCGGAGGACGAGCTGGAGGATCTGCAGCTCTCAGAGGACGACTGAGGCAGCCCATCTGGGGGGCCTGTAGGGGCTGCCGGGCTGGTGGCCAGTGTTTCCACCTCCCTGGCAGTCAGGCCTAGAGGCTGGCGTCTGTGCAGTTGGGGGAGGCAGTAGACACGGGACAGGCTTTATTATTTATTTTTCAGCATGAAAGACCAAACGTATCGAGAGCTGGGCTGGGCTGGGCTGGTGTGGCTGCTGAAGCCCCACAGCTGTGGGCTGCTGAAGTCAGCTCCGCGGGGGAGCTGACCCTGACGTCAGCAGACCGAGACCAGTCCCAGTTCCAGGGGGAGGCCTGCAGGCCCCTGGCCCCTTCCACCACCTCTGCCCTCCGTCTGCAGACCTCGTCCATCTGCACCAGGCTCTGCCTTCACTCCCCCAAGTCTTTGAAAATTTGTTCCTTTCCTTTGAAGTCACATTTTCTTTTAAAATTTTTTGTTTTGCATCCGAAACCGAAAGAAATAAAGCGGTGGGAGGCAGGGCCATTGTGTTGAGTGGTGGCTCCTGGAGATTTGTGTGGCCCCACCCCTACCCCCGGCA
ACCTCAACACAGAGGCTGGGAAGGGTCGGGGGCCCCTGGAAGTAGAGCCAAGGTCCCATTCTCCTGCTTGGGTGAAGTTTGACCGGCAAGGGCGTGGCCCCCTCCATAGGGGGACGTGGCCGTCGTGGGGGACAAGGGCTGCTCTCCTGTGCCGAGTTCTCGCTCCGGGGCCCGCAGGGTTGGTGGCTGCAGTGGCAGAGCCACGGGGAAGCTGGCCACGTAGAAAACTCGGCCCAGGCGCCTGGCCACCTGCAGAGAGATGGCCTGGTGGGGGCTGTCGCACACCCACCCAGAGCTTTCTGCCCAGCTGTGGTCTGGAGACCCTGACCTCACTCCCCAGCGTCTCACCTGGGCCCGGATCTTGAGGGCGGGCCCCAGCTTCAGCCCCATGTTGGTCAGCAGGTGCTCCTCCGTCAGCAGTGGCAGGGTCTCCCCGTCGATCCCCTGCTCCCTGAAGACCTGGAAGGGGGAGGGAGGGAGGGTCAGGACCCGGGTGCTGCTCCATCCTCCAGCCCAGCAGAGCCAAGAGGAGCTGTTTGTGCAGCTCACGGAAATTTTTTAAAAACAGTGATCTGTTGGGTTTGAGTGTCAGGCAGAAAACACCTGGCAAAAAAAAAAAAAAAAAAAAAAGGTGGGGTGCACAGTCCAGTCCCTGCTGCGCACTGCTTTTGGGGAGGGAGAGGGCGTTGGGGAATGGCCCTACCCCACCATCCTTTCCAGGGAGGTAGTGACTGCCAGCCAGCTCCAGCCCCGCCCCAGGAACTGGGGCCCCCCCTTACCCGAGTGTACTCTCCACAGCCAGACAGGCCCCCCACGAAGCTGCAGACGTCATCCACGGTCCACTTGGTGACGTCCTCAGGGGCTGGGGCCTCCTCCCCATCCATGGAGAGTCCCCCTACCGCGCCTGAGTTGTGGGGCGTGTTACTAGGGCTCTGGCTGGCTTTCCCATACCCGCCCGTCTCCTGACCGTCGTGTGCCCCAATTGCTGAGGTGGGGGATGGGTGCCTAAGGGGAATTTTTCCTCCCCCCAAGACCCTTCCACAGGCGGCCTGCCAGTCCTGTGCCCTCTGGAAGGATCTAGAGTGTGGGGGTGCCCACCTGTGTGGAAGTAGGGGCTGACGGCATAAGGGAAGCCCAGGGGCAGTGTGGACCCTGGGAAAAGCCCCTTCCCCTCGGCGCCGGCCCCTCCAGCTGGAGCTTGGCCCGGAGTGGGCCCCCTGCACCCAACAGCTGCCGTCTCGGGGTCCTCTCCGTCCGAGTCTTTGGGGGGCTCGTCTTCCGAGCCATCTTGTGCCCAGAGCCTAGCCCCCGTCATCTCCTTGGACTCGCTGGGCCGCGCTGAGGCAGGGCCGGGACCCCCCTTCCGGGGGGCTCGCCGGGCAGAGTCCCGGGACGGGGTGGGGGGTCCGGAGCCCGGGGGCCCCTGGGGGGGCAGGGCCAGCAGTGGCGCCGCGCCGTGGTTCAGCACCAGCAGGGCCCCGCGCCGCTGCAGCTCCTCGGCGCCGTCGTTGGGGCGCAGGGCGGTCTCGGGCGCCAGCAGCTGTGGGCGCGCGCTCTCCAGCTCCTTCTGCCGCAGGAGGTCGGCGGGCAGCTCCAGCCTGGGCCGGGGGCAGACGCGGGTCAGCCGCCTCCCGGGCCGCGCGGCCCCGCCCGCCTCCCCGCACCTACCGGGCCAGGTTCTGCTTCCGCAGGAGCTCCTGCTGCCAGGCGAACATCTCCGCCTGCGCGGGGGGCAGGAAGCCGTAGCCTGGTGGGGAGGGGGACAACGCGGGGTCGGGGGGTCCGAGGCGGTCCGAGCCGGCCCTTTGCAATCCCCTACCCGGTCCGTCTCGGCTCCGACAGGGGCAGGGCCGGGTCCGCGAGCCTCCACGCGGGCTCCCAGGGGATGCGCACCCGCCTCCTCACCTGGGGTCTGGCACAGAGCCGAGGGCACCCCCAGGAAGGGGGGCCTGAGATGGGGGCCCAGGGCGACGTGAGGGGCATTCTGCGGCGACAGCAAG
GGGGGCGGCTGAGGCAGCTCCCTGTGGACGGCAGCGCAATGAGCGGCGTCCCCCCCGCCCCCGTTCCCGTAAAGGCCCCGGCGCCCCTCCCCCGCCGCCTCCGCTAATTGCCGGCCCCGCGGGCGGTCGATGCGCAGCTCGTTACGGCCCCAGGTCGCGCGCCATCCCCGCCCCCGCCGGCGGCGCCAATTAATCTCGGCGGCGGCGCGGAGGCGCTGACCCGGCGGGCGGCGACGGCTGCAGCGGTAATTACTGCGCGGGGCGCCGGCCCCGCCTGCTCCCCTCCCCTGCGGACCCGAGCGGCGGGGGAAGCGGGGGCGTCATTAGCCGCAATCCGGGCGGCGGTGGAGGCAGCTGCAGACGCCGGGCTCAGCGGCCGGGCAGGGACTCGGGCCTCAGGCGGCCCCTCCCTTCTCCAGGCGCCCCGCCCCGCCTCGTGTCCTGGACCCGAGAAACCCGGGTCTGTGCCGGAGGCCTTGTCTGTGCCGGGAACAGATCCGGCGCCGGCGTCTACATTTGAGCGCCAGCCAGGACCCCCCCTCCGCCTCCAGCTCTGAGTCCTCTGGCCGACCCCTCCCGACTGACCCCGGGGTTCCAGACTTGGAGCTCTCCCCTTCTCTGGCTTCTCCTCCAGGGAACCCCCTGATGGGCCTCAGCCTCCCTCACGCACCCCAGTGGCGCCCTCAACACTGCAGCCCAGATCGGTGTCACCCCCGAAACGCTCTGCCCGGACCCGAACCTAGAGCTGAACACAGTGCTTGGCACTCTCGGGGCGCGTGCTGGTGGCTGAGAGCGCGGGGGTGCGGGCGGCACCCCAGAAACAAGCAGAGAACGGCAGCCACCCCAGTACCTCTCCGAGAAGGACGGGGCGGCAGCTGGGGCGGCGCCCTCCCGGTGCTGAGCCAGGCCCTGCTTCCGACGCTGACCTGCCGTGGAGGAGGGCAGGTGGGCTTCCAGGCCACTGGGGCCCCTCAGAGCTGCAGCCGCCACCTCCTGCCGGACCCTCAGGAGATCTGGGTGGGGGACAACAGCAGTGATCCCCCCGGCTATGCGCCCTCATCCTCCCCAGCCCGCGTCCTGGCGTAGAACACTGAGGTGGGGAACTGACCACACCACTCGGCCGTGCTCGGCCCCTGGGCTGGTCTCGGGCACGGGGCAGGAGGATGGGCGGCCACAGGGCCGAGACGGCAGACAAGATGCTGGGCGAGGAAGTGTCGGGAAAAGCTCTTCCCCGAAGGGGAGCCGGGTGCCTGGAGGCTTCCGGCCTGAGAGGCGGCTGAGCCTTGCGGGGAGGGGGCTGTGCAGACGTCCCGAGGCCACAGAGCCTGCGGGGCAGCGAGAGAGTCCAGAAGTGCCAGCCAGGGGCTGACCGCCCAGGGGACCCCCTCTTCGGTGGGAGGCAAGAAAATCTGAGGTCCCCATAGTAGACGCGGCTGGCCAAAGCTTGTCACAAAGGCAAGGCAGCGCCCGCGCACGCACGCCACACTAGCACAGGGGTCGCCCAGTCCCGGCTGCGCAGGGTCACTGTCCCGCGTGGCCACACTCGCCCCAATGCGCACGCAGCGCCCGCGTCTAAGGCGCCGCGTCCCAGGTCTCTCGGCCTCTCCCTCCCGCCCGGGCCTCCCGAGTCCCCGCAGCCCCCACTATGCAGGCTCCGCCAGCTCCACCGCCGCCTCTGTTTATAAATTAATTACCCGAAAGCGGAGGTGGGTCCCGCCCCGGGCCGGGTGCCAGCCCGCAGAGCCCAGCCGGCGCGGCCGCCCGCATCGATCCCAGCGGGGCCCAGCGCCCCCGCGCTAGCGGCCGGGTTAGTTACAGGGTTATTTACGGCCGGCTCCGCGGCGGGGCGGGGGGGGGGGGCGGCGGCGCGGCCTGATTGACAGCGCGCTCCCCGGCGGCCGGCGCCCCTCCCCCGCGCGGCCAGCAGAGCGGCCCCAGGCAGAGCAGGGAAAGCAATTAAAAAGGAGGATTTTTACATTATTAACATTTGGTGAATTATT
C
>chr1 pos:881043 gene_name:NOC2L strand:-
TGTGCATATCAGTTCATGTGTGCATCTGTATGTGTGTATGCACGTGTATCCATGAATGCCTGTGTGCCTGCAGGTGTGTGCATCTGTGCGTGTGTACACCTGTGTGTATGCATGTGTGTACCTTTGCGTGTACCTTTGCGTGTGTGCACCTGTGCATGTGTCTTTGTATACCAGTGTGTACCTGTGTGTACCTGTATGCATGCACATGCGTGTGTACCTGTGTGCACCTGTCTGCATGTGTGTACCTGTGCGTGTGTGCACCTGTGTGCATGCATTTGCGTCTGCATGTGTCTACCTGTGCATGCATGAACCTTTGCATGTGTGCATCTGTGTGCATGCATGTGTGTCTGCGTGCATCTATGTACCTGTGTGCACCTATGTGCATACACACGTGTCTGTGTGCACTTGTGTGCATGCATTTGCATCTGCATGTGTGTACTGTGCGTGTGTACCTGTGCGTGTACCTGTACACTTGGGTGCATGCATGCACGTCTGCTTGTGTGTGCCTGTGCGTGTGTGCACCTGTGTGCATGTACGTGCGTCTACGTGTGTGCCTGTGCATGTGTGCGCCTGTGTGTACCTGTGTGTATGCATCTTTGCACGTGCACATGCCACTCAGGTAGGGAAAGATTGGAGTCCGAAGTTTCAACTTTCAGTGGGTGGGTTAGGCCCCACCCCGCTGTAAATTTAGGAATTCACGATCTCCACCCTGTTTATCTAATGAGTTCTCAGCCTCATGAAGGCCCAGAGTCGTGTCACAGCTGTCCTTGGGGCTGGGTCCCCAGGTTGCTGGGTCCAGAAGGTATGGAAGCCCCAGGCACGTTCTGATTCCCCTTCCACTGAGGCAGGGATGCTGAACATCTTTAGGAAGCCATGTTCACTCCCATGGCAGCCAGCAGTGGTCTCTGATTGCCCAGCCCTTGGCCTGGCCCCTGTGTCTGTGGGCCTCCAGCTCTGCTGCCCAGCTCCAGGCATGCTTTGTGTCTGTTTCCCTTGTCCAATCTCCTTGGCTACGTGCTTTCTTACTCTCTTGCAGTGTCTGTTTCTTCACTTGTGCACTGCCCTGGTTCACTGCAGCCGCACCCTGTAGGCCCCTCTCACGCAGGGATGCAGGCCTCTCCTCTCCGGAAAAAGCAAACCCTAAAAGCTAAAACAAAGCCCTCAGCTGTAGGCCGTGCCTGCCCTTCCCCGGTGCCTGGACAGGAAGCCAGTCGCCTGCCCATACTTTTGGCCCAGGCTAGAGAAGGGCAGTGTCCTCCCAGAGGTTCATCAGTACCAGGGCGTTTTCCCATCTGGACCTGAGCTCAGCTGTCTGGCAGCCACCCCTGCTGAGTGGGGTGTCTTGCTGGGGCCTCCACCCTTGGGCCCCCCATAATCTGCTTCTGTCCTCTGGTGCCCCAGCATGTACCCTGGATCTCTCTGGTTCACAGCCTGAGGGCTCCTAGTGGTTGGGGAGGGGTCACAAGACTGAGAGGCCAGGCTGACTCTTTCTCTGCTCCTCCTGGCATGTCCTACGGAGGTGCATGGCCTGTGGCTTCTGTGGAGGGTGTGGGAGGGGCCCCCCAGGCCTCCCGTGACCTCCATCTGTCCCGTCCTGTGTCTGGCACTCTTTGCTGTTGCTGCTGCGTCTTCTGGTTGCTCGGGACGGAGCCCCATGTGGCATTGCTGTGCTGAGGGCCAGGATGGGCCTCAGTGCCATGTTGTCAGGAATGGGGGCTGTCCTGGTACTCTGTGTGGCAGGGACCTCTAGGTCTCCAGACGTGGGTCCTTAGTGCTTCCCAGGATTTTGGGAGAGGGCCCGTGTTCCTGATCCTTCCCTGCTGATCAGAGCCCCACTCGGGGACACGCCAGGCTGTGTGGGGCCATGGGGCTGGGACCGTGCCTAGCTGCTTATCTCTTGTTTCGGGTTGGGTCTCCTCGTGCTGAAGCCTGAGGACCAGGGTGACCAGGGTGCAGCCAGGTGCAGGGCCAAAGGGACCAGGGGGACCAGGGTGCAGCCA
GGGTGAAGCCAGGGTGACCAGGCATGGGGCCGAGAGAGCCTGACACGGCCCTTGGGGCAGATGTTCCAGCAGGTCGACTTCAACAGGAAGCCAGGGCGCATGAGCTCCAAGCCCATCAACTTCTCCGTGATCCTGAAGCTGTCCAATGTCAACCTGCAGGAGAAGGCGTACCGGGTGAGGCTGGCCTCCTGGGGAGGGGCCTGGGCAGTTCCCAGGGTGGGGTTGGGGGTGCTGAGGTGGATGGGAGGGGGCTGGCATCCTCCAAGTTCAAGCATGGACCTTCATGGTCTCCCAGGGCTGGGGCCCAGGAGCCCCTTCCTTGCAGTCCGTGCCCTGGAGATGGCGCTGCCCTGACAGCCTGAGGGGAAGGGGCCTGAGTGCCCAGCCCCAGCCTCTCCCTGATGCACTCGGCCCTCTCTCCTGCTCTTCAGGACGGCCTGGTGGAGCAGCTGTACGACCTCACCCTGGAGTACCTGCACAGCCAGGCACACTGCATCGGCTTCCCGGAGCTGGTGCTGCCTGTGGTCCTGCAGGTGTGTGTCCTGCCCACACCGGCTCGTGGCCGACTCAGACTGTCTTATAACGGGCTTGGCTGCCCCAGGCTGGTAGGAGGTGCCCTTGTCCCGGGTACCTGGGACTGGGGTGGAACCTGACTGCTGGCAACGCGCCCCGTGAGGCCCCCTTGGAGGGGCTGATGAGGGGTTCTAAGCCATTCAGGAGGTTTGGGGGCAGGCCTGGGCAGTGGGCTGAGGACCCTGGGAGCACAACAGCCTCCTCCCCAGGTGGGAACCACAAGGCTGATCTTTGCTTCGGGGTTGGGACTGAGCATGCCGACCCTGGCTCAGGCTGGTACGCTGATCGCACACTTCCCCAGGCCATCCCGGGTGTGGGGAGTGGGTGGAGTGGCTTCTCAGGTGGCAGGAAGGCCTGGCCTGCCCCGCCAAATACCCCACATCAGCCTCATAGGAAGGCCCAGCCTGCCCCGCTAAATACCCCACGTCAGCCTCATCTTAGGCAGGAGTGGGGTGGAGGAGGGGGTTCTCCTTATCCTCAGAAGGTCCTTCTGGGCCCCCACGGGAGGTCTGTTTGCTCTCAGCCGTGATGTTTCCAGCCTCAGGGAGGCCTGTGCCTAGTGGAAGGGGTGGGGGCCCTGCTGTCTGGCCCTGGTGGCTGGAATCAGATGTGCCGACTATAGGTCTGTGCAGTGTGGGGAGGAGGGATCTGCTCACATGAGCCACAATTGGTCAGAGGCTTATCCAGATAGAGGTGTGTGCATGTGTGTGTACTCACACACGGCCACACATGTCACATGCACAGAGCCGGGACCCCCTTTCTGGGGGGCACTCACAGCAGGGGCCACAGCCTCTGTTCCTCCCATGTCCTCTTGGGTGGTGATACCTGGCATTGGGGCATCTCTGCTTCTGGACTCAAGGGCCCAGGGTCGGGGTCTGGGGTAGGGGTCAGAAAATGTTTTTGGTGAGGGGCCAAATGGTAAACGTGGCTTCCTACGGATGTTCAGCTGCACAGGCAACTGCAGAACCATGCAATGCACACGTGTGGCTGCACGTCAGCGAGACTGTATTTTATTAGTAGTAGTAGTATTGTTTGAGATGGAGTCTCGCTGTGTTGCCCAGGCTGGAGTGCAATGGTGCGATCTCGGCTCACTGCAATCTCCATCTCCTGGGCTCAAGCGATTCTCCAGCCTCAGCCTCCCAAGGAGCTGGGTTTACAGGCGCCCACCACCACACCCAGCTAATTTTGGTATTTTAGTAGAGACGGGGTTTTGCCACGTTGGCCAGGCTGGTCTTGAACTCCTGACCTCAGGTGATCCACCCGCCTCAGCCTCCCAAAGTGCTGGGATTACAGGCGAGAGCCACCACGCCTGGCATAAAACTTTATTTCCAGAGGTGAGTAGTGGGCAGGGTTGGCCCAGAGGGTGGGCTGATTCTGCCTCTGCCATCCCAGCTGCAACAGCTATGCACTTGAGCCCTGAGATGGGA
TCCATGTCCCCTCCTGGGGTATCCCCGTGGCCACCACGCGTGGTTTTGCACAGGACCTGGGCCAGCTGTGCACATGGAGCGGTCCTGGGCTTCAGTGGCTGACCCCTCCCTTCCGCAGCTGAAGTCGTTCCTCCGGGAGTGCAAGGTGGCCAACTACTGCCGGCAGGTGCAGCAGCTGCTTGGGAAGGTTCAGGAGAACTCGGCATACATCTGCAGCCGCCGCCAGAGGGTTTCCTTCGGCGTCTCTGAGCAGCAGGCAGTGGTTAGTGGGCCCTGGGGGTAGTGCCACCTGAGGGCACCTGCCAGGGTATAGCCCCAGCTACATGTGGGGGTTTGCCCAGGGTGAGGCATGACCCTGAACTCCCCCAACCCCCCAGGAAGCCTGGGAGAAGCTGACCCGGGAAGAGGGGACACCCCTGACCTTGTACTACAGCCACTGGCGCAAGCTGCGTGACCGGGAGATCCAGCTGGAGATCAGTGGCAAAGAGCGGGTGCGGCTCGGCGAGGGGACCTGGGGGTGTGTTGTGACTTCCTGGGTTTCAGATCTAGCGCACTATGACTTGAGACCAGGGCGAGGGTTTGGAAACAGTGCCAGGCGGCCAGGGCCGTGCCCGGATGATTCGACTTGGAGAGGGGGTAGGTGTTGGAGAACTGGCCAGAACCAGGCGTTTCCAGGGAGGGGAAGCCCCAGGCTGCACTAGGTTGGGGAGGCCATGCCCCCTCAGGCCTGATGGGCTGGAGGCTCCGGGCAGGTGGAGTGGCTGGACTGACCTCGTCACCCAGGCCAGTATGTGGGCACCAGGGGCCCGTGAGGAGAAGCAGGAAGGGCTCTGCCTTTGACCTTGGACATGGGATGGACAACTTGGAGGATGGCTTTGTGATTTGGGAACAGAGGGGACTAGAAATTGGCCACATGGGGCCCTGGTGGTGGGTCTGGCGATGCCTGGCCCTGCTGTGGCCGCCAGCCCCTGCCCTCTCTCACCTGAGCCCCTGGTTCTTTAGCCTTCCAGCTGGAAGACCTGAACTTCCCTGAGATCAAACGAAGGAAGATGGCTGACAGGAAGGATGAGGACAGGAAGCAATTTAAAGACCTCTTTGACCTGAACAGCTCTGAAGAGGACGACACCGAGGGATTCTCGGAGAGAGGTGGGGCCTGCGTGGTGCTCCCAGGGGAAGGGTGGGCCTGGAGGGCTCTGCTGGACTTCCCAGAGCCACGAGGGCCACCTGTACCCATCCTGCAGGGGGCTCACCAGTCTCTGGCCCAGCTGGGGCCAACCTCAGTGTTGCCAGGCTTCTGGTGCCAGCGCCTTCCCTCCTTGAAGTGAAGGCCTACTGGGATTGGTAACTCTGTCCCCAGGCCTGTGACCTCCCAGTTCCTCCCCAGGGCTCCTCTCCACCTGCTGGAAGTCAGCGGAGGGAAGGGTGTTGGGAGCCTGGCCACCCTCCTGCCCCCACTGTGACTTTGCTGGTGGACCCTGTGGGTGGGAGTCATATGGACTCTGCTTCTTGTTCCTCAGGGATACTGAGGCCCCTGAGCACTCGGCATGGGGTGGAAGACGATGAAGAGGACGAGGAGGAGGGCGAGGAGGACAGCAGCAACTCGGAGGGTGAATGGTCTTGGGGTGAGAGGGTGTGGCCCTGTGAGCCCATCTGGCGGGAGGGCAGAGCCACGTGGGCGGGGGGCGTGGGGCTCTGGGCCAGGCTTTTCCCTCCCTGGGAAGGCCAGGCCAAATGCTCTGTTCTCTGGCAGCCAGCAACAGGGATAAATTAATTAGTGCCGTGATTAATTAGTGATGAGTAACCTCTAAGGCTGGCTTCTTCCTGATAAAGCAAAATTTATGTAGCCTCCATCTCTCCCCGCAGATGGAGACCCAGACGCAGAGGCGGGGCTGGCCCCTGGGGAGCTGCAGCAGCTGGCCCAGGGGCCGGAGGACGAGCTGGAGGATCTGCAGCTCTCAGAGGACGACTGAGGCAGCCCATCTGGGGGGCCTGTAGGGGCT
GCCGGGCTGGTGGCCAGTGTTTCCACCTCCCTGGCAGTCAGGCCTAGAGGCTGGCGTCTGTGCAGTTGGGGGAGGCAGTAGACACGGGACAGGCTTTATTATTTATTTTTCAGCATGAAAGACCAAACGTATCGAGAGCTGGGCTGGGCTGGGCTGGTGTGGCTGCTGAAGCCCCACAGCTGTGGGCTGCTGAAGTCAGCTCCGCGGGGGAGCTGACCCTGACGTCAGCAGACCGAGACCAGTCCCAGTTCCAGGGGGAGGCCTGCAGGCCCCTGGCCCCTTCCACCACCTCTGCCCTCCGTCTGCAGACCTCGTCCATCTGCACCAGGCTCTGCCTTCACTCCCCCAAGTCTTTGAAAATTTGTTCCTTTCCTTTGAAGTCACATTTTCTTTTAAAATTTTTTGTTTTGCATCCGAAACCGAAAGAAATAAAGCGGTGGGAGGCAGGGCCATTGTGTTGAGTGGTGGCTCCTGGAGATTTGTGTGGCCCCACCCCTACCCCCGGCAACCTCAACACAGAGGCTGGGAAGGGTCGGGGGCCCCTGGAAGTAGAGCCAAGGTCCCATTCTCCTGCTTGGGTGAAGTTTGACCGGCAAGGGCGTGGCCCCCTCCATAGGGGGACGTGGCCGTCGTGGGGGACAAGGGCTGCTCTCCTGTGCCGAGTTCTCGCTCCGGGGCCCGCAGGGTTGGTGGCTGCAGTGGCAGAGCCACGGGGAAGCTGGCCACGTAGAAAACTCGGCCCAGGCGCCTGGCCACCTGCAGAGAGATGGCCTGGTGGGGGCTGTCGCACACCCACCCAGAGCTTTCTGCCCAGCTGTGGTCTGGAGACCCTGACCTCACTCCCCAGCGTCTCACCTGGGCCCGGATCTTGAGGGCGGGCCCCAGCTTCAGCCCCATGTTGGTCAGCAGGTGCTCCTCCGTCAGCAGTGGCAGGGTCTCCCCGTCGATCCCCTGCTCCCTGAAGACCTGGAAGGGGGAGGGAGGGAGGGTCAGGACCCGGGTGCTGCTCCATCCTCCAGCCCAGCAGAGCCAAGAGGAGCTGTTTGTGCAGCTCACGGAAATTTTTTAAAAACAGTGATCTGTTGGGTTTGAGTGTCAGGCAGAAAACACCTGGCAAAAAAAAAAAAAAAAAAAAAAGGTGGGGTGCACAGTCCAGTCCCTGCTGCGCACTGCTTTTGGGGAGGGAGAGGGCGTTGGGGAATGGCCCTACCCCACCATCCTTTCCAGGGAGGTAGTGACTGCCAGCCAGCTCCAGCCCCGCCCCAGGAACTGGGGCCCCCCCTTACCCGAGTGTACTCTCCACAGCCAGACAGGCCCCCCACGAAGCTGCAGACGTCATCCACGGTCCACTTGGTGACGTCCTCAGGGGCTGGGGCCTCCTCCCCATCCATGGAGAGTCCCCCTACCGCGCCTGAGTTGTGGGGCGTGTTACTAGGGCTCTGGCTGGCTTTCCCATACCCGCCCGTCTCCTGACCGTCGTGTGCCCCAATTGCTGAGGTGGGGGATGGGTGCCTAAGGGGAATTTTTCCTCCCCCCAAGACCCTTCCACAGGCGGCCTGCCAGTCCTGTGCCCTCTGGAAGGATCTAGAGTGTGGGGGTGCCCACCTGTGTGGAAGTAGGGGCTGACGGCATAAGGGAAGCCCAGGGGCAGTGTGGACCCTGGGAAAAGCCCCTTCCCCTCGGCGCCGGCCCCTCCAGCTGGAGCTTGGCCCGGAGTGGGCCCCCTGCACCCAACAGCTGCCGTCTCGGGGTCCTCTCCGTCCGAGTCTTTGGGGGGCTCGTCTTCCGAGCCATCTTGTGCCCAGAGCCTAGCCCCCGTCATCTCCTTGGACTCGCTGGGCCGCGCTGAGGCAGGGCCGGGACCCCCCTTCCGGGGGGCTCGCCGGGCAGAGTCCCGGGACGGGGTGGGGGGTCCGGAGCCCGGGGGCCCCTGGGGGGGCAGGGCCAGCAGTGGCGCCGCGCCGTGGTTCAGCACCAGCAGGGCCCCGCGCCGCTGCAGC
TCCTCGGCGCCGTCGTTGGGGCGCAGGGCGGTCTCGGGCGCCAGCAGCTGTGGGCGCGCGCTCTCCAGCTCCTTCTGCCGCAGGAGGTCGGCGGGCAGCTCCAGCCTGGGCCGGGGGCAGACGCGGGTCAGCCGCCTCCCGGGCCGCGCGGCCCCGCCCGCCTCCCCGCACCTACCGGGCCAGGTTCTGCTTCCGCAGGAGCTCCTGCTGCCAGGCGAACATCTCCGCCTGCGCGGGGGGCAGGAAGCCGTAGCCTGGTGGGGAGGGGGACAACGCGGGGTCGGGGGGTCCGAGGCGGTCCGAGCCGGCCCTTTGCAATCCCCTACCCGGTCCGTCTCGGCTCCGACAGGGGCAGGGCCGGGTCCGCGAGCCTCCACGCGGGCTCCCAGGGGATGCGCACCCGCCTCCTCACCTGGGGTCTGGCACAGAGCCGAGGGCACCCCCAGGAAGGGGGGCCTGAGATGGGGGCCCAGGGCGACGTGAGGGGCATTCTGCGGCGACAGCAAGGGGGGCGGCTGAGGCAGCTCCCTGTGGACGGCAGCGCAATGAGCGGCGTCCCCCCCGCCCCCGTTCCCGTAAAGGCCCCGGCGCCCCTCCCCCGCCGCCTCCGCTAATTGCCGGCCCCGCGGGCGGTCGATGCGCAGCTCGTTACGGCCCCAGGTCGCGCGCCATCCCCGCCCCCGCCGGCGGCGCCAATTAATCTCGGCGGCGGCGCGGAGGCGCTGACCCGGCGGGCGGCGACGGCTGCAGCGGTAATTACTGCGCGGGGCGCCGGCCCCGCCTGCTCCCCTCCCCTGCGGACCCGAGCGGCGGGGGAAGCGGGGGCGTCATTAGCCGCAATCCGGGCGGCGGTGGAGGCAGCTGCAGACGCCGGGCTCAGCGGCCGGGCAGGGACTCGGGCCTCAGGCGGCCCCTCCCTTCTCCAGGCGCCCCGCCCCGCCTCGTGTCCTGGACCCGAGAAACCCGGGTCTGTGCCGGAGGCCTTGTCTGTGCCGGGAACAGATCCGGCGCCGGCGTCTACATTTGAGCGCCAGCCAGGACCCCCCCTCCGCCTCCAGCTCTGAGTCCTCTGGCCGACCCCTCCCGACTGACCCCGGGGTTCCAGACTTGGAGCTCTCCCCTTCTCTGGCTTCTCCTCCAGGGAACCCCCTGATGGGCCTCAGCCTCCCTCACGCACCCCAGTGGCGCCCTCAACACTGCAGCCCAGATCGGTGTCACCCCCGAAACGCTCTGCCCGGACCCGAACCTAGAGCTGAACACAGTGCTTGGCACTCTCGGGGCGCGTGCTGGTGGCTGAGAGCGCGGGGGTGCGGGCGGCACCCCAGAAACAAGCAGAGAACGGCAGCCACCCCAGTACCTCTCCGAGAAGGACGGGGCGGCAGCTGGGGCGGCGCCCTCCCGGTGCTGAGCCAGGCCCTGCTTCCGACGCTGACCTGCCGTGGAGGAGGGCAGGTGGGCTTCCAGGCCACTGGGGCCCCTCAGAGCTGCAGCCGCCACCTCCTGCCGGACCCTCAGGAGATCTGGGTGGGGGACAACAGCAGTGATCCCCCCGGCTATGCGCCCTCATCCTCCCCAGCCCGCGTCCTGGCGTAGAACACTGAGGTGGGGAACTGACCACACCACTCGGCCGTGCTCGGCCCCTGGGCTGGTCTCGGGCACGGGGCAGGAGGATGGGCGGCCACAGGGCCGAGACGGCAGACAAGATGCTGGGCGAGGAAGTGTCGGGAAAAGCTCTTCCCCGAAGGGGAGCCGGGTGCCTGGAGGCTTCCGGCCTGAGAGGCGGCTGAGCCTTGCGGGGAGGGGGCTGTGCAGACGTCCCGAGGCCACAGAGCCTGCGGGGCAGCGAGAGAGTCCAGAAGTGCCAGCCAGGGGCTGACCGCCCAGGGGACCCCCTCTTCGGTGGGAGGCAAGAAAATCTGAGGTCCCCATAGTAGACGCGGCTGGCCAAAGCTTGTCACAAAGGCAAGGCAGCGCCCGCGCACGCACGCCACACTAGCACA
G
>chr1 pos:881044 gene_name:NOC2L strand:-
GTGTGCATATCAGTTCATGTGTGCATCTGTATGTGTGTATGCACGTGTATCCATGAATGCCTGTGTGCCTGCAGGTGTGTGCATCTGTGCGTGTGTACACCTGTGTGTATGCATGTGTGTACCTTTGCGTGTACCTTTGCGTGTGTGCACCTGTGCATGTGTCTTTGTATACCAGTGTGTACCTGTGTGTACCTGTATGCATGCACATGCGTGTGTACCTGTGTGCACCTGTCTGCATGTGTGTACCTGTGCGTGTGTGCACCTGTGTGCATGCATTTGCGTCTGCATGTGTCTACCTGTGCATGCATGAACCTTTGCATGTGTGCATCTGTGTGCATGCATGTGTGTCTGCGTGCATCTATGTACCTGTGTGCACCTATGTGCATACACACGTGTCTGTGTGCACTTGTGTGCATGCATTTGCATCTGCATGTGTGTACTGTGCGTGTGTACCTGTGCGTGTACCTGTACACTTGGGTGCATGCATGCACGTCTGCTTGTGTGTGCCTGTGCGTGTGTGCACCTGTGTGCATGTACGTGCGTCTACGTGTGTGCCTGTGCATGTGTGCGCCTGTGTGTACCTGTGTGTATGCATCTTTGCACGTGCACATGCCACTCAGGTAGGGAAAGATTGGAGTCCGAAGTTTCAACTTTCAGTGGGTGGGTTAGGCCCCACCCCGCTGTAAATTTAGGAATTCACGATCTCCACCCTGTTTATCTAATGAGTTCTCAGCCTCATGAAGGCCCAGAGTCGTGTCACAGCTGTCCTTGGGGCTGGGTCCCCAGGTTGCTGGGTCCAGAAGGTATGGAAGCCCCAGGCACGTTCTGATTCCCCTTCCACTGAGGCAGGGATGCTGAACATCTTTAGGAAGCCATGTTCACTCCCATGGCAGCCAGCAGTGGTCTCTGATTGCCCAGCCCTTGGCCTGGCCCCTGTGTCTGTGGGCCTCCAGCTCTGCTGCCCAGCTCCAGGCATGCTTTGTGTCTGTTTCCCTTGTCCAATCTCCTTGGCTACGTGCTTTCTTACTCTCTTGCAGTGTCTGTTTCTTCACTTGTGCACTGCCCTGGTTCACTGCAGCCGCACCCTGTAGGCCCCTCTCACGCAGGGATGCAGGCCTCTCCTCTCCGGAAAAAGCAAACCCTAAAAGCTAAAACAAAGCCCTCAGCTGTAGGCCGTGCCTGCCCTTCCCCGGTGCCTGGACAGGAAGCCAGTCGCCTGCCCATACTTTTGGCCCAGGCTAGAGAAGGGCAGTGTCCTCCCAGAGGTTCATCAGTACCAGGGCGTTTTCCCATCTGGACCTGAGCTCAGCTGTCTGGCAGCCACCCCTGCTGAGTGGGGTGTCTTGCTGGGGCCTCCACCCTTGGGCCCCCCATAATCTGCTTCTGTCCTCTGGTGCCCCAGCATGTACCCTGGATCTCTCTGGTTCACAGCCTGAGGGCTCCTAGTGGTTGGGGAGGGGTCACAAGACTGAGAGGCCAGGCTGACTCTTTCTCTGCTCCTCCTGGCATGTCCTACGGAGGTGCATGGCCTGTGGCTTCTGTGGAGGGTGTGGGAGGGGCCCCCCAGGCCTCCCGTGACCTCCATCTGTCCCGTCCTGTGTCTGGCACTCTTTGCTGTTGCTGCTGCGTCTTCTGGTTGCTCGGGACGGAGCCCCATGTGGCATTGCTGTGCTGAGGGCCAGGATGGGCCTCAGTGCCATGTTGTCAGGAATGGGGGCTGTCCTGGTACTCTGTGTGGCAGGGACCTCTAGGTCTCCAGACGTGGGTCCTTAGTGCTTCCCAGGATTTTGGGAGAGGGCCCGTGTTCCTGATCCTTCCCTGCTGATCAGAGCCCCACTCGGGGACACGCCAGGCTGTGTGGGGCCATGGGGCTGGGACCGTGCCTAGCTGCTTATCTCTTGTTTCGGGTTGGGTCTCCTCGTGCTGAAGCCTGAGGACCAGGGTGACCAGGGTGCAGCCAGGTGCAGGGCCAAAGGGACCAGGGGGACCAGGGTGCAGCC
AGGGTGAAGCCAGGGTGACCAGGCATGGGGCCGAGAGAGCCTGACACGGCCCTTGGGGCAGATGTTCCAGCAGGTCGACTTCAACAGGAAGCCAGGGCGCATGAGCTCCAAGCCCATCAACTTCTCCGTGATCCTGAAGCTGTCCAATGTCAACCTGCAGGAGAAGGCGTACCGGGTGAGGCTGGCCTCCTGGGGAGGGGCCTGGGCAGTTCCCAGGGTGGGGTTGGGGGTGCTGAGGTGGATGGGAGGGGGCTGGCATCCTCCAAGTTCAAGCATGGACCTTCATGGTCTCCCAGGGCTGGGGCCCAGGAGCCCCTTCCTTGCAGTCCGTGCCCTGGAGATGGCGCTGCCCTGACAGCCTGAGGGGAAGGGGCCTGAGTGCCCAGCCCCAGCCTCTCCCTGATGCACTCGGCCCTCTCTCCTGCTCTTCAGGACGGCCTGGTGGAGCAGCTGTACGACCTCACCCTGGAGTACCTGCACAGCCAGGCACACTGCATCGGCTTCCCGGAGCTGGTGCTGCCTGTGGTCCTGCAGGTGTGTGTCCTGCCCACACCGGCTCGTGGCCGACTCAGACTGTCTTATAACGGGCTTGGCTGCCCCAGGCTGGTAGGAGGTGCCCTTGTCCCGGGTACCTGGGACTGGGGTGGAACCTGACTGCTGGCAACGCGCCCCGTGAGGCCCCCTTGGAGGGGCTGATGAGGGGTTCTAAGCCATTCAGGAGGTTTGGGGGCAGGCCTGGGCAGTGGGCTGAGGACCCTGGGAGCACAACAGCCTCCTCCCCAGGTGGGAACCACAAGGCTGATCTTTGCTTCGGGGTTGGGACTGAGCATGCCGACCCTGGCTCAGGCTGGTACGCTGATCGCACACTTCCCCAGGCCATCCCGGGTGTGGGGAGTGGGTGGAGTGGCTTCTCAGGTGGCAGGAAGGCCTGGCCTGCCCCGCCAAATACCCCACATCAGCCTCATAGGAAGGCCCAGCCTGCCCCGCTAAATACCCCACGTCAGCCTCATCTTAGGCAGGAGTGGGGTGGAGGAGGGGGTTCTCCTTATCCTCAGAAGGTCCTTCTGGGCCCCCACGGGAGGTCTGTTTGCTCTCAGCCGTGATGTTTCCAGCCTCAGGGAGGCCTGTGCCTAGTGGAAGGGGTGGGGGCCCTGCTGTCTGGCCCTGGTGGCTGGAATCAGATGTGCCGACTATAGGTCTGTGCAGTGTGGGGAGGAGGGATCTGCTCACATGAGCCACAATTGGTCAGAGGCTTATCCAGATAGAGGTGTGTGCATGTGTGTGTACTCACACACGGCCACACATGTCACATGCACAGAGCCGGGACCCCCTTTCTGGGGGGCACTCACAGCAGGGGCCACAGCCTCTGTTCCTCCCATGTCCTCTTGGGTGGTGATACCTGGCATTGGGGCATCTCTGCTTCTGGACTCAAGGGCCCAGGGTCGGGGTCTGGGGTAGGGGTCAGAAAATGTTTTTGGTGAGGGGCCAAATGGTAAACGTGGCTTCCTACGGATGTTCAGCTGCACAGGCAACTGCAGAACCATGCAATGCACACGTGTGGCTGCACGTCAGCGAGACTGTATTTTATTAGTAGTAGTAGTATTGTTTGAGATGGAGTCTCGCTGTGTTGCCCAGGCTGGAGTGCAATGGTGCGATCTCGGCTCACTGCAATCTCCATCTCCTGGGCTCAAGCGATTCTCCAGCCTCAGCCTCCCAAGGAGCTGGGTTTACAGGCGCCCACCACCACACCCAGCTAATTTTGGTATTTTAGTAGAGACGGGGTTTTGCCACGTTGGCCAGGCTGGTCTTGAACTCCTGACCTCAGGTGATCCACCCGCCTCAGCCTCCCAAAGTGCTGGGATTACAGGCGAGAGCCACCACGCCTGGCATAAAACTTTATTTCCAGAGGTGAGTAGTGGGCAGGGTTGGCCCAGAGGGTGGGCTGATTCTGCCTCTGCCATCCCAGCTGCAACAGCTATGCACTTGAGCCCTGAGATGGG
ATCCATGTCCCCTCCTGGGGTATCCCCGTGGCCACCACGCGTGGTTTTGCACAGGACCTGGGCCAGCTGTGCACATGGAGCGGTCCTGGGCTTCAGTGGCTGACCCCTCCCTTCCGCAGCTGAAGTCGTTCCTCCGGGAGTGCAAGGTGGCCAACTACTGCCGGCAGGTGCAGCAGCTGCTTGGGAAGGTTCAGGAGAACTCGGCATACATCTGCAGCCGCCGCCAGAGGGTTTCCTTCGGCGTCTCTGAGCAGCAGGCAGTGGTTAGTGGGCCCTGGGGGTAGTGCCACCTGAGGGCACCTGCCAGGGTATAGCCCCAGCTACATGTGGGGGTTTGCCCAGGGTGAGGCATGACCCTGAACTCCCCCAACCCCCCAGGAAGCCTGGGAGAAGCTGACCCGGGAAGAGGGGACACCCCTGACCTTGTACTACAGCCACTGGCGCAAGCTGCGTGACCGGGAGATCCAGCTGGAGATCAGTGGCAAAGAGCGGGTGCGGCTCGGCGAGGGGACCTGGGGGTGTGTTGTGACTTCCTGGGTTTCAGATCTAGCGCACTATGACTTGAGACCAGGGCGAGGGTTTGGAAACAGTGCCAGGCGGCCAGGGCCGTGCCCGGATGATTCGACTTGGAGAGGGGGTAGGTGTTGGAGAACTGGCCAGAACCAGGCGTTTCCAGGGAGGGGAAGCCCCAGGCTGCACTAGGTTGGGGAGGCCATGCCCCCTCAGGCCTGATGGGCTGGAGGCTCCGGGCAGGTGGAGTGGCTGGACTGACCTCGTCACCCAGGCCAGTATGTGGGCACCAGGGGCCCGTGAGGAGAAGCAGGAAGGGCTCTGCCTTTGACCTTGGACATGGGATGGACAACTTGGAGGATGGCTTTGTGATTTGGGAACAGAGGGGACTAGAAATTGGCCACATGGGGCCCTGGTGGTGGGTCTGGCGATGCCTGGCCCTGCTGTGGCCGCCAGCCCCTGCCCTCTCTCACCTGAGCCCCTGGTTCTTAGGCCTTCCAGCTGGAAGACCTGAACTTCCCTGAGATCAAACGAAGGAAGATGGCTGACAGGAAGGATGAGGACAGGAAGCAATTTAAAGACCTCTTTGACCTGAACAGCTCTGAAGAGGACGACACCGAGGGATTCTCGGAGAGAGGTGGGGCCTGCGTGGTGCTCCCAGGGGAAGGGTGGGCCTGGAGGGCTCTGCTGGACTTCCCAGAGCCACGAGGGCCACCTGTACCCATCCTGCAGGGGGCTCACCAGTCTCTGGCCCAGCTGGGGCCAACCTCAGTGTTGCCAGGCTTCTGGTGCCAGCGCCTTCCCTCCTTGAAGTGAAGGCCTACTGGGATTGGTAACTCTGTCCCCAGGCCTGTGACCTCCCAGTTCCTCCCCAGGGCTCCTCTCCACCTGCTGGAAGTCAGCGGAGGGAAGGGTGTTGGGAGCCTGGCCACCCTCCTGCCCCCACTGTGACTTTGCTGGTGGACCCTGTGGGTGGGAGTCATATGGACTCTGCTTCTTGTTCCTCAGGGATACTGAGGCCCCTGAGCACTCGGCATGGGGTGGAAGACGATGAAGAGGACGAGGAGGAGGGCGAGGAGGACAGCAGCAACTCGGAGGGTGAATGGTCTTGGGGTGAGAGGGTGTGGCCCTGTGAGCCCATCTGGCGGGAGGGCAGAGCCACGTGGGCGGGGGGCGTGGGGCTCTGGGCCAGGCTTTTCCCTCCCTGGGAAGGCCAGGCCAAATGCTCTGTTCTCTGGCAGCCAGCAACAGGGATAAATTAATTAGTGCCGTGATTAATTAGTGATGAGTAACCTCTAAGGCTGGCTTCTTCCTGATAAAGCAAAATTTATGTAGCCTCCATCTCTCCCCGCAGATGGAGACCCAGACGCAGAGGCGGGGCTGGCCCCTGGGGAGCTGCAGCAGCTGGCCCAGGGGCCGGAGGACGAGCTGGAGGATCTGCAGCTCTCAGAGGACGACTGAGGCAGCCCATCTGGGGGGCCTGTAGGGGC
TGCCGGGCTGGTGGCCAGTGTTTCCACCTCCCTGGCAGTCAGGCCTAGAGGCTGGCGTCTGTGCAGTTGGGGGAGGCAGTAGACACGGGACAGGCTTTATTATTTATTTTTCAGCATGAAAGACCAAACGTATCGAGAGCTGGGCTGGGCTGGGCTGGTGTGGCTGCTGAAGCCCCACAGCTGTGGGCTGCTGAAGTCAGCTCCGCGGGGGAGCTGACCCTGACGTCAGCAGACCGAGACCAGTCCCAGTTCCAGGGGGAGGCCTGCAGGCCCCTGGCCCCTTCCACCACCTCTGCCCTCCGTCTGCAGACCTCGTCCATCTGCACCAGGCTCTGCCTTCACTCCCCCAAGTCTTTGAAAATTTGTTCCTTTCCTTTGAAGTCACATTTTCTTTTAAAATTTTTTGTTTTGCATCCGAAACCGAAAGAAATAAAGCGGTGGGAGGCAGGGCCATTGTGTTGAGTGGTGGCTCCTGGAGATTTGTGTGGCCCCACCCCTACCCCCGGCAACCTCAACACAGAGGCTGGGAAGGGTCGGGGGCCCCTGGAAGTAGAGCCAAGGTCCCATTCTCCTGCTTGGGTGAAGTTTGACCGGCAAGGGCGTGGCCCCCTCCATAGGGGGACGTGGCCGTCGTGGGGGACAAGGGCTGCTCTCCTGTGCCGAGTTCTCGCTCCGGGGCCCGCAGGGTTGGTGGCTGCAGTGGCAGAGCCACGGGGAAGCTGGCCACGTAGAAAACTCGGCCCAGGCGCCTGGCCACCTGCAGAGAGATGGCCTGGTGGGGGCTGTCGCACACCCACCCAGAGCTTTCTGCCCAGCTGTGGTCTGGAGACCCTGACCTCACTCCCCAGCGTCTCACCTGGGCCCGGATCTTGAGGGCGGGCCCCAGCTTCAGCCCCATGTTGGTCAGCAGGTGCTCCTCCGTCAGCAGTGGCAGGGTCTCCCCGTCGATCCCCTGCTCCCTGAAGACCTGGAAGGGGGAGGGAGGGAGGGTCAGGACCCGGGTGCTGCTCCATCCTCCAGCCCAGCAGAGCCAAGAGGAGCTGTTTGTGCAGCTCACGGAAATTTTTTAAAAACAGTGATCTGTTGGGTTTGAGTGTCAGGCAGAAAACACCTGGCAAAAAAAAAAAAAAAAAAAAAAGGTGGGGTGCACAGTCCAGTCCCTGCTGCGCACTGCTTTTGGGGAGGGAGAGGGCGTTGGGGAATGGCCCTACCCCACCATCCTTTCCAGGGAGGTAGTGACTGCCAGCCAGCTCCAGCCCCGCCCCAGGAACTGGGGCCCCCCCTTACCCGAGTGTACTCTCCACAGCCAGACAGGCCCCCCACGAAGCTGCAGACGTCATCCACGGTCCACTTGGTGACGTCCTCAGGGGCTGGGGCCTCCTCCCCATCCATGGAGAGTCCCCCTACCGCGCCTGAGTTGTGGGGCGTGTTACTAGGGCTCTGGCTGGCTTTCCCATACCCGCCCGTCTCCTGACCGTCGTGTGCCCCAATTGCTGAGGTGGGGGATGGGTGCCTAAGGGGAATTTTTCCTCCCCCCAAGACCCTTCCACAGGCGGCCTGCCAGTCCTGTGCCCTCTGGAAGGATCTAGAGTGTGGGGGTGCCCACCTGTGTGGAAGTAGGGGCTGACGGCATAAGGGAAGCCCAGGGGCAGTGTGGACCCTGGGAAAAGCCCCTTCCCCTCGGCGCCGGCCCCTCCAGCTGGAGCTTGGCCCGGAGTGGGCCCCCTGCACCCAACAGCTGCCGTCTCGGGGTCCTCTCCGTCCGAGTCTTTGGGGGGCTCGTCTTCCGAGCCATCTTGTGCCCAGAGCCTAGCCCCCGTCATCTCCTTGGACTCGCTGGGCCGCGCTGAGGCAGGGCCGGGACCCCCCTTCCGGGGGGCTCGCCGGGCAGAGTCCCGGGACGGGGTGGGGGGTCCGGAGCCCGGGGGCCCCTGGGGGGGCAGGGCCAGCAGTGGCGCCGCGCCGTGGTTCAGCACCAGCAGGGCCCCGCGCCGCTGCAG
CTCCTCGGCGCCGTCGTTGGGGCGCAGGGCGGTCTCGGGCGCCAGCAGCTGTGGGCGCGCGCTCTCCAGCTCCTTCTGCCGCAGGAGGTCGGCGGGCAGCTCCAGCCTGGGCCGGGGGCAGACGCGGGTCAGCCGCCTCCCGGGCCGCGCGGCCCCGCCCGCCTCCCCGCACCTACCGGGCCAGGTTCTGCTTCCGCAGGAGCTCCTGCTGCCAGGCGAACATCTCCGCCTGCGCGGGGGGCAGGAAGCCGTAGCCTGGTGGGGAGGGGGACAACGCGGGGTCGGGGGGTCCGAGGCGGTCCGAGCCGGCCCTTTGCAATCCCCTACCCGGTCCGTCTCGGCTCCGACAGGGGCAGGGCCGGGTCCGCGAGCCTCCACGCGGGCTCCCAGGGGATGCGCACCCGCCTCCTCACCTGGGGTCTGGCACAGAGCCGAGGGCACCCCCAGGAAGGGGGGCCTGAGATGGGGGCCCAGGGCGACGTGAGGGGCATTCTGCGGCGACAGCAAGGGGGGCGGCTGAGGCAGCTCCCTGTGGACGGCAGCGCAATGAGCGGCGTCCCCCCCGCCCCCGTTCCCGTAAAGGCCCCGGCGCCCCTCCCCCGCCGCCTCCGCTAATTGCCGGCCCCGCGGGCGGTCGATGCGCAGCTCGTTACGGCCCCAGGTCGCGCGCCATCCCCGCCCCCGCCGGCGGCGCCAATTAATCTCGGCGGCGGCGCGGAGGCGCTGACCCGGCGGGCGGCGACGGCTGCAGCGGTAATTACTGCGCGGGGCGCCGGCCCCGCCTGCTCCCCTCCCCTGCGGACCCGAGCGGCGGGGGAAGCGGGGGCGTCATTAGCCGCAATCCGGGCGGCGGTGGAGGCAGCTGCAGACGCCGGGCTCAGCGGCCGGGCAGGGACTCGGGCCTCAGGCGGCCCCTCCCTTCTCCAGGCGCCCCGCCCCGCCTCGTGTCCTGGACCCGAGAAACCCGGGTCTGTGCCGGAGGCCTTGTCTGTGCCGGGAACAGATCCGGCGCCGGCGTCTACATTTGAGCGCCAGCCAGGACCCCCCCTCCGCCTCCAGCTCTGAGTCCTCTGGCCGACCCCTCCCGACTGACCCCGGGGTTCCAGACTTGGAGCTCTCCCCTTCTCTGGCTTCTCCTCCAGGGAACCCCCTGATGGGCCTCAGCCTCCCTCACGCACCCCAGTGGCGCCCTCAACACTGCAGCCCAGATCGGTGTCACCCCCGAAACGCTCTGCCCGGACCCGAACCTAGAGCTGAACACAGTGCTTGGCACTCTCGGGGCGCGTGCTGGTGGCTGAGAGCGCGGGGGTGCGGGCGGCACCCCAGAAACAAGCAGAGAACGGCAGCCACCCCAGTACCTCTCCGAGAAGGACGGGGCGGCAGCTGGGGCGGCGCCCTCCCGGTGCTGAGCCAGGCCCTGCTTCCGACGCTGACCTGCCGTGGAGGAGGGCAGGTGGGCTTCCAGGCCACTGGGGCCCCTCAGAGCTGCAGCCGCCACCTCCTGCCGGACCCTCAGGAGATCTGGGTGGGGGACAACAGCAGTGATCCCCCCGGCTATGCGCCCTCATCCTCCCCAGCCCGCGTCCTGGCGTAGAACACTGAGGTGGGGAACTGACCACACCACTCGGCCGTGCTCGGCCCCTGGGCTGGTCTCGGGCACGGGGCAGGAGGATGGGCGGCCACAGGGCCGAGACGGCAGACAAGATGCTGGGCGAGGAAGTGTCGGGAAAAGCTCTTCCCCGAAGGGGAGCCGGGTGCCTGGAGGCTTCCGGCCTGAGAGGCGGCTGAGCCTTGCGGGGAGGGGGCTGTGCAGACGTCCCGAGGCCACAGAGCCTGCGGGGCAGCGAGAGAGTCCAGAAGTGCCAGCCAGGGGCTGACCGCCCAGGGGACCCCCTCTTCGGTGGGAGGCAAGAAAATCTGAGGTCCCCATAGTAGACGCGGCTGGCCAAAGCTTGTCACAAAGGCAAGGCAGCGCCCGCGCACGCACGCCACACTAGCAC
A
Number of lines
wc -l AG.fa
252122 AG.fa
Number of sequence lines
grep -v ">" AG.fa | wc -l
126061
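The line counts above give the number of records but not how long each region is. A minimal sketch of a per-record length summary follows, assuming a plain FASTA file in which a sequence may span one or more lines; `fasta_lengths` and `summarize` are hypothetical helper names and are not part of GimmeMotifs.

```python
from io import StringIO


def fasta_lengths(handle):
    """Yield (header, sequence_length) for each record in a FASTA stream."""
    header, length = None, 0
    for line in handle:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # A new header closes the previous record, if there was one.
            if header is not None:
                yield header, length
            header, length = line[1:], 0
        else:
            length += len(line)
    if header is not None:
        yield header, length


def summarize(handle):
    """Return record count plus min, max and mean sequence length."""
    lengths = [n for _, n in fasta_lengths(handle)]
    if not lengths:
        return {"records": 0, "min": 0, "max": 0, "mean": 0.0}
    return {
        "records": len(lengths),
        "min": min(lengths),
        "max": max(lengths),
        "mean": sum(lengths) / len(lengths),
    }


if __name__ == "__main__":
    # Tiny in-memory example; for a real file use: with open("AG.fa") as fh: ...
    demo = StringIO(">a\nACGT\nAC\n>b\nGGG\n")
    print(summarize(demo))
```

Running `summarize` on an open handle to AG.fa would report the region sizes alongside the record count.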
gimmemotifs.log for Random background
2020-02-12 20:39:42,345 - gimme.config - DEBUG - Using multiprocessing
2020-02-12 20:39:42,345 - gimme.config - DEBUG - Parameters:
2020-02-12 20:39:42,346 - gimme.config - DEBUG - fraction: 0.2
2020-02-12 20:39:42,346 - gimme.config - DEBUG - use_strand: False
2020-02-12 20:39:42,346 - gimme.config - DEBUG - abs_max: 1000
2020-02-12 20:39:42,346 - gimme.config - DEBUG - analysis: xl
2020-02-12 20:39:42,346 - gimme.config - DEBUG - enrichment: 1.5
2020-02-12 20:39:42,347 - gimme.config - DEBUG - size: 200
2020-02-12 20:39:42,347 - gimme.config - DEBUG - lsize: 500
2020-02-12 20:39:42,347 - gimme.config - DEBUG - background: ['random']
2020-02-12 20:39:42,347 - gimme.config - DEBUG - cluster_threshold: 0.95
2020-02-12 20:39:42,347 - gimme.config - DEBUG - scan_cutoff: 0.9
2020-02-12 20:39:42,347 - gimme.config - DEBUG - available_tools: MDmodule,MEME,MEMEW,DREME,Weeder,GADEM,MotifSampler,Trawler,Improbizer,BioProspector,Posmo,ChIPMunk,AMD,HMS,Homer,XXmotif,ProSampler,DiNAMO
2020-02-12 20:39:42,347 - gimme.config - DEBUG - tools: MEME,Homer,BioProspector
2020-02-12 20:39:42,348 - gimme.config - DEBUG - pvalue: 0.001
2020-02-12 20:39:42,348 - gimme.config - DEBUG - max_time: -1
2020-02-12 20:39:42,348 - gimme.config - DEBUG - ncpus: 12
2020-02-12 20:39:42,348 - gimme.config - DEBUG - motif_db: gimme.vertebrate.v5.0.pfm
2020-02-12 20:39:42,348 - gimme.config - DEBUG - use_cache: False
2020-02-12 20:39:42,348 - gimme.config - DEBUG - custom_background: /scratch/generate/combined/motifs_0.99-1_out/test/generated_background.random.fa
2020-02-12 20:39:42,348 - gimme.config - DEBUG - genome: hg19
2020-02-12 20:39:42,349 - gimme.config - DEBUG - No time limit for motif prediction
2020-02-12 20:39:42,349 - gimme.denovo - INFO - starting full motif analysis
2020-02-12 20:39:42,350 - gimme.denovo - DEBUG - Using temporary directory /tmp/gimmemotifs.265933.wih86yf9
2020-02-12 20:39:42,352 - gimme.denovo - INFO - using size of 200, set size to 0 to use original region size
2020-02-12 20:39:42,352 - gimme.denovo - INFO - preparing input from FASTA
2020-02-12 20:39:42,352 - gimme.denovo - INFO - preparing input (FASTA)
2020-02-12 20:39:42,353 - gimme.denovo - DEBUG - Splitting AG.fa into prediction set (/scratch/generate/combined/motifs_0.99-1_out/test/intermediate/prediction.fa) and validation set (/scratch/generate/combined/motifs_0.99-1_out/test/intermediate/validation.fa)
2020-02-12 20:39:49,584 - gimme.denovo - DEBUG - Random background: /scratch/generate/combined/motifs_0.99-1_out/test/intermediate/prediction.bg.fa
2020-02-12 20:40:15,442 - gimme.denovo - DEBUG - Random background: /scratch/generate/combined/motifs_0.99-1_out/test/intermediate/bg.random.fa
2020-02-12 20:40:15,550 - gimme.prediction - INFO - starting motif prediction (xl)
2020-02-12 20:40:15,550 - gimme.prediction - INFO - tools: MEME, BioProspector, Homer
2020-02-12 20:40:16,366 - gimme.prediction - DEBUG - Skipping AMD
2020-02-12 20:40:16,366 - gimme.prediction - DEBUG - Skipping GADEM
2020-02-12 20:40:16,366 - gimme.prediction - DEBUG - Skipping Improbizer
2020-02-12 20:40:16,366 - gimme.prediction - DEBUG - Skipping JASPAR
2020-02-12 20:40:16,366 - gimme.prediction - DEBUG - Skipping MEMEW
2020-02-12 20:40:16,366 - gimme.prediction - DEBUG - Skipping ProSampler
2020-02-12 20:40:16,367 - gimme.prediction - DEBUG - Skipping RPMCMC
2020-02-12 20:40:16,367 - gimme.prediction - DEBUG - Skipping trawler
2020-02-12 20:40:16,367 - gimme.prediction - DEBUG - Skipping Weeder
2020-02-12 20:40:16,367 - gimme.prediction - DEBUG - Skipping XXmotif
2020-02-12 20:40:16,367 - gimme.prediction - DEBUG - Starting BioProspector job, width 6
2020-02-12 20:40:16,367 - gimme.prediction - DEBUG - Starting BioProspector job, width 8
2020-02-12 20:40:16,367 - gimme.prediction - DEBUG - Starting BioProspector job, width 10
2020-02-12 20:40:16,371 - gimme.prediction - DEBUG - Starting BioProspector job, width 12
2020-02-12 20:40:16,372 - gimme.prediction - DEBUG - Starting BioProspector job, width 14
2020-02-12 20:40:16,372 - gimme.prediction - DEBUG - Starting BioProspector job, width 16
2020-02-12 20:40:16,372 - gimme.prediction - DEBUG - Starting BioProspector job, width 18
2020-02-12 20:40:16,373 - gimme.prediction - DEBUG - Starting BioProspector job, width 20
2020-02-12 20:40:16,376 - gimme.prediction - INFO - BioProspector_width_6 finished, found 0 motifs
2020-02-12 20:40:16,377 - gimme.prediction - DEBUG - Skipping ChIPMunk
2020-02-12 20:40:16,377 - gimme.prediction - DEBUG - stdout BioProspector_width_6:
2020-02-12 20:40:16,377 - gimme.prediction - DEBUG - Skipping DiNAMO
2020-02-12 20:40:16,378 - gimme.prediction - DEBUG - stdout BioProspector_width_6: BioProspector_width_6 failed to run: BioProspector is not configured
2020-02-12 20:40:16,378 - gimme.prediction - DEBUG - Skipping DREME
2020-02-12 20:40:16,378 - gimme.prediction - DEBUG - Skipping HMS
2020-02-12 20:40:16,378 - gimme.prediction - INFO - BioProspector_width_8 finished, found 0 motifs
2020-02-12 20:40:16,379 - gimme.prediction - DEBUG - Starting Homer job, width 6
2020-02-12 20:40:16,379 - gimme.prediction - DEBUG - stdout BioProspector_width_8:
2020-02-12 20:40:16,379 - gimme.prediction - DEBUG - Starting Homer job, width 8
2020-02-12 20:40:16,380 - gimme.prediction - DEBUG - stdout BioProspector_width_8: BioProspector_width_8 failed to run: BioProspector is not configured
2020-02-12 20:40:16,380 - gimme.prediction - DEBUG - Starting Homer job, width 10
2020-02-12 20:40:16,381 - gimme.prediction - DEBUG - Starting Homer job, width 12
2020-02-12 20:40:16,381 - gimme.prediction - INFO - BioProspector_width_10 finished, found 0 motifs
2020-02-12 20:40:16,381 - gimme.prediction - DEBUG - Starting Homer job, width 14
2020-02-12 20:40:16,381 - gimme.prediction - DEBUG - stdout BioProspector_width_10:
2020-02-12 20:40:16,381 - gimme.prediction - DEBUG - Starting Homer job, width 16
2020-02-12 20:40:16,382 - gimme.prediction - DEBUG - stdout BioProspector_width_10: BioProspector_width_10 failed to run: BioProspector is not configured
2020-02-12 20:40:16,382 - gimme.prediction - DEBUG - Starting Homer job, width 18
2020-02-12 20:40:16,382 - gimme.prediction - INFO - BioProspector_width_12 finished, found 0 motifs
2020-02-12 20:40:16,383 - gimme.prediction - DEBUG - Starting Homer job, width 20
2020-02-12 20:40:16,383 - gimme.prediction - DEBUG - stdout BioProspector_width_12:
2020-02-12 20:40:16,383 - gimme.prediction - DEBUG - Skipping MDmodule
2020-02-12 20:40:16,383 - gimme.prediction - DEBUG - stdout BioProspector_width_12: BioProspector_width_12 failed to run: BioProspector is not configured
2020-02-12 20:40:16,384 - gimme.prediction - DEBUG - Starting MEME job, width 6
2020-02-12 20:40:16,384 - gimme.prediction - DEBUG - Starting MEME job, width 8
2020-02-12 20:40:16,384 - gimme.prediction - INFO - BioProspector_width_14 finished, found 0 motifs
2020-02-12 20:40:16,384 - gimme.prediction - DEBUG - Starting MEME job, width 10
2020-02-12 20:40:16,385 - gimme.prediction - DEBUG - stdout BioProspector_width_14:
2020-02-12 20:40:16,385 - gimme.prediction - DEBUG - Starting MEME job, width 12
2020-02-12 20:40:16,385 - gimme.prediction - DEBUG - stdout BioProspector_width_14: BioProspector_width_14 failed to run: BioProspector is not configured
2020-02-12 20:40:16,385 - gimme.prediction - DEBUG - Starting MEME job, width 14
2020-02-12 20:40:16,386 - gimme.prediction - INFO - BioProspector_width_16 finished, found 0 motifs
2020-02-12 20:40:16,386 - gimme.prediction - DEBUG - Starting MEME job, width 16
2020-02-12 20:40:16,386 - gimme.prediction - DEBUG - stdout BioProspector_width_16:
2020-02-12 20:40:16,386 - gimme.prediction - DEBUG - Starting MEME job, width 18
2020-02-12 20:40:16,387 - gimme.prediction - DEBUG - stdout BioProspector_width_16: BioProspector_width_16 failed to run: BioProspector is not configured
2020-02-12 20:40:16,387 - gimme.prediction - DEBUG - Starting MEME job, width 20
2020-02-12 20:40:16,387 - gimme.prediction - DEBUG - Skipping MotifSampler
2020-02-12 20:40:16,387 - gimme.prediction - INFO - BioProspector_width_18 finished, found 0 motifs
2020-02-12 20:40:16,388 - gimme.prediction - DEBUG - Skipping Posmo
2020-02-12 20:40:16,388 - gimme.prediction - DEBUG - stdout BioProspector_width_18:
2020-02-12 20:40:16,388 - gimme.prediction - DEBUG - Skipping YAMDA
2020-02-12 20:40:16,388 - gimme.prediction - DEBUG - stdout BioProspector_width_18: BioProspector_width_18 failed to run: BioProspector is not configured
2020-02-12 20:40:16,388 - gimme.prediction - INFO - all jobs submitted
2020-02-12 20:40:16,389 - gimme.prediction - INFO - BioProspector_width_20 finished, found 0 motifs
2020-02-12 20:40:16,389 - gimme.prediction - DEBUG - stdout BioProspector_width_20:
2020-02-12 20:40:16,390 - gimme.prediction - DEBUG - stdout BioProspector_width_20: BioProspector_width_20 failed to run: BioProspector is not configured
2020-02-12 20:40:16,390 - gimme.prediction - INFO - Homer_width_6 finished, found 0 motifs
2020-02-12 20:40:16,390 - gimme.prediction - DEBUG - stdout Homer_width_6:
2020-02-12 20:40:16,390 - gimme.prediction - DEBUG - stdout Homer_width_6: Homer_width_6 failed to run: Homer is not configured
2020-02-12 20:40:16,390 - gimme.prediction - INFO - Homer_width_8 finished, found 0 motifs
2020-02-12 20:40:16,391 - gimme.prediction - DEBUG - stdout Homer_width_8:
2020-02-12 20:40:16,391 - gimme.prediction - DEBUG - stdout Homer_width_8: Homer_width_8 failed to run: Homer is not configured
2020-02-12 20:40:16,391 - gimme.prediction - INFO - Homer_width_14 finished, found 0 motifs
2020-02-12 20:40:16,391 - gimme.prediction - DEBUG - stdout Homer_width_14:
2020-02-12 20:40:16,392 - gimme.prediction - DEBUG - stdout Homer_width_14: Homer_width_14 failed to run: Homer is not configured
2020-02-12 20:40:16,392 - gimme.prediction - INFO - Homer_width_12 finished, found 0 motifs
2020-02-12 20:40:16,392 - gimme.prediction - DEBUG - stdout Homer_width_12:
2020-02-12 20:40:16,392 - gimme.prediction - DEBUG - stdout Homer_width_12: Homer_width_12 failed to run: Homer is not configured
2020-02-12 20:40:16,393 - gimme.prediction - INFO - Homer_width_10 finished, found 0 motifs
2020-02-12 20:40:16,393 - gimme.prediction - DEBUG - stdout Homer_width_10:
2020-02-12 20:40:16,393 - gimme.prediction - DEBUG - stdout Homer_width_10: Homer_width_10 failed to run: Homer is not configured
2020-02-12 20:40:16,393 - gimme.prediction - INFO - Homer_width_16 finished, found 0 motifs
2020-02-12 20:40:16,394 - gimme.prediction - DEBUG - stdout Homer_width_16:
2020-02-12 20:40:16,394 - gimme.prediction - DEBUG - stdout Homer_width_16: Homer_width_16 failed to run: Homer is not configured
2020-02-12 20:40:16,394 - gimme.prediction - INFO - Homer_width_18 finished, found 0 motifs
2020-02-12 20:40:16,394 - gimme.prediction - DEBUG - stdout Homer_width_18:
2020-02-12 20:40:16,394 - gimme.prediction - DEBUG - stdout Homer_width_18: Homer_width_18 failed to run: Homer is not configured
2020-02-12 20:40:16,395 - gimme.prediction - INFO - Homer_width_20 finished, found 0 motifs
2020-02-12 20:40:16,395 - gimme.prediction - DEBUG - stdout Homer_width_20:
2020-02-12 20:40:16,395 - gimme.prediction - DEBUG - stdout Homer_width_20: Homer_width_20 failed to run: Homer is not configured
2020-02-12 20:40:16,395 - gimme.prediction - INFO - MEME_width_6 finished, found 0 motifs
2020-02-12 20:40:16,396 - gimme.prediction - DEBUG - stdout MEME_width_6:
2020-02-12 20:40:16,396 - gimme.prediction - DEBUG - stdout MEME_width_6: MEME_width_6 failed to run: MEME is not configured
2020-02-12 20:40:16,396 - gimme.prediction - INFO - MEME_width_8 finished, found 0 motifs
2020-02-12 20:40:16,396 - gimme.prediction - DEBUG - stdout MEME_width_8:
2020-02-12 20:40:16,397 - gimme.prediction - DEBUG - stdout MEME_width_8: MEME_width_8 failed to run: MEME is not configured
2020-02-12 20:40:16,397 - gimme.prediction - INFO - MEME_width_10 finished, found 0 motifs
2020-02-12 20:40:16,397 - gimme.prediction - DEBUG - stdout MEME_width_10:
2020-02-12 20:40:16,397 - gimme.prediction - DEBUG - stdout MEME_width_10: MEME_width_10 failed to run: MEME is not configured
2020-02-12 20:40:16,398 - gimme.prediction - INFO - MEME_width_12 finished, found 0 motifs
2020-02-12 20:40:16,398 - gimme.prediction - DEBUG - stdout MEME_width_12:
2020-02-12 20:40:16,398 - gimme.prediction - DEBUG - stdout MEME_width_12: MEME_width_12 failed to run: MEME is not configured
2020-02-12 20:40:16,398 - gimme.prediction - INFO - MEME_width_14 finished, found 0 motifs
2020-02-12 20:40:16,399 - gimme.prediction - DEBUG - stdout MEME_width_14:
2020-02-12 20:40:16,399 - gimme.prediction - DEBUG - stdout MEME_width_14: MEME_width_14 failed to run: MEME is not configured
2020-02-12 20:40:16,399 - gimme.prediction - INFO - MEME_width_16 finished, found 0 motifs
2020-02-12 20:40:16,399 - gimme.prediction - DEBUG - stdout MEME_width_16:
2020-02-12 20:40:16,399 - gimme.prediction - DEBUG - stdout MEME_width_16: MEME_width_16 failed to run: MEME is not configured
2020-02-12 20:40:16,400 - gimme.prediction - INFO - MEME_width_18 finished, found 0 motifs
2020-02-12 20:40:16,400 - gimme.prediction - DEBUG - stdout MEME_width_18:
2020-02-12 20:40:16,400 - gimme.prediction - DEBUG - stdout MEME_width_18: MEME_width_18 failed to run: MEME is not configured
2020-02-12 20:40:16,400 - gimme.prediction - INFO - MEME_width_20 finished, found 0 motifs
2020-02-12 20:40:16,401 - gimme.prediction - DEBUG - stdout MEME_width_20:
2020-02-12 20:40:16,401 - gimme.prediction - DEBUG - stdout MEME_width_20: MEME_width_20 failed to run: MEME is not configured
2020-02-12 20:40:16,401 - gimme.prediction - DEBUG - waiting for statistics to finish
2020-02-12 20:40:18,404 - gimme.prediction - INFO - predicted 0 motifs
2020-02-12 20:40:18,406 - gimme.prediction - DEBUG - written to /scratch/generate/combined/motifs_0.99-1_out/test/intermediate/all_motifs.pfm
2020-02-12 20:40:18,406 - gimme.prediction - INFO - no motifs found
2020-02-12 20:40:18,406 - gimme.denovo - INFO - finished
gimmemotifs.log for GC background
No gimmemotifs.log file
Ok, two points that I see.
1) There seems to be something wrong with the configuration of the motif prediction tools. Normally gimme would initialize this on the first run. You are now running in a conda environment, right? Can you try to delete ~/.config/gimmemotifs/gimmemotifs.cfg and then run gimme motifs again? Can you also download this file and run gimme motifs on it to see if it works?
2) I'm afraid your input sequence set is too large for de novo motif prediction, both in terms of the size of the regions and the number of regions. Motif prediction works best if you have smaller regions, say 100-1000 bp. Beyond that, performance usually deteriorates quickly (and running time increases). Second, the number of regions is quite large. By default, gimme motifs selects only 1,000 regions for de novo motif prediction, but it calculates statistics and enrichment on all of them. This will take a very long time. If possible, I would try to get your input set down to 10,000 regions of at most 500 bp each.
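For what it's worth, here is a minimal sketch of one way to shrink a FASTA file along those lines, using only the Python standard library. It is not part of gimmemotifs; the function names (read_fasta, center_trim, shrink_fasta) are made up for illustration, and trimming around the midpoint of each sequence is an assumption that may not suit every dataset (for promoter-style regions you may prefer to keep a window around a known position instead).

```python
import random


def read_fasta(lines):
    """Yield (header, sequence) pairs from an iterable of FASTA lines."""
    header, chunks = None, []
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        else:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def center_trim(seq, max_len=500):
    """Keep at most max_len bases around the midpoint of seq (assumption)."""
    if len(seq) <= max_len:
        return seq
    start = (len(seq) - max_len) // 2
    return seq[start:start + max_len]


def shrink_fasta(records, max_regions=10000, max_len=500, seed=42):
    """Randomly keep at most max_regions records and trim each sequence."""
    records = list(records)
    if len(records) > max_regions:
        # Fixed seed so the same subset is selected on every run.
        records = random.Random(seed).sample(records, max_regions)
    return [(header, center_trim(seq, max_len)) for header, seq in records]
```

To use it, pass an open file handle to read_fasta, feed the result to shrink_fasta, and write each returned pair back out as a ">" header line followed by the sequence.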
Hi @simonvh,
After recreating the gimmemotifs.cfg file I am now getting a new error:
2020-02-18 02:42:09,693 - INFO - starting motif prediction (xl)
2020-02-18 02:42:09,700 - INFO - tools: MEME, BioProspector, Homer
2020-02-18 02:49:13,507 - INFO - all jobs submitted
2020-02-18 02:51:34,360 - INFO - Homer_width_6 finished, found 5 motifs
2020-02-18 02:52:25,617 - INFO - Homer_width_8 finished, found 5 motifs
2020-02-18 02:55:04,288 - INFO - Homer_width_10 finished, found 5 motifs
2020-02-18 03:32:14,956 - INFO - Homer_width_16 finished, found 5 motifs
2020-02-18 03:37:49,282 - INFO - Homer_width_12 finished, found 5 motifs
2020-02-18 03:38:21,142 - INFO - Homer_width_14 finished, found 5 motifs
2020-02-18 03:44:24,751 - INFO - MEME_width_6 finished, found 10 motifs
2020-02-18 03:44:57,824 - INFO - MEME_width_8 finished, found 10 motifs
2020-02-18 03:50:54,469 - INFO - MEME_width_10 finished, found 10 motifs
2020-02-18 03:51:15,927 - INFO - MEME_width_12 finished, found 10 motifs
2020-02-18 03:57:17,854 - INFO - MEME_width_14 finished, found 10 motifs
2020-02-18 03:57:43,490 - INFO - MEME_width_16 finished, found 10 motifs
2020-02-18 04:03:37,277 - INFO - MEME_width_18 finished, found 10 motifs
2020-02-18 04:04:00,668 - INFO - MEME_width_20 finished, found 10 motifs
2020-02-18 04:15:10,326 - INFO - Homer_width_18 finished, found 5 motifs
2020-02-18 04:25:16,733 - INFO - BioProspector_width_6 finished, found 5 motifs
2020-02-18 04:38:13,157 - INFO - BioProspector_width_8 finished, found 5 motifs
2020-02-18 04:53:53,012 - INFO - BioProspector_width_10 finished, found 5 motifs
2020-02-18 05:08:57,453 - INFO - BioProspector_width_12 finished, found 5 motifs
2020-02-18 05:21:18,184 - INFO - BioProspector_width_14 finished, found 5 motifs
2020-02-18 05:34:35,552 - INFO - BioProspector_width_16 finished, found 5 motifs
2020-02-18 05:49:17,098 - INFO - BioProspector_width_18 finished, found 5 motifs
2020-02-18 06:02:10,619 - INFO - BioProspector_width_20 finished, found 5 motifs
2020-02-18 07:00:52,947 - INFO - Homer_width_20 finished, found 5 motifs
Traceback (most recent call last):
File "/scratch/miniconda3/envs/gimme/bin/gimme", line 11, in <module>
cli(sys.argv[1:])
File "/scratch/miniconda3/envs/gimme/lib/python3.6/site-packages/gimmemotifs/cli.py", line 625, in cli
args.func(args)
File "/scratch/miniconda3/envs/gimme/lib/python3.6/site-packages/gimmemotifs/commands/motifs.py", line 94, in motifs
"size": args.size,
File "/scratch/miniconda3/envs/gimme/lib/python3.6/site-packages/gimmemotifs/denovo.py", line 619, in gimme_motifs
stats_bg=background,
File "/scratch/miniconda3/envs/gimme/lib/python3.6/site-packages/gimmemotifs/prediction.py", line 372, in predict_motifs
stats_bg=stats_bg,
File "/scratch/miniconda3/envs/gimme/lib/python3.6/site-packages/gimmemotifs/prediction.py", line 320, in pp_predict_motifs
result.wait_for_stats()
File "/scratch/miniconda3/envs/gimme/lib/python3.6/site-packages/gimmemotifs/prediction.py", line 178, in wait_for_stats
job.get()
File "/scratch/miniconda3/envs/gimme/lib/python3.6/multiprocessing/pool.py", line 670, in get
raise self._value
File "/scratch/miniconda3/envs/gimme/lib/python3.6/multiprocessing/pool.py", line 450, in _handle_tasks
put(task)
File "/scratch/miniconda3/envs/gimme/lib/python3.6/multiprocessing/connection.py", line 206, in send
self._send_bytes(_ForkingPickler.dumps(obj))
File "/scratch/miniconda3/envs/gimme/lib/python3.6/multiprocessing/connection.py", line 393, in _send_bytes
header = struct.pack("!i", n)
struct.error: 'i' format requires -2147483648 <= number <= 2147483647
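This means that you should decrease the number of input sequences. The traceback shows the failure in multiprocessing, which cannot send the data to the worker processes because the pickled payload is larger than a signed 32-bit length header allows (about 2 GiB). It is a bug that should be solved with a newer version of Python. However, it might be best anyway to use a limited set of sequences (< ~100k) as input.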
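The limit itself is easy to reproduce outside of gimme. The short sketch below only mirrors the struct.pack("!i", n) call from the last frame of the traceback; fits_in_header is a hypothetical helper name, not something from multiprocessing or gimmemotifs. As far as I know, the restriction was lifted in Python 3.8, but treat that as something to verify for your own setup.

```python
import struct

# Largest payload size (in bytes) that a signed 32-bit "!i" header can describe.
INT32_MAX = 2**31 - 1


def fits_in_header(n_bytes):
    """Return True if n_bytes can be packed with the "!i" format."""
    try:
        struct.pack("!i", n_bytes)
        return True
    except struct.error:
        # Same error class as in the traceback above.
        return False
```

Anything pickled for a worker process that is larger than INT32_MAX bytes would hit the same error on the Python 3.6 environment shown in the traceback, which is why a smaller input set avoids it.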
I am having a similar error when trying to run gimme motifs on my own bed or fasta files, so I did the following:
genomepy install hg38 UCSC --annotation
gimme motifs --known -g hg38 TAp73alpha.fa ./test
yet I still get the following error:
2020-05-07 16:30:52,415 - INFO - No config found.
2020-05-07 16:30:52,416 - INFO - Creating new config.
2020-05-07 16:30:52,429 - INFO - Using included version of MDmodule.
2020-05-07 16:30:52,440 - INFO - Using system version of MEME.
2020-05-07 16:30:52,446 - INFO - Using system version of MEMEW.
2020-05-07 16:30:52,452 - INFO - Using system version of DREME.
2020-05-07 16:30:52,457 - INFO - Using system version of Weeder.
2020-05-07 16:30:52,463 - INFO - Using system version of GADEM.
2020-05-07 16:30:52,463 - INFO - Using included version of MotifSampler.
2020-05-07 16:30:52,468 - INFO - Using system version of Trawler.
2020-05-07 16:30:52,468 - INFO - Using included version of Improbizer.
2020-05-07 16:30:52,469 - INFO - Using included version of BioProspector.
2020-05-07 16:30:52,469 - INFO - Using included version of Posmo.
2020-05-07 16:30:52,470 - INFO - Using included version of ChIPMunk.
2020-05-07 16:30:52,470 - INFO - Using included version of AMD.
2020-05-07 16:30:52,470 - INFO - Using included version of HMS.
2020-05-07 16:30:52,477 - INFO - Using system version of Homer.
2020-05-07 16:30:52,486 - INFO - Using system version of XXmotif.
2020-05-07 16:30:52,494 - INFO - Using system version of ProSampler.
2020-05-07 16:30:52,494 - WARNING - Yamda not in config
2020-05-07 16:30:52,500 - INFO - Using system version of DiNAMO.
2020-05-07 16:30:52,513 - WARNING - RPMCMC not found. To include it you will have to install it.
2020-05-07 16:30:52,546 - INFO - Configuration file: /home/abcaldwe/.config/gimmemotifs/gimmemotifs.cfg
2020-05-07 16:30:53,004 - INFO - creating background (matched GC%)
2020-05-07 16:30:53,051 - INFO - Creating index for genomic GC frequencies.
Traceback (most recent call last):
File "/home/abcaldwe/anaconda3/envs/gimmemotifs/bin/gimme", line 11, in
Is it possible that the installation of the hg38 genome annotation with genomepy failed? I also downloaded hg38.fa and the chrom.sizes directly from UCSC, but I get the same error.
@andrewbcaldwell, I encountered your error before. It was solved by updating pyarrow in the gimmemotifs environment.
Try conda update pyarrow
For me, pyarrow 0.13.0 was installed originally. After it was updated to 0.16.0, I got past that error.
@siebrenf Thanks for the tip! I had tried conda update pyarrow earlier to no avail, but forcing the update to version 0.16.0 with conda update pyarrow=0.16.0 solved the issue.
Thanks for reporting this @andrewbcaldwell and for the fix @siebrenf. I'll update the conda package to reflect this dependency!
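For anyone else landing here, a quick way to check which pyarrow version is active in the environment is sketched below. The 0.16.0 threshold is taken only from the reports in this thread, not from any documented requirement, and version_tuple and is_older are hypothetical helper names.

```python
def version_tuple(version):
    """Turn a string such as '0.16.0' into (0, 16, 0)."""
    parts = []
    for piece in version.split("."):
        digits = ""
        for ch in piece:
            if not ch.isdigit():
                break
            digits += ch
        # Pieces without leading digits (e.g. 'dev1') count as 0.
        parts.append(int(digits) if digits else 0)
    return tuple(parts)


def is_older(installed, minimum="0.16.0"):
    """Return True if the installed version is below the minimum."""
    return version_tuple(installed) < version_tuple(minimum)


try:
    import pyarrow
    print("pyarrow", pyarrow.__version__,
          "older than 0.16.0:", is_older(pyarrow.__version__))
except ImportError:
    print("pyarrow is not installed in this environment")
```

Run it with the Python interpreter of the gimmemotifs environment so that it reports the version gimme actually uses.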
Describe the bug A clear and concise description of what the bug is.
gimme motifs fails and does not produce any motifs. I have run gimme motifs in multiple different environments and in different contexts without success. I have also installed gimme from conda and from cloning the GitHub repo and running it locally.
To Reproduce Steps to reproduce the behavior:
and
gimme motifs -g hg19 -b random AG.fa /scratch/generate/combined/motifs_0.99-1_out/AG/
Expected behavior A clear and concise description of what you expected to happen.
That gimme motifs does not error out and produces identified motifs.
Error logs If applicable, add error logs to help explain your problem.
1 Random background
2 GC Background
Installation information (please complete the following information):
OS: [Linux]
Installation [conda]
Version [v0.14.2]
Additional context Add any other context about the problem here.