vanheeringen-lab / gimmemotifs

Suite of motif tools, including a motif prediction pipeline for ChIP-seq experiments. See full GimmeMotifs documentation for detailed installation instructions and usage examples.
https://gimmemotifs.readthedocs.io/en/master
MIT License

Gimme motif Errors without results #107

Closed mikecormier closed 1 year ago

mikecormier commented 4 years ago

Describe the bug

gimme motifs fails and does not produce any motifs.

I have run gimme motifs in multiple environments and in different contexts without success. I have installed gimme both from conda and by cloning the GitHub repo and running it locally.

To Reproduce

gimme motifs -g hg19 AG.fa /scratch/generate/combined/motifs_0.99-1_out/AG/

and

gimme motifs -g hg19 -b random AG.fa /scratch/generate/combined/motifs_0.99-1_out/AG/

Expected behavior

gimme motifs should run without errors and report the identified motifs.

Error logs

1 Random background

2020-02-12 12:12:05,122 - INFO - creating background (random)
2020-02-12 12:29:40,299 - INFO - starting full motif analysis
2020-02-12 12:29:40,300 - INFO - using size of 200, set size to 0 to use original region size
2020-02-12 12:29:40,300 - INFO - preparing input from FASTA
2020-02-12 12:29:40,300 - INFO - preparing input (FASTA)
2020-02-12 16:43:07,691 - INFO - starting motif prediction (xl)
2020-02-12 16:43:07,693 - INFO - tools: MEME, BioProspector, Homer
2020-02-12 16:48:09,093 - INFO - BioProspector_width_6 finished, found 0 motifs
2020-02-12 16:48:09,096 - INFO - BioProspector_width_8 finished, found 0 motifs
2020-02-12 16:48:09,098 - INFO - BioProspector_width_16 finished, found 0 motifs
2020-02-12 16:48:09,100 - INFO - BioProspector_width_10 finished, found 0 motifs
2020-02-12 16:48:09,102 - INFO - BioProspector_width_12 finished, found 0 motifs
2020-02-12 16:48:09,104 - INFO - BioProspector_width_14 finished, found 0 motifs
2020-02-12 16:48:09,106 - INFO - BioProspector_width_18 finished, found 0 motifs
2020-02-12 16:48:09,107 - INFO - BioProspector_width_20 finished, found 0 motifs
2020-02-12 16:48:09,108 - INFO - all jobs submitted
2020-02-12 16:48:09,109 - INFO - Homer_width_6 finished, found 0 motifs
2020-02-12 16:48:09,109 - INFO - Homer_width_8 finished, found 0 motifs
2020-02-12 16:48:09,110 - INFO - Homer_width_10 finished, found 0 motifs
2020-02-12 16:48:09,111 - INFO - Homer_width_14 finished, found 0 motifs
2020-02-12 16:48:09,111 - INFO - Homer_width_12 finished, found 0 motifs
2020-02-12 16:48:09,112 - INFO - Homer_width_18 finished, found 0 motifs
2020-02-12 16:48:09,113 - INFO - Homer_width_16 finished, found 0 motifs
2020-02-12 16:48:09,113 - INFO - Homer_width_20 finished, found 0 motifs
2020-02-12 16:48:09,114 - INFO - MEME_width_6 finished, found 0 motifs
2020-02-12 16:48:09,115 - INFO - MEME_width_8 finished, found 0 motifs
2020-02-12 16:48:09,115 - INFO - MEME_width_10 finished, found 0 motifs
2020-02-12 16:48:09,116 - INFO - MEME_width_12 finished, found 0 motifs
2020-02-12 16:48:09,117 - INFO - MEME_width_14 finished, found 0 motifs
2020-02-12 16:48:09,117 - INFO - MEME_width_16 finished, found 0 motifs
2020-02-12 16:48:09,118 - INFO - MEME_width_18 finished, found 0 motifs
2020-02-12 16:48:09,119 - INFO - MEME_width_20 finished, found 0 motifs
2020-02-12 16:48:11,122 - INFO - predicted 0 motifs
2020-02-12 16:48:11,134 - INFO - no motifs found
2020-02-12 16:48:11,135 - INFO - finished
Traceback (most recent call last):
  File "/scratch/miniconda3/envs/gimme_git/bin/gimme", line 7, in <module>
    exec(compile(f.read(), __file__, 'exec'))
  File "/scratch/GimmeMotif/gimmemotifs/scripts/gimme", line 11, in <module>
    cli(sys.argv[1:])
  File "/scratch/GimmeMotif/gimmemotifs/gimmemotifs/cli.py", line 625, in cli
    args.func(args)
  File "/scratch/GimmeMotif/gimmemotifs/gimmemotifs/commands/motifs.py", line 97, in motifs
    denovo = read_motifs(os.path.join(args.outdir, "gimme.denovo.pfm"))
  File "/scratch/GimmeMotif/gimmemotifs/gimmemotifs/motif.py", line 1522, in read_motifs
    infile = pfmfile_location(infile)
  File "/scratch/GimmeMotif/gimmemotifs/gimmemotifs/utils.py", line 103, in pfmfile_location
    raise ValueError("Motif file {} not found".format(infile))
ValueError: Motif file /scratch/generate/combined/motifs_0.99-1_out/AG/gimme.denovo.pfm not found
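
The traceback above shows the run ending in a ValueError because gimme.denovo.pfm was never written after every predictor reported 0 motifs. A minimal sketch of a guard that a downstream script could use is shown below. `find_denovo_pfm` is a hypothetical helper and not part of GimmeMotifs; the only detail taken from the log is the output file name.

```python
import os


def find_denovo_pfm(outdir, filename="gimme.denovo.pfm"):
    """Return the path to the de novo motif file, or None if none was written.

    Hypothetical helper for illustration only; the default file name is
    copied from the traceback, everything else is an assumption.
    """
    path = os.path.join(outdir, filename)
    # Treat a missing or empty file as "no motifs predicted".
    if os.path.isfile(path) and os.path.getsize(path) > 0:
        return path
    return None


if __name__ == "__main__":
    pfm = find_denovo_pfm("/scratch/generate/combined/motifs_0.99-1_out/AG/")
    if pfm is None:
        print("no de novo motifs were written; skipping downstream steps")
    else:
        print("de novo motifs:", pfm)
```

This only avoids reading a file that does not exist. It does not explain why the predictors found 0 motifs.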

2 GC Background

2020-02-12 20:41:12,380 - INFO - creating background (matched GC%)
2020-02-12 20:41:12,488 - INFO - Creating index for genomic GC frequencies.
Traceback (most recent call last):
  File "/scratch/miniconda3/envs/gimme/bin/gimme", line 11, in <module>
    cli(sys.argv[1:])
  File "/scratch/miniconda3/envs/gimme/lib/python3.6/site-packages/gimmemotifs/cli.py", line 625, in cli
    args.func(args)
  File "/scratch/miniconda3/envs/gimme/lib/python3.6/site-packages/gimmemotifs/commands/motifs.py", line 75, in motifs
    number=10000,
  File "/scratch/miniconda3/envs/gimme/lib/python3.6/site-packages/gimmemotifs/background.py", line 122, in create_background_file
    m = MatchedGcFasta(inputfile, genome, number=number, size=size)
  File "/scratch/miniconda3/envs/gimme/lib/python3.6/site-packages/gimmemotifs/background.py", line 548, in __init__
    matched_gc_bedfile(tmpbed, matchfile, genome, number, size=size)
  File "/scratch/miniconda3/envs/gimme/lib/python3.6/site-packages/gimmemotifs/background.py", line 512, in matched_gc_bedfile
    min_bin_size=min_bin_size,
  File "/scratch/miniconda3/envs/gimme/lib/python3.6/site-packages/gimmemotifs/background.py", line 387, in gc_bin_bedfile
    create_gc_bin_index(genome, fname, min_bin_size=min_bin_size)
  File "/scratch/miniconda3/envs/gimme/lib/python3.6/site-packages/gimmemotifs/background.py", line 352, in create_gc_bin_index
    df.reset_index()[cols].to_feather(fname)
  File "/scratch/miniconda3/envs/gimme/lib/python3.6/site-packages/pandas/util/_decorators.py", line 214, in wrapper
    return func(*args, **kwargs)
  File "/scratch/miniconda3/envs/gimme/lib/python3.6/site-packages/pandas/core/frame.py", line 1994, in to_feather
    to_feather(self, path)
  File "/scratch/miniconda3/envs/gimme/lib/python3.6/site-packages/pandas/io/feather_format.py", line 64, in to_feather
    feather.write_feather(df, path)
  File "/scratch/miniconda3/envs/gimme/lib/python3.6/site-packages/pyarrow/feather.py", line 180, in write_feather
    writer.write(df)
  File "/scratch/miniconda3/envs/gimme/lib/python3.6/site-packages/pyarrow/feather.py", line 91, in write
    table = Table.from_pandas(df, preserve_index=False)
  File "pyarrow/table.pxi", line 1139, in pyarrow.lib.Table.from_pandas
  File "/scratch/miniconda3/envs/gimme/lib/python3.6/site-packages/pyarrow/pandas_compat.py", line 474, in dataframe_to_arrays
    convert_types))
  File "/scratch/miniconda3/envs/gimme/lib/python3.6/concurrent/futures/_base.py", line 586, in result_iterator
    yield fs.pop().result()
  File "/scratch/miniconda3/envs/gimme/lib/python3.6/concurrent/futures/_base.py", line 425, in result
    return self.__get_result()
  File "/scratch/miniconda3/envs/gimme/lib/python3.6/concurrent/futures/_base.py", line 384, in __get_result
    raise self._exception
  File "/scratch/miniconda3/envs/gimme/lib/python3.6/concurrent/futures/thread.py", line 56, in run
    result = self.fn(*self.args, **self.kwargs)
  File "/scratch/miniconda3/envs/gimme/lib/python3.6/site-packages/pyarrow/pandas_compat.py", line 463, in convert_column
    raise e
  File "/scratch/miniconda3/envs/gimme/lib/python3.6/site-packages/pyarrow/pandas_compat.py", line 457, in convert_column
    return pa.array(col, type=ty, from_pandas=True, safe=safe)
  File "pyarrow/array.pxi", line 169, in pyarrow.lib.array
  File "pyarrow/array.pxi", line 78, in pyarrow.lib._ndarray_to_array
  File "pyarrow/error.pxi", line 91, in pyarrow.lib.check_status
pyarrow.lib.ArrowTypeError: ('an integer is required (got type str)', 'Conversion failed for column chrom with type object')
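
The final ArrowTypeError says that the chrom column has dtype object and that a string was found where an integer was expected. That would be consistent with a column mixing integers and strings, although this is an inference from the message and not a confirmed diagnosis. The sketch below uses plain Python to show how such a mix could be detected and coerced to a single type. `value_types` and `as_strings` are hypothetical helper names, and the sample values are invented for illustration.

```python
def value_types(values):
    """Return the sorted names of the Python types present in a column."""
    return sorted({type(v).__name__ for v in values})


def as_strings(values):
    """Coerce every value to str so the column has one consistent type."""
    return [str(v) for v in values]


if __name__ == "__main__":
    # Invented example: chromosome labels read partly as ints, partly as strs.
    chrom = [1, 2, "X", "chrM"]
    print(value_types(chrom))              # more than one type: mixed column
    print(value_types(as_strings(chrom)))  # a single type after coercion
```

With a pandas DataFrame, the equivalent coercion would be `df["chrom"] = df["chrom"].astype(str)` before writing the file.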


simonvh commented 4 years ago

@mikecormier can you post (the top of) your input file, AG.fa? Have you tried other inputs? If you still have them, can you send me the gimmemotifs.log files of your failed runs?
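
A short script can summarise a FASTA file without pasting whole sequences. The following is a minimal sketch, assuming the input is a plain-text FASTA file with '>' header lines. `summarize_fasta` is a hypothetical helper and not part of GimmeMotifs.

```python
def summarize_fasta(path, limit=5):
    """Return (header, sequence_length) pairs for the first `limit` records."""
    records = []
    header, length = None, 0
    with open(path) as handle:
        for line in handle:
            line = line.strip()
            if not line:
                continue  # skip blank lines
            if line.startswith(">"):
                # A new header closes the previous record, if there was one.
                if header is not None:
                    records.append((header, length))
                    if len(records) == limit:
                        return records
                header, length = line[1:], 0
            elif header is not None:
                # Sequence may be wrapped over several lines; sum them up.
                length += len(line)
    if header is not None:
        records.append((header, length))
    return records
```

Calling `summarize_fasta("AG.fa")` would list the first five record names with their total sequence lengths, which makes it easy to see how the headers are formatted and how long each region is.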

mikecormier commented 4 years ago

Hey @simonvh, I have tried other input files. AG.fa is the smallest of the input files I am using. Each input file covers a different set of regions we are interested in, and for each we know at least a few of the motifs that should be present. Every file produces the same errors.

Head of AG.fa

>chr1 pos:879077 gene_name:SAMD11 strand:+
CCAAAAGCTTTTTATTCTCCTCTAGGGGGATGAGAGGGGGGCTCGTTAACTTGCACAAGAGGCTAGATGGCGGGTGGGGCAGCTGGGTGCCTGCTGTGGATCTCTTCTGCACACACGCACCAGGGCCAGTGTCAGAGCTCCCCTGTGCCCCTGTCCCGCCACAGCCAGGCGTGATGTCCTCTGCGCTGAAGGCTGGGGCTGCCAGGGCTGGGCAAGGCCTGTACTCACCAGGACCAAGGGCCCCCTGAGAGATGGTGGGTGCGGTCCAGGCTGAGCTGGAGCAGGGGCTGGGTTCCCCTTCCATTCCTTGAGATGCAGGTGGGCACTCACTACCCTCCCGCAGGTGACCTGTTGGGCAAGAGGCTGGGCCGCTCCCCCCGTATCAGCAGCGACTGCTTTTCAGAGAAGAGGGCACGAAGCGAATCGCCTCAAGGTAAGAGCGTGGCTGGGACGAGAGACAGGTCACCAGGGGAGGGGGCAGTCCCTGAGGGTCCCCTGGACCTCGAGCAGGCACTCTAGAGGGGCGTGGTCCTCGGCAGTGCCTGGAGAAACCTCTCACCCCGGGTCCTCCCCAGCAGAGGCGCTGCTGCTGCCGCGGGAGCTGGGGCCCAGCATGGCCCCGGAGGACCATTACCGCCGGCTTGTGTCAGCACTGAGCGAGGCCAGCACCTTTGAGGACCCTCAGCGCCTCTACCACCTGGGCCTCCCCAGCCACGGTGAGGACCCACCCTGGCATGATCCCCCTCATCACCTCCCCAGCCACGGTGAGGACCCACCCTGGCATGATCTCCCCTCATCACCTCCCCAGCCACATGTACTCGGCCATTCCTGTTGCTGAGGCCCTGCTGACACCAAGGCCAGGCTGGATGCAGGTCCCTCTGCCACACGTCCTGCCCCATGCCCCCTGGGGCGGGCCACACCTCCATGTCCCCTAGGTCCCCAGGGTCATGACTAGCTCACATTTTATATAGAGAGAAATGGAGTCTGGGGTGGACCCAGGTGAGGGTGGGCAGTGGGCATGTCAGCAGCACCCCCCGAGGAGAGCAAGCTCCTGGACCCTGTGGTCTGTGAGTCGTCTATGCAGCCAGTGGACGCCGACCTGCCAGACGCCTGCCCCAGGAGCCTGGGGAGGGGCAGTGAGCAGAAAGGCCGGGCTGGGTGCAGTGGGCACTTGGCCACCAGGACTCCCCAGGTGCTGAAGAGACGCCAGCTGGAGGGGCTGCCCCTTCCCCCGGGTCGGCCCTGACCCTGTCCACCCCACCTCAGGACGTTCTCCAGGGGTCCCTCCGGGATGCACTCGGACCCCCTGCCCGCTGCACTCAGCCTCCCAGGCCCCAGCCGCCCGCCTGGCAGGGGAGCTTGGCTTTTCGGGCTAGAGGTGGGTGGGGGCGCCGGGAAAGGAGGCAGGATTCCTCACACCAGGCACCGTCCCCCAGGGCAGCTCAGGCACCAAGAGCCTGAATAATTCACCAAATGTTAATAATGTAAAAATCCTCCTTTTTAATTGCTTTCCCTGCTCTGCCTGGGGCCGCTCTGCTGGCCGCGCGGGGGAGGGGCGCCGGCCGCCGGGGAGCGCGCTGTCAATCAGGCCGCGCCGCCGCCCCCCCCCCCCGCCCCGCCGCGGAGCCGGCCGTAAATAACCCTGTAACTAACCCGGCCGCTAGCGCGGGGGCGCTGGGCCCCGCTGGGATCGATGCGGGCGGCCGCGCCGGCTGGGCTCTGCGGGCTGGCACCCGGCCCGGGGCGGGACCCACCTCCGCTTTCGGGTAATTAATTTATAAACAGAGGCGGCGGTGGAGCTGGCGGAGCCTGCATAGTGGGGGCTGCGGGGACTCGGGAGGCCCGGGCGGGAGGGAGAGGCCGAGAGACCTGGGACGCGGCGCCTTAGACGCGGGCGCTGCGTGCGCATTGGGGCGAGTGTGGCCACGCGGGACAGTGACCCTGCGCAGCCGGGACTGGGCGACCCCTGTGCTAGTGTGGCGTGCGTGCGCGGGCGCTGC
CTTGCCTTTGTGACAAGCTTTGGCCAGCCGCGTCTACTATGGGGACCTCAGATTTTCTTGCCTCCCACCGAAGAGGGGGTCCCCTGGGCGGTCAGCCCCTGGCTGGCACTTCTGGACTCTCTCGCTGCCCCGCAGGCTCTGTGGCCTCGGGACGTCTGCACAGCCCCCTCCCCGCAAGGCTCAGCCGCCTCTCAGGCCGGAAGCCTCCAGGCACCCGGCTCCCCTTCGGGGAAGAGCTTTTCCCGACACTTCCTCGCCCAGCATCTTGTCTGCCGTCTCGGCCCTGTGGCCGCCCATCCTCCTGCCCCGTGCCCGAGACCAGCCCAGGGGCCGAGCACGGCCGAGTGGTGTGGTCAGTTCCCCACCTCAGTGTTCTACGCCAGGACGCGGGCTGGGGAGGATGAGGGCGCATAGCCGGGGGGATCACTGCTGTTGTCCCCCACCCAGATCTCCTGAGGGTCCGGCAGGAGGTGGCGGCTGCAGCTCTGAGGGGCCCCAGTGGCCTGGAAGCCCACCTGCCCTCCTCCACGGCAGGTCAGCGTCGGAAGCAGGGCCTGGCTCAGCACCGGGAGGGCGCCGCCCCAGCTGCCGCCCCGTCCTTCTCGGAGAGGTACTGGGGTGGCTGCCGTTCTCTGCTTGTTTCTGGGGTGCCGCCCGCACCCCCGCGCTCTCAGCCACCAGCACGCGCCCCGAGAGTGCCAAGCACTGTGTTCAGCTCTAGGTTCGGGTCCGGGCAGAGCGTTTCGGGGGTGACACCGATCTGGGCTGCAGTGTTGAGGGCGCCACTGGGGTGCGTGAGGGAGGCTGAGGCCCATCAGGGGGTTCCCTGGAGGAGAAGCCAGAGAAGGGGAGAGCTCCAAGTCTGGAACCCCGGGGTCAGTCGGGAGGGGTCGGCCAGAGGACTCAGAGCTGGAGGCGGAGGGGGGGTCCTGGCTGGCGCTCAAATGTAGACGCCGGCGCCGGATCTGTTCCCGGCACAGACAAGGCCTCCGGCACAGACCCGGGTTTCTCGGGTCCAGGACACGAGGCGGGGCGGGGCGCCTGGAGAAGGGAGGGGCCGCCTGAGGCCCGAGTCCCTGCCCGGCCGCTGAGCCCGGCGTCTGCAGCTGCCTCCACCGCCGCCCGGATTGCGGCTAATGACGCCCCCGCTTCCCCCGCCGCTCGGGTCCGCAGGGGAGGGGAGCAGGCGGGGCCGGCGCCCCGCGCAGTAATTACCGCTGCAGCCGTCGCCGCCCGCCGGGTCAGCGCCTCCGCGCCGCCGCCGAGATTAATTGGCGCCGCCGGCGGGGGCGGGGATGGCGCGCGACCTGGGGCCGTAACGAGCTGCGCATCGACCGCCCGCGGGGCCGGCAATTAGCGGAGGCGGCGGGGGAGGGGCGCCGGGGCCTTTACGGGAACGGGGGCGGGGGGGACGCCGCTCATTGCGCTGCCGTCCACAGGGAGCTGCCTCAGCCGCCCCCCTTGCTGTCGCCGCAGAATGCCCCTCACGTCGCCCTGGGCCCCCATCTCAGGCCCCCCTTCCTGGGGGTGCCCTCGGCTCTGTGCCAGACCCCAGGTGAGGAGGCGGGTGCGCATCCCCTGGGAGCCCGCGTGGAGGCTCGCGGACCCGGCCCTGCCCCTGTCGGAGCCGAGACGGACCGGGTAGGGGATTGCAAAGGGCCGGCTCGGACCGCCTCGGACCCCCCGACCCCGCGTTGTCCCCCTCCCCACCAGGCTACGGCTTCCTGCCCCCCGCGCAGGCGGAGATGTTCGCCTGGCAGCAGGAGCTCCTGCGGAAGCAGAACCTGGCCCGGTAGGTGCGGGGAGGCGGGCGGGGCCGCGCGGCCCGGGAGGCGGCTGACCCGCGTCTGCCCCCGGCCCAGGCTGGAGCTGCCCGCCGACCTCCTGCGGCAGAAGGAGCTGGAGAGCGCGCGCCCACAGCTGCTGGCGCCCGAGACCGCCCTGCGCCCCAACGACGGCGCCGAGGAGCTGCAGCGGCGCGGGGCCCTGCTGGTGCTGAA
CCACGGCGCGGCGCCACTGCTGGCCCTGCCCCCCCAGGGGCCCCCGGGCTCCGGACCCCCCACCCCGTCCCGGGACTCTGCCCGGCGAGCCCCCCGGAAGGGGGGTCCCGGCCCTGCCTCAGCGCGGCCCAGCGAGTCCAAGGAGATGACGGGGGCTAGGCTCTGGGCACAAGATGGCTCGGAAGACGAGCCCCCCAAAGACTCGGACGGAGAGGACCCCGAGACGGCAGCTGTTGGGTGCAGGGGGCCCACTCCGGGCCAAGCTCCAGCTGGAGGGGCCGGCGCCGAGGGGAAGGGGCTTTTCCCAGGGTCCACACTGCCCCTGGGCTTCCCTTATGCCGTCAGCCCCTACTTCCACACAGGTGGGCACCCCCACACTCTAGATCCTTCCAGAGGGCACAGGACTGGCAGGCCGCCTGTGGAAGGGTCTTGGGGGGAGGAAAAATTCCCCTTAGGCACCCATCCCCCACCTCAGCAATTGGGGCACACGACGGTCAGGAGACGGGCGGGTATGGGAAAGCCAGCCAGAGCCCTAGTAACACGCCCCACAACTCAGGCGCGGTAGGGGGACTCTCCATGGATGGGGAGGAGGCCCCAGCCCCTGAGGACGTCACCAAGTGGACCGTGGATGACGTCTGCAGCTTCGTGGGGGGCCTGTCTGGCTGTGGAGAGTACACTCGGGTAAGGGGGGGCCCCAGTTCCTGGGGCGGGGCTGGAGCTGGCTGGCAGTCACTACCTCCCTGGAAAGGATGGTGGGGTAGGGCCATTCCCCAACGCCCTCTCCCTCCCCAAAAGCAGTGCGCAGCAGGGACTGGACTGTGCACCCCACCTTTTTTTTTTTTTTTTTTTTTTGCCAGGTGTTTTCTGCCTGACACTCAAACCCAACAGATCACTGTTTTTAAAAAATTTCCGTGAGCTGCACAAACAGCTCCTCTTGGCTCTGCTGGGCTGGAGGATGGAGCAGCACCCGGGTCCTGACCCTCCCTCCCTCCCCCTTCCAAGTCTTCAGGGAGCAGGGGATCGACGGGGAGACCCTGCCACTGCTGACGGAGGAGCACCTGCTGACCAACATGGGGCTGAAGCTGGGGCCCGCCCTCAAGATCCGGGCCCAGGTGAGACGCTGGGGAGTGAGGTCAGGGTCTCCAGACCACAGCTGGGCAGAAAGCTCTGGGTGGGTGTGCGACAGCCCCCACCAGGCCATCTCTCTGCAGGTGGCCAGGCGCCTGGGCCGAGTTTTCTACGTGGCCAGCTTCCCCGTGGCTCTGCCACTGCAGCCACCAACCCTGCGGGCCCCGGAGCGAGAACTCGGCACAGGAGAGCAGCCCTTGTCCCCCACGACGGCCACGTCCCCCTATGGAGGGGGCCACGCCCTTGCCGGTCAAACTTCACCCAAGCAGGAGAATGGGACCTTGGCTCTACTTCCAGGGGCCCCCGACCCTTCCCAGCCTCTGTGTTGAGGTTGCCGGGGGTAGGGGTGGGGCCACACAAATCTCCAGGAGCCACCACTCAACACAATGGCCCTGCCTCCCACCGCTTTATTTCTTTCGGTTTCGGATGCAAAACAAAAAATTTTAAAAGAAAATGTGACTTCAAAGGAAAGGAACAAATTTTCAAAGACTTGGGGGAGTGAAGGCAGAGCCTGGTGCAGATGGACGAGGTCTGCAGACGGAGGGCAGAGGTGGTGGAAGGGGCCAGGGGCCTGCAGGCCTCCCCCTGGAACTGGGACTGGTCTCGGTCTGCTGACGTCAGGGTCAGCTCCCCCGCGGAGCTGACTTCAGCAGCCCACAGCTGTGGGGCTTCAGCAGCCACACCAGCCCAGCCCAGCCCAGCTCTCGATACGTTTGGTCTTTCATGCTGAAAAATAAATAATAAAGCCTGTCCCGTGTCTACTGCCTCCCCCAACTGCACAGACGCCAGCCTCTAGGCCTGACTGCCAGGGAGGTGGAAACACTGGCCACCAGCCCGGCAGCCCCTACAGGCCCCCCAGATGGGCTGCCTCA
GTCGTCCTCTGAGAGCTGCAGATCCTCCAGCTCGTCCTCCGGCCCCTGGGCCAGCTGCTGCAGCTCCCCAGGGGCCAGCCCCGCCTCTGCGTCTGGGTCTCCATCTGCGGGGAGAGATGGAGGCTACATAAATTTTGCTTTATCAGGAAGAAGCCAGCCTTAGAGGTTACTCATCACTAATTAATCACGGCACTAATTAATTTATCCCTGTTGCTGGCTGCCAGAGAACAGAGCATTTGGCCTGGCCTTCCCAGGGAGGGAAAAGCCTGGCCCAGAGCCCCACGCCCCCCGCCCACGTGGCTCTGCCCTCCCGCCAGATGGGCTCACAGGGCCACACCCTCTCACCCCAAGACCATTCACCCTCCGAGTTGCTGCTGTCCTCCTCGCCCTCCTCCTCGTCCTCTTCATCGTCTTCCACCCCATGCCGAGTGCTCAGGGGCCTCAGTATCCCTGAGGAACAAGAAGCAGAGTCCATATGACTCCCACCCACAGGGTCCACCAGCAAAGTCACAGTGGGGGCAGGAGGGTGGCCAGGCTCCCAACACCCTTCCCTCCGCTGACTTCCAGCAGGTGGAGAGGAGCCCTGGGGAGGAACTGGGAGGTCACAGGCCTGGGGACAGAGTTACCAATCCCAGTAGGCCTTCACTTCAAGGAGGGAAGGCGCTGGCACCAGAAGCCTGGCAACACTGAGGTTGGCCCCAGCTGGGCCAGAGACTGGTGAGCCCCCTGCAGGATGGGTACAGGTGGCCCTCGTGGCTCTGGGAAGTCCAGCAGAGCCCTCCAGGCCCACCCTTCCCCTGGGAGCACCACGCAGGCCCCACCTCTCTCCGAGAATCCCTCGGTGTCGTCCTCTTCAGAGCTGTTCAGGTCAAAGAGGTCTTTAAATTGCTTCCTGTCCTCATCCTTCCTGTCAGCCATCTTCCTTCGTTTGATCTCAGGGAAGTTCAGGTCTTCCAGCTGGAAGGCCAAAGAACCAGGGGCTCAGGTGAGAGAGGGCAGGGGCTGGCGGCCACAGCAGGGCCAGGCATCGCCAGACCCACCACCAGGGCCCCATGTGGCCAATTTCTAGTCCCCTCTGTTCCCAAATCACAAAGCCATCCTCCAAGTTGTCCATCCCATGTCCAAGGTCAAAGGCAGAGCCCTTCCTGCTTCTCCTCACGGGCCCCTGGTGCCCACATACTGGCCTGGGTGACGAGGTCAGTCCAGCCACTCCACCTGCCCGGAGCCTCCAGCCCATCAGGCCTGAGGGGGCATGGCCTCCCCAACCTAGTGCAGCCTGGGGCTTCCCCTCCCTGGAAACGCCTGGTTCTGGCCAGTTCTCCAACACCTACCCCCTCTCCAAGTCGAATCATCCGGGCACGGCCCTGGCCGCCTGGCACTGTTTCCAAACCCTCGCCCTGGTCTCAAGTCATAGTGCGCTAGATCTGAAACCCAGGAAGTCACAACACACCCCCAGGTCCCCTCGCCGAGCCGCACCCGCTCTTTGCCACTGATCTCCAGCTGGATCTCCCGGTCACGCAGCTTGCGCCAGTGGCTGTAGTACAAGGTCAGGGGTGTCCCCTCTTCCCGGGTCAGCTTCTCCCAGGCTTCCTGGGGGGTTGGGGGAGTTCAGGGTCATGCCTCACCCTGGGCAAACCCCCACATGTAGCTGGGGCTATACCCTGGCAGGTGCCCTCAGGTGGCACTACCCCCAGGGCCCACTAACCACTGCCTGCTGCTCAGAGACGCCGAAGGAAACCCTCTGGCGGCGGCTGCAGATGTATGCCGAGTTCTCCTGAACCTTCCCAAGCAGCTGCTGCACCTGCCGGCAGTAGTTGGCCACCTTGCACTCCCGGAGGAACGACTTCAGCTGCGGAAGGGAGGGGTCAGCCACTGAAGCCCAGGACCGCTCCATGTGCACAGCTGGCCCAGGTCCTGTGCAAAACCACGCGTGGTGGCCACGGGGATACCCCAGGAGGGGACATGGATCCCATCTCAGGGCTCAAGTGCATAGCTGTTGC
AGCTGGGATGGCAGAGGCAGAATCAGCCCACCCTCTGGGCCAACCCTGCCCACTACTCACCTCTGGAAATAAAGTTTTATGCCAGGCGTGGTGGCTCTCGCCTGTAATCCCAGCACTTTGGGAGGCTGAGGCGGGTGGATCACCTGAGGTCAGGAGTTCAAGACCAGCCTGGCCAACGTGGCAAAACCCCGTCTCTACTAAAATACCAAAATTAGCTGGGTGTGGTGGTGGGCGCCTGTAAACCCAGCTCCTTGGGAGGCTGAGGCTGGAGAATCGCTTGAGCCCAGGAGATGGAGATTGCAGTGAGCCGAGATCGCACCATTGCACTCCAGCCTGGGCAACACAGCGAGACTCCATCTCAAACAATACTACTACTACTAATAAAATACAGTCTCGCTGACGTGCAGCCACACGTGTGCATTGCATGGTTCTGCAGTTGCCTGTGCAGCTGAACATCCGTAGGAAGCCACGTTTACCATTTGGCCCCTCACCAAAAACATTTTCTGACCCCTACCCCAGACCCCGACCCTGGGCCCTTGAGTCCAGAAGCAGAGATGCCCCAATGCCAGGTATCACCACCCAAGAGGACATGGGAGGAACAGAGGCTGTGGCCCCTGCTGTGAGTGCCCCCCAGAAAGGGGGTCCCGGCTCTGTGCATGTGACATGTGTGGCCGTGTGTGAGTACACACACATGCACACACCTCTATCTGGATAAGCCTCTGACCAATTGTGGCTCATGTGAGCAGATCCCTCCTCCCCACACTGCACAGACCTATAGTCGGCACATCTGATTCCAGCCACCAGGGCCAGACAGCAGGGCCCCCACCCCTTCCACTAGGCACAGGCCTCCCTGAGGCTGGAAACATCACGGCTGAGAGCAAACAGACCTCCCGTGGGGGCCCAGAAGGACCTTCTGAGGATAAGGAGAACCCCCTCCTCCACCCCACTCCTGCCTAAGATGAGGCTGACGTGGGGTATTTAGCGGGGCAGGCTGGGCCTTCCTATGAGGCTGATGTGGGGTATTTGGCGGGGCAGGCCAGGCCTTCCTGCCACCTGAGAAGCCACTCCACCCACTCCCCACACCCGGGATGGCCTGGGGAAGTGTGCGATCAGCGTACCAGCCTGAGCCAGGGTCGGCATGCTCAGTCCCAACCCCGAAGCAAAGATCAGCCTTGTGGTTCCCACCTGGGGAGGAGGCTGTTGTGCTCCCAGGGTCCTCAGCCCACTGCCCAGGCCTGCCCCCAAACCTCCTGAATGGCTTAGAACCCCTCATCAGCCCCTCCAAGGGGGCCTCACGGGGCGCGTTGCCAGCAGTCAGGTTCCACCCCAGTCCCAGGTACCCGGGACAAGGGCACCTCCTACCAGCCTGGGGCAGCCAAGCCCGTTATAAGACAGTCTGAGTCGGCCACGAGCCGGTGTGGGCAGGACACACACCTGCAGGACCACAGGCAGCACCAGCTCCGGGAAGCCGATGCAGTGTGCCTGGCTGTGCAGGTACTCCAGGGTGAGGTCGTACAGCTGCTCCACCAGGCCGTCCTGAAGAGCAGGAGAGAGGGCCGAGTGCATCAGGGAGAGGCTGGGGCTGGGCACTCAGGCCCCTTCCCCTCAGGCTGTCAGGGCAGCGCCATCTCCAGGGCACGGACTGCAAGGAAGGGGCTCCTGGGCCCCAGCCCTGGGAGACCATGAAGGTCCATGCTTGAACTTGGAGGATGCCAGCCCCCTCCCATCCACCTCAGCACCCCCAACCCCACCCTGGGAACTGCCCAGGCCCCTCCCCAGGAGGCCAGCCTCACCCGGTACGCCTTCTCCTGCAGGTTGACATTGGACAGCTTCAGGATCACGGAGAAGTTGATGGGCTTGGAGCTCATGCGCCCTGGCTTCCTGTTGAAGTCGACCTGCTGGAACATCTGCCCCAAGGGCCGTGTCAGGCTCTCTCGGCCCCATGCCTGGTCACCCTGGCTTCACCCTGGCTGCACCCTGGTCCCCCTGGTCCCTTTGGC
C
>chr1 pos:880527 gene_name:NOC2L strand:-
GTGCACCTGTGTGCATGTACGTGCGTCTACGTGTGTGCCTGTGCATGTGTGCGCCTGTGTGTACCTGTGTGTATGCATCTTTGCACGTGCACATGCCACTCAGGTAGGGAAAGATTGGAGTCCGAAGTTTCAACTTTCAGTGGGTGGGTTAGGCCCCACCCCGCTGTAAATTTAGGAATTCACGATCTCCACCCTGTTTATCTAATGAGTTCTCAGCCTCATGAAGGCCCAGAGTCGTGTCACAGCTGTCCTTGGGGCTGGGTCCCCAGGTTGCTGGGTCCAGAAGGTATGGAAGCCCCAGGCACGTTCTGATTCCCCTTCCACTGAGGCAGGGATGCTGAACATCTTTAGGAAGCCATGTTCACTCCCATGGCAGCCAGCAGTGGTCTCTGATTGCCCAGCCCTTGGCCTGGCCCCTGTGTCTGTGGGCCTCCAGCTCTGCTGCCCAGCTCCAGGCATGCTTTGTGTCTGTTTCCCTTGTCCAATCTCCTTGGCTACGTGCTTTCTTACTCTCTTGCAGTGTCTGTTTCTTCACTTGTGCACTGCCCTGGTTCACTGCAGCCGCACCCTGTAGGCCCCTCTCACGCAGGGATGCAGGCCTCTCCTCTCCGGAAAAAGCAAACCCTAAAAGCTAAAACAAAGCCCTCAGCTGTAGGCCGTGCCTGCCCTTCCCCGGTGCCTGGACAGGAAGCCAGTCGCCTGCCCATACTTTTGGCCCAGGCTAGAGAAGGGCAGTGTCCTCCCAGAGGTTCATCAGTACCAGGGCGTTTTCCCATCTGGACCTGAGCTCAGCTGTCTGGCAGCCACCCCTGCTGAGTGGGGTGTCTTGCTGGGGCCTCCACCCTTGGGCCCCCCATAATCTGCTTCTGTCCTCTGGTGCCCCAGCATGTACCCTGGATCTCTCTGGTTCACAGCCTGAGGGCTCCTAGTGGTTGGGGAGGGGTCACAAGACTGAGAGGCCAGGCTGACTCTTTCTCTGCTCCTCCTGGCATGTCCTACGGAGGTGCATGGCCTGTGGCTTCTGTGGAGGGTGTGGGAGGGGCCCCCCAGGCCTCCCGTGACCTCCATCTGTCCCGTCCTGTGTCTGGCACTCTTTGCTGTTGCTGCTGCGTCTTCTGGTTGCTCGGGACGGAGCCCCATGTGGCATTGCTGTGCTGAGGGCCAGGATGGGCCTCAGTGCCATGTTGTCAGGAATGGGGGCTGTCCTGGTACTCTGTGTGGCAGGGACCTCTAGGTCTCCAGACGTGGGTCCTTAGTGCTTCCCAGGATTTTGGGAGAGGGCCCGTGTTCCTGATCCTTCCCTGCTGATCAGAGCCCCACTCGGGGACACGCCAGGCTGTGTGGGGCCATGGGGCTGGGACCGTGCCTAGCTGCTTATCTCTTGTTTCGGGTTGGGTCTCCTCGTGCTGAAGCCTGAGGACCAGGGTGACCAGGGTGCAGCCAGGTGCAGGGCCAAAGGGACCAGGGGGACCAGGGTGCAGCCAGGGTGAAGCCAGGGTGACCAGGCATGGGGCCGAGAGAGCCTGACACGGCCCTTGGGGCAGATGTTCCAGCAGGTCGACTTCAACAGGAAGCCAGGGCGCATGAGCTCCAAGCCCATCAACTTCTCCGTGATCCTGAAGCTGTCCAATGTCAACCTGCAGGAGAAGGCGTACCGGGTGAGGCTGGCCTCCTGGGGAGGGGCCTGGGCAGTTCCCAGGGTGGGGTTGGGGGTGCTGAGGTGGATGGGAGGGGGCTGGCATCCTCCAAGTTCAAGCATGGACCTTCATGGTCTCCCAGGGCTGGGGCCCAGGAGCCCCTTCCTTGCAGTCCGTGCCCTGGAGATGGCGCTGCCCTGACAGCCTGAGGGGAAGGGGCCTGAGTGCCCAGCCCCAGCCTCTCCCTGATGCACTCGGCCCTCTCTCCTGCTCTTCAGGACGGCCTGGTGGAGCAGCTGTACGACCTCACCCTGGAGTACCTGCACAGCCAGGCACACTGCATCGGCTTCCCGGAGCTGGTGC
TGCCTGTGGTCCTGCAGGTGTGTGTCCTGCCCACACCGGCTCGTGGCCGACTCAGACTGTCTTATAACGGGCTTGGCTGCCCCAGGCTGGTAGGAGGTGCCCTTGTCCCGGGTACCTGGGACTGGGGTGGAACCTGACTGCTGGCAACGCGCCCCGTGAGGCCCCCTTGGAGGGGCTGATGAGGGGTTCTAAGCCATTCAGGAGGTTTGGGGGCAGGCCTGGGCAGTGGGCTGAGGACCCTGGGAGCACAACAGCCTCCTCCCCAGGTGGGAACCACAAGGCTGATCTTTGCTTCGGGGTTGGGACTGAGCATGCCGACCCTGGCTCAGGCTGGTACGCTGATCGCACACTTCCCCAGGCCATCCCGGGTGTGGGGAGTGGGTGGAGTGGCTTCTCAGGTGGCAGGAAGGCCTGGCCTGCCCCGCCAAATACCCCACATCAGCCTCATAGGAAGGCCCAGCCTGCCCCGCTAAATACCCCACGTCAGCCTCATCTTAGGCAGGAGTGGGGTGGAGGAGGGGGTTCTCCTTATCCTCAGAAGGTCCTTCTGGGCCCCCACGGGAGGTCTGTTTGCTCTCAGCCGTGATGTTTCCAGCCTCAGGGAGGCCTGTGCCTAGTGGAAGGGGTGGGGGCCCTGCTGTCTGGCCCTGGTGGCTGGAATCAGATGTGCCGACTATAGGTCTGTGCAGTGTGGGGAGGAGGGATCTGCTCACATGAGCCACAATTGGTCAGAGGCTTATCCAGATAGAGGTGTGTGCATGTGTGTGTACTCACACACGGCCACACATGTCACATGCACAGAGCCGGGACCCCCTTTCTGGGGGGCACTCACAGCAGGGGCCACAGCCTCTGTTCCTCCCATGTCCTCTTGGGTGGTGATACCTGGCATTGGGGCATCTCTGCTTCTGGACTCAAGGGCCCAGGGTCGGGGTCTGGGGTAGGGGTCAGAAAATGTTTTTGGTGAGGGGCCAAATGGTAAACGTGGCTTCCTACGGATGTTCAGCTGCACAGGCAACTGCAGAACCATGCAATGCACACGTGTGGCTGCACGTCAGCGAGACTGTATTTTATTAGTAGTAGTAGTATTGTTTGAGATGGAGTCTCGCTGTGTTGCCCAGGCTGGAGTGCAATGGTGCGATCTCGGCTCACTGCAATCTCCATCTCCTGGGCTCAAGCGATTCTCCAGCCTCAGCCTCCCAAGGAGCTGGGTTTACAGGCGCCCACCACCACACCCAGCTAATTTTGGTATTTTAGTAGAGACGGGGTTTTGCCACGTTGGCCAGGCTGGTCTTGAACTCCTGACCTCAGGTGATCCACCCGCCTCAGCCTCCCAAAGTGCTGGGATTACAGGCGAGAGCCACCACGCCTGGCATAAAACTTTATTTCCAGAGGTGAGTAGTGGGCAGGGTTGGCCCAGAGGGTGGGCTGATTCTGCCTCTGCCATCCCAGCTGCAACAGCTATGCACTTGAGCCCTGAGATGGGATCCATGTCCCCTCCTGGGGTATCCCCGTGGCCACCACGCGTGGTTTTGCACAGGACCTGGGCCAGCTGTGCACATGGAGCGGTCCTGGGCTTCAGTGGCTGACCCCTCCCTTCCGCAGCTGAAGTCGTTCCTCCGGGAGTGCAAGGTGGCCAACTACTGCCGGCAGGTGCAGCAGCTGCTTGGGAAGGTTCAGGAGAACTCGGCATACATCTGCAGCCGCCGCCAGAGGGTTTCCTTCGGCGTCTCTGAGCAGCAGGCAGTGGTTAGTGGGCCCTGGGGGTAGTGCCACCTGAGGGCACCTGCCAGGGTATAGCCCCAGCTACATGTGGGGGTTTGCCCAGGGTGAGGCATGACCCTGAACTCCCCCAACCCCCCAGGAAGCCTGGGAGAAGCTGACCCGGGAAGAGGGGACACCCCTGACCTTGTACTACAGCCACTGGCGCAAGCTGCGTGACCGGGAGATCCAGCTGGAGATCAGTGGCAAAGAGCGGGTGCGGCTCGGCGAGGGGACCTGGG
GGTGTGTTGTGACTTCCTGGGTTTCAGATCTAGCGCACTATGACTTGAGACCAGGGCGAGGGTTTGGAAACAGTGCCAGGCGGCCAGGGCCGTGCCCGGATGATTCGACTTGGAGAGGGGGTAGGTGTTGGAGAACTGGCCAGAACCAGGCGTTTCCAGGGAGGGGAAGCCCCAGGCTGCACTAGGTTGGGGAGGCCATGCCCCCTCAGGCCTGATGGGCTGGAGGCTCCGGGCAGGTGGAGTGGCTGGACTGACCTCGTCACCCAGGCCAGTATGTGGGCACCAGGGGCCCGTGAGGAGAAGCAGGAAGGGCTCTGCCTTTGACCTTGGACATGGGATGGACAACTTGGAGGATGGCTTTGTGATTTGGGAACAGAGGGGACTAGAAATTGGCCACATGGGGCCCTGGTGGTGGGTCTGGCGATGCCTGGCCCTGCTGTGGCCGCCAGCCCCTGCCCTCTCTCACCTGAGCCCCTGGTTCTTTGGCCTTCCAGCTGGAAGACCTGAACTTCCCTGAGATCAAACGAAGGAAGATGGCTGACAGGAAGGATGAGGACAGGAAGCAATTTAAAGACCTCTTTGACCTGAACAGCTCTGAAGAGGACGACACCGAGGGATTCTCGGAGAGAGGTGGGGCCTGCGTGGTGCTCCCAGGGGAAGGGTGGGCCTGGAGGGCTCTGCTGGACTTCCCAGAGCCACGAGGGCCACCTGTACCCATCCTGCAGGGGGCTCACCAGTCTCTGGCCCAGCTGGGGCCAACCTCAGTGTTGCCAGGCTTCTGGTGCCAGCGCCTTCCCTCCTTGAAGTGAAGGCCTACTGGGATTGGTAACTCTGTCCCCAGGCCTGTGACCTCCCAGTTCCTCCCCAGGGCTCCTCTCCACCTGCTGGAAGTCAGCGGAGGGAAGGGTGTTGGGAGCCTGGCCACCCTCCTGCCCCCACTGTGACTTTGCTGGTGGACCCTGTGGGTGGGAGTCATATGGACTCTGCTTCTTGTTCCTCAAGGATACTGAGGCCCCTGAGCACTCGGCATGGGGTGGAAGACGATGAAGAGGACGAGGAGGAGGGCGAGGAGGACAGCAGCAACTCGGAGGGTGAATGGTCTTGGGGTGAGAGGGTGTGGCCCTGTGAGCCCATCTGGCGGGAGGGCAGAGCCACGTGGGCGGGGGGCGTGGGGCTCTGGGCCAGGCTTTTCCCTCCCTGGGAAGGCCAGGCCAAATGCTCTGTTCTCTGGCAGCCAGCAACAGGGATAAATTAATTAGTGCCGTGATTAATTAGTGATGAGTAACCTCTAAGGCTGGCTTCTTCCTGATAAAGCAAAATTTATGTAGCCTCCATCTCTCCCCGCAGATGGAGACCCAGACGCAGAGGCGGGGCTGGCCCCTGGGGAGCTGCAGCAGCTGGCCCAGGGGCCGGAGGACGAGCTGGAGGATCTGCAGCTCTCAGAGGACGACTGAGGCAGCCCATCTGGGGGGCCTGTAGGGGCTGCCGGGCTGGTGGCCAGTGTTTCCACCTCCCTGGCAGTCAGGCCTAGAGGCTGGCGTCTGTGCAGTTGGGGGAGGCAGTAGACACGGGACAGGCTTTATTATTTATTTTTCAGCATGAAAGACCAAACGTATCGAGAGCTGGGCTGGGCTGGGCTGGTGTGGCTGCTGAAGCCCCACAGCTGTGGGCTGCTGAAGTCAGCTCCGCGGGGGAGCTGACCCTGACGTCAGCAGACCGAGACCAGTCCCAGTTCCAGGGGGAGGCCTGCAGGCCCCTGGCCCCTTCCACCACCTCTGCCCTCCGTCTGCAGACCTCGTCCATCTGCACCAGGCTCTGCCTTCACTCCCCCAAGTCTTTGAAAATTTGTTCCTTTCCTTTGAAGTCACATTTTCTTTTAAAATTTTTTGTTTTGCATCCGAAACCGAAAGAAATAAAGCGGTGGGAGGCAGGGCCATTGTGTTGAGTGGTGGCTCCTGGAGATTTGTGTGGCCCCACCCCTACCCCCGGCAACCTCAACA
CAGAGGCTGGGAAGGGTCGGGGGCCCCTGGAAGTAGAGCCAAGGTCCCATTCTCCTGCTTGGGTGAAGTTTGACCGGCAAGGGCGTGGCCCCCTCCATAGGGGGACGTGGCCGTCGTGGGGGACAAGGGCTGCTCTCCTGTGCCGAGTTCTCGCTCCGGGGCCCGCAGGGTTGGTGGCTGCAGTGGCAGAGCCACGGGGAAGCTGGCCACGTAGAAAACTCGGCCCAGGCGCCTGGCCACCTGCAGAGAGATGGCCTGGTGGGGGCTGTCGCACACCCACCCAGAGCTTTCTGCCCAGCTGTGGTCTGGAGACCCTGACCTCACTCCCCAGCGTCTCACCTGGGCCCGGATCTTGAGGGCGGGCCCCAGCTTCAGCCCCATGTTGGTCAGCAGGTGCTCCTCCGTCAGCAGTGGCAGGGTCTCCCCGTCGATCCCCTGCTCCCTGAAGACCTGGAAGGGGGAGGGAGGGAGGGTCAGGACCCGGGTGCTGCTCCATCCTCCAGCCCAGCAGAGCCAAGAGGAGCTGTTTGTGCAGCTCACGGAAATTTTTTAAAAACAGTGATCTGTTGGGTTTGAGTGTCAGGCAGAAAACACCTGGCAAAAAAAAAAAAAAAAAAAAAAGGTGGGGTGCACAGTCCAGTCCCTGCTGCGCACTGCTTTTGGGGAGGGAGAGGGCGTTGGGGAATGGCCCTACCCCACCATCCTTTCCAGGGAGGTAGTGACTGCCAGCCAGCTCCAGCCCCGCCCCAGGAACTGGGGCCCCCCCTTACCCGAGTGTACTCTCCACAGCCAGACAGGCCCCCCACGAAGCTGCAGACGTCATCCACGGTCCACTTGGTGACGTCCTCAGGGGCTGGGGCCTCCTCCCCATCCATGGAGAGTCCCCCTACCGCGCCTGAGTTGTGGGGCGTGTTACTAGGGCTCTGGCTGGCTTTCCCATACCCGCCCGTCTCCTGACCGTCGTGTGCCCCAATTGCTGAGGTGGGGGATGGGTGCCTAAGGGGAATTTTTCCTCCCCCCAAGACCCTTCCACAGGCGGCCTGCCAGTCCTGTGCCCTCTGGAAGGATCTAGAGTGTGGGGGTGCCCACCTGTGTGGAAGTAGGGGCTGACGGCATAAGGGAAGCCCAGGGGCAGTGTGGACCCTGGGAAAAGCCCCTTCCCCTCGGCGCCGGCCCCTCCAGCTGGAGCTTGGCCCGGAGTGGGCCCCCTGCACCCAACAGCTGCCGTCTCGGGGTCCTCTCCGTCCGAGTCTTTGGGGGGCTCGTCTTCCGAGCCATCTTGTGCCCAGAGCCTAGCCCCCGTCATCTCCTTGGACTCGCTGGGCCGCGCTGAGGCAGGGCCGGGACCCCCCTTCCGGGGGGCTCGCCGGGCAGAGTCCCGGGACGGGGTGGGGGGTCCGGAGCCCGGGGGCCCCTGGGGGGGCAGGGCCAGCAGTGGCGCCGCGCCGTGGTTCAGCACCAGCAGGGCCCCGCGCCGCTGCAGCTCCTCGGCGCCGTCGTTGGGGCGCAGGGCGGTCTCGGGCGCCAGCAGCTGTGGGCGCGCGCTCTCCAGCTCCTTCTGCCGCAGGAGGTCGGCGGGCAGCTCCAGCCTGGGCCGGGGGCAGACGCGGGTCAGCCGCCTCCCGGGCCGCGCGGCCCCGCCCGCCTCCCCGCACCTACCGGGCCAGGTTCTGCTTCCGCAGGAGCTCCTGCTGCCAGGCGAACATCTCCGCCTGCGCGGGGGGCAGGAAGCCGTAGCCTGGTGGGGAGGGGGACAACGCGGGGTCGGGGGGTCCGAGGCGGTCCGAGCCGGCCCTTTGCAATCCCCTACCCGGTCCGTCTCGGCTCCGACAGGGGCAGGGCCGGGTCCGCGAGCCTCCACGCGGGCTCCCAGGGGATGCGCACCCGCCTCCTCACCTGGGGTCTGGCACAGAGCCGAGGGCACCCCCAGGAAGGGGGGCCTGAGATGGGGGCCCAGGGCGACGTGAGGGGCATTCTGCGGCGACAGCAAGGGGGGCGGC
TGAGGCAGCTCCCTGTGGACGGCAGCGCAATGAGCGGCGTCCCCCCCGCCCCCGTTCCCGTAAAGGCCCCGGCGCCCCTCCCCCGCCGCCTCCGCTAATTGCCGGCCCCGCGGGCGGTCGATGCGCAGCTCGTTACGGCCCCAGGTCGCGCGCCATCCCCGCCCCCGCCGGCGGCGCCAATTAATCTCGGCGGCGGCGCGGAGGCGCTGACCCGGCGGGCGGCGACGGCTGCAGCGGTAATTACTGCGCGGGGCGCCGGCCCCGCCTGCTCCCCTCCCCTGCGGACCCGAGCGGCGGGGGAAGCGGGGGCGTCATTAGCCGCAATCCGGGCGGCGGTGGAGGCAGCTGCAGACGCCGGGCTCAGCGGCCGGGCAGGGACTCGGGCCTCAGGCGGCCCCTCCCTTCTCCAGGCGCCCCGCCCCGCCTCGTGTCCTGGACCCGAGAAACCCGGGTCTGTGCCGGAGGCCTTGTCTGTGCCGGGAACAGATCCGGCGCCGGCGTCTACATTTGAGCGCCAGCCAGGACCCCCCCTCCGCCTCCAGCTCTGAGTCCTCTGGCCGACCCCTCCCGACTGACCCCGGGGTTCCAGACTTGGAGCTCTCCCCTTCTCTGGCTTCTCCTCCAGGGAACCCCCTGATGGGCCTCAGCCTCCCTCACGCACCCCAGTGGCGCCCTCAACACTGCAGCCCAGATCGGTGTCACCCCCGAAACGCTCTGCCCGGACCCGAACCTAGAGCTGAACACAGTGCTTGGCACTCTCGGGGCGCGTGCTGGTGGCTGAGAGCGCGGGGGTGCGGGCGGCACCCCAGAAACAAGCAGAGAACGGCAGCCACCCCAGTACCTCTCCGAGAAGGACGGGGCGGCAGCTGGGGCGGCGCCCTCCCGGTGCTGAGCCAGGCCCTGCTTCCGACGCTGACCTGCCGTGGAGGAGGGCAGGTGGGCTTCCAGGCCACTGGGGCCCCTCAGAGCTGCAGCCGCCACCTCCTGCCGGACCCTCAGGAGATCTGGGTGGGGGACAACAGCAGTGATCCCCCCGGCTATGCGCCCTCATCCTCCCCAGCCCGCGTCCTGGCGTAGAACACTGAGGTGGGGAACTGACCACACCACTCGGCCGTGCTCGGCCCCTGGGCTGGTCTCGGGCACGGGGCAGGAGGATGGGCGGCCACAGGGCCGAGACGGCAGACAAGATGCTGGGCGAGGAAGTGTCGGGAAAAGCTCTTCCCCGAAGGGGAGCCGGGTGCCTGGAGGCTTCCGGCCTGAGAGGCGGCTGAGCCTTGCGGGGAGGGGGCTGTGCAGACGTCCCGAGGCCACAGAGCCTGCGGGGCAGCGAGAGAGTCCAGAAGTGCCAGCCAGGGGCTGACCGCCCAGGGGACCCCCTCTTCGGTGGGAGGCAAGAAAATCTGAGGTCCCCATAGTAGACGCGGCTGGCCAAAGCTTGTCACAAAGGCAAGGCAGCGCCCGCGCACGCACGCCACACTAGCACAGGGGTCGCCCAGTCCCGGCTGCGCAGGGTCACTGTCCCGCGTGGCCACACTCGCCCCAATGCGCACGCAGCGCCCGCGTCTAAGGCGCCGCGTCCCAGGTCTCTCGGCCTCTCCCTCCCGCCCGGGCCTCCCGAGTCCCCGCAGCCCCCACTATGCAGGCTCCGCCAGCTCCACCGCCGCCTCTGTTTATAAATTAATTACCCGAAAGCGGAGGTGGGTCCCGCCCCGGGCCGGGTGCCAGCCCGCAGAGCCCAGCCGGCGCGGCCGCCCGCATCGATCCCAGCGGGGCCCAGCGCCCCCGCGCTAGCGGCCGGGTTAGTTACAGGGTTATTTACGGCCGGCTCCGCGGCGGGGCGGGGGGGGGGGGCGGCGGCGCGGCCTGATTGACAGCGCGCTCCCCGGCGGCCGGCGCCCCTCCCCCGCGCGGCCAGCAGAGCGGCCCCAGGCAGAGCAGGGAAAGCAATTAAAAAGGAGGATTTTTACATTATTAACATTTGGTGAATTATTCAGGCTCTT
G
>chr1 pos:880536 gene_name:NOC2L strand:-
TGTGCGTGTGTGCACCTGTGTGCATGTACGTGCGTCTACGTGTGTGCCTGTGCATGTGTGCGCCTGTGTGTACCTGTGTGTATGCATCTTTGCACGTGCACATGCCACTCAGGTAGGGAAAGATTGGAGTCCGAAGTTTCAACTTTCAGTGGGTGGGTTAGGCCCCACCCCGCTGTAAATTTAGGAATTCACGATCTCCACCCTGTTTATCTAATGAGTTCTCAGCCTCATGAAGGCCCAGAGTCGTGTCACAGCTGTCCTTGGGGCTGGGTCCCCAGGTTGCTGGGTCCAGAAGGTATGGAAGCCCCAGGCACGTTCTGATTCCCCTTCCACTGAGGCAGGGATGCTGAACATCTTTAGGAAGCCATGTTCACTCCCATGGCAGCCAGCAGTGGTCTCTGATTGCCCAGCCCTTGGCCTGGCCCCTGTGTCTGTGGGCCTCCAGCTCTGCTGCCCAGCTCCAGGCATGCTTTGTGTCTGTTTCCCTTGTCCAATCTCCTTGGCTACGTGCTTTCTTACTCTCTTGCAGTGTCTGTTTCTTCACTTGTGCACTGCCCTGGTTCACTGCAGCCGCACCCTGTAGGCCCCTCTCACGCAGGGATGCAGGCCTCTCCTCTCCGGAAAAAGCAAACCCTAAAAGCTAAAACAAAGCCCTCAGCTGTAGGCCGTGCCTGCCCTTCCCCGGTGCCTGGACAGGAAGCCAGTCGCCTGCCCATACTTTTGGCCCAGGCTAGAGAAGGGCAGTGTCCTCCCAGAGGTTCATCAGTACCAGGGCGTTTTCCCATCTGGACCTGAGCTCAGCTGTCTGGCAGCCACCCCTGCTGAGTGGGGTGTCTTGCTGGGGCCTCCACCCTTGGGCCCCCCATAATCTGCTTCTGTCCTCTGGTGCCCCAGCATGTACCCTGGATCTCTCTGGTTCACAGCCTGAGGGCTCCTAGTGGTTGGGGAGGGGTCACAAGACTGAGAGGCCAGGCTGACTCTTTCTCTGCTCCTCCTGGCATGTCCTACGGAGGTGCATGGCCTGTGGCTTCTGTGGAGGGTGTGGGAGGGGCCCCCCAGGCCTCCCGTGACCTCCATCTGTCCCGTCCTGTGTCTGGCACTCTTTGCTGTTGCTGCTGCGTCTTCTGGTTGCTCGGGACGGAGCCCCATGTGGCATTGCTGTGCTGAGGGCCAGGATGGGCCTCAGTGCCATGTTGTCAGGAATGGGGGCTGTCCTGGTACTCTGTGTGGCAGGGACCTCTAGGTCTCCAGACGTGGGTCCTTAGTGCTTCCCAGGATTTTGGGAGAGGGCCCGTGTTCCTGATCCTTCCCTGCTGATCAGAGCCCCACTCGGGGACACGCCAGGCTGTGTGGGGCCATGGGGCTGGGACCGTGCCTAGCTGCTTATCTCTTGTTTCGGGTTGGGTCTCCTCGTGCTGAAGCCTGAGGACCAGGGTGACCAGGGTGCAGCCAGGTGCAGGGCCAAAGGGACCAGGGGGACCAGGGTGCAGCCAGGGTGAAGCCAGGGTGACCAGGCATGGGGCCGAGAGAGCCTGACACGGCCCTTGGGGCAGATGTTCCAGCAGGTCGACTTCAACAGGAAGCCAGGGCGCATGAGCTCCAAGCCCATCAACTTCTCCGTGATCCTGAAGCTGTCCAATGTCAACCTGCAGGAGAAGGCGTACCGGGTGAGGCTGGCCTCCTGGGGAGGGGCCTGGGCAGTTCCCAGGGTGGGGTTGGGGGTGCTGAGGTGGATGGGAGGGGGCTGGCATCCTCCAAGTTCAAGCATGGACCTTCATGGTCTCCCAGGGCTGGGGCCCAGGAGCCCCTTCCTTGCAGTCCGTGCCCTGGAGATGGCGCTGCCCTGACAGCCTGAGGGGAAGGGGCCTGAGTGCCCAGCCCCAGCCTCTCCCTGATGCACTCGGCCCTCTCTCCTGCTCTTCAGGACGGCCTGGTGGAGCAGCTGTACGACCTCACCCTGGAGTACCTGCACAGCCAGGCACACTGCATCGGCTTCCCGG
AGCTGGTGCTGCCTGTGGTCCTGCAGGTGTGTGTCCTGCCCACACCGGCTCGTGGCCGACTCAGACTGTCTTATAACGGGCTTGGCTGCCCCAGGCTGGTAGGAGGTGCCCTTGTCCCGGGTACCTGGGACTGGGGTGGAACCTGACTGCTGGCAACGCGCCCCGTGAGGCCCCCTTGGAGGGGCTGATGAGGGGTTCTAAGCCATTCAGGAGGTTTGGGGGCAGGCCTGGGCAGTGGGCTGAGGACCCTGGGAGCACAACAGCCTCCTCCCCAGGTGGGAACCACAAGGCTGATCTTTGCTTCGGGGTTGGGACTGAGCATGCCGACCCTGGCTCAGGCTGGTACGCTGATCGCACACTTCCCCAGGCCATCCCGGGTGTGGGGAGTGGGTGGAGTGGCTTCTCAGGTGGCAGGAAGGCCTGGCCTGCCCCGCCAAATACCCCACATCAGCCTCATAGGAAGGCCCAGCCTGCCCCGCTAAATACCCCACGTCAGCCTCATCTTAGGCAGGAGTGGGGTGGAGGAGGGGGTTCTCCTTATCCTCAGAAGGTCCTTCTGGGCCCCCACGGGAGGTCTGTTTGCTCTCAGCCGTGATGTTTCCAGCCTCAGGGAGGCCTGTGCCTAGTGGAAGGGGTGGGGGCCCTGCTGTCTGGCCCTGGTGGCTGGAATCAGATGTGCCGACTATAGGTCTGTGCAGTGTGGGGAGGAGGGATCTGCTCACATGAGCCACAATTGGTCAGAGGCTTATCCAGATAGAGGTGTGTGCATGTGTGTGTACTCACACACGGCCACACATGTCACATGCACAGAGCCGGGACCCCCTTTCTGGGGGGCACTCACAGCAGGGGCCACAGCCTCTGTTCCTCCCATGTCCTCTTGGGTGGTGATACCTGGCATTGGGGCATCTCTGCTTCTGGACTCAAGGGCCCAGGGTCGGGGTCTGGGGTAGGGGTCAGAAAATGTTTTTGGTGAGGGGCCAAATGGTAAACGTGGCTTCCTACGGATGTTCAGCTGCACAGGCAACTGCAGAACCATGCAATGCACACGTGTGGCTGCACGTCAGCGAGACTGTATTTTATTAGTAGTAGTAGTATTGTTTGAGATGGAGTCTCGCTGTGTTGCCCAGGCTGGAGTGCAATGGTGCGATCTCGGCTCACTGCAATCTCCATCTCCTGGGCTCAAGCGATTCTCCAGCCTCAGCCTCCCAAGGAGCTGGGTTTACAGGCGCCCACCACCACACCCAGCTAATTTTGGTATTTTAGTAGAGACGGGGTTTTGCCACGTTGGCCAGGCTGGTCTTGAACTCCTGACCTCAGGTGATCCACCCGCCTCAGCCTCCCAAAGTGCTGGGATTACAGGCGAGAGCCACCACGCCTGGCATAAAACTTTATTTCCAGAGGTGAGTAGTGGGCAGGGTTGGCCCAGAGGGTGGGCTGATTCTGCCTCTGCCATCCCAGCTGCAACAGCTATGCACTTGAGCCCTGAGATGGGATCCATGTCCCCTCCTGGGGTATCCCCGTGGCCACCACGCGTGGTTTTGCACAGGACCTGGGCCAGCTGTGCACATGGAGCGGTCCTGGGCTTCAGTGGCTGACCCCTCCCTTCCGCAGCTGAAGTCGTTCCTCCGGGAGTGCAAGGTGGCCAACTACTGCCGGCAGGTGCAGCAGCTGCTTGGGAAGGTTCAGGAGAACTCGGCATACATCTGCAGCCGCCGCCAGAGGGTTTCCTTCGGCGTCTCTGAGCAGCAGGCAGTGGTTAGTGGGCCCTGGGGGTAGTGCCACCTGAGGGCACCTGCCAGGGTATAGCCCCAGCTACATGTGGGGGTTTGCCCAGGGTGAGGCATGACCCTGAACTCCCCCAACCCCCCAGGAAGCCTGGGAGAAGCTGACCCGGGAAGAGGGGACACCCCTGACCTTGTACTACAGCCACTGGCGCAAGCTGCGTGACCGGGAGATCCAGCTGGAGATCAGTGGCAAAGAGCGGGTGCGGCTCGGCGAGG
GGACCTGGGGGTGTGTTGTGACTTCCTGGGTTTCAGATCTAGCGCACTATGACTTGAGACCAGGGCGAGGGTTTGGAAACAGTGCCAGGCGGCCAGGGCCGTGCCCGGATGATTCGACTTGGAGAGGGGGTAGGTGTTGGAGAACTGGCCAGAACCAGGCGTTTCCAGGGAGGGGAAGCCCCAGGCTGCACTAGGTTGGGGAGGCCATGCCCCCTCAGGCCTGATGGGCTGGAGGCTCCGGGCAGGTGGAGTGGCTGGACTGACCTCGTCACCCAGGCCAGTATGTGGGCACCAGGGGCCCGTGAGGAGAAGCAGGAAGGGCTCTGCCTTTGACCTTGGACATGGGATGGACAACTTGGAGGATGGCTTTGTGATTTGGGAACAGAGGGGACTAGAAATTGGCCACATGGGGCCCTGGTGGTGGGTCTGGCGATGCCTGGCCCTGCTGTGGCCGCCAGCCCCTGCCCTCTCTCACCTGAGCCCCTGGTTCTTTGGCCTTCCAGCTGGAAGACCTGAACTTCCCTGAGATCAAACGAAGGAAGATGGCTGACAGGAAGGATGAGGACAGGAAGCAATTTAAAGACCTCTTTGACCTGAACAGCTCTGAAGAGGACGACACCGAGGGATTCTCGGAGAGAGGTGGGGCCTGCGTGGTGCTCCCAGGGGAAGGGTGGGCCTGGAGGGCTCTGCTGGACTTCCCAGAGCCACGAGGGCCACCTGTACCCATCCTGCAGGGGGCTCACCAGTCTCTGGCCCAGCTGGGGCCAACCTCAGTGTTGCCAGGCTTCTGGTGCCAGCGCCTTCCCTCCTTGAAGTGAAGGCCTACTGGGATTGGTAACTCTGTCCCCAGGCCTGTGACCTCCCAGTTCCTCCCCAGGGCTCCTCTCCACCTGCTGGAAGTCAGCGGAGGGAAGGGTGTTGGGAGCCTGGCCACCCTCCTGCCCCCACTGTGACTTTGCTGGTGGACCCTGTGGGTGGGAGTCATATGGACTCTGCTTCTAGTTCCTCAGGGATACTGAGGCCCCTGAGCACTCGGCATGGGGTGGAAGACGATGAAGAGGACGAGGAGGAGGGCGAGGAGGACAGCAGCAACTCGGAGGGTGAATGGTCTTGGGGTGAGAGGGTGTGGCCCTGTGAGCCCATCTGGCGGGAGGGCAGAGCCACGTGGGCGGGGGGCGTGGGGCTCTGGGCCAGGCTTTTCCCTCCCTGGGAAGGCCAGGCCAAATGCTCTGTTCTCTGGCAGCCAGCAACAGGGATAAATTAATTAGTGCCGTGATTAATTAGTGATGAGTAACCTCTAAGGCTGGCTTCTTCCTGATAAAGCAAAATTTATGTAGCCTCCATCTCTCCCCGCAGATGGAGACCCAGACGCAGAGGCGGGGCTGGCCCCTGGGGAGCTGCAGCAGCTGGCCCAGGGGCCGGAGGACGAGCTGGAGGATCTGCAGCTCTCAGAGGACGACTGAGGCAGCCCATCTGGGGGGCCTGTAGGGGCTGCCGGGCTGGTGGCCAGTGTTTCCACCTCCCTGGCAGTCAGGCCTAGAGGCTGGCGTCTGTGCAGTTGGGGGAGGCAGTAGACACGGGACAGGCTTTATTATTTATTTTTCAGCATGAAAGACCAAACGTATCGAGAGCTGGGCTGGGCTGGGCTGGTGTGGCTGCTGAAGCCCCACAGCTGTGGGCTGCTGAAGTCAGCTCCGCGGGGGAGCTGACCCTGACGTCAGCAGACCGAGACCAGTCCCAGTTCCAGGGGGAGGCCTGCAGGCCCCTGGCCCCTTCCACCACCTCTGCCCTCCGTCTGCAGACCTCGTCCATCTGCACCAGGCTCTGCCTTCACTCCCCCAAGTCTTTGAAAATTTGTTCCTTTCCTTTGAAGTCACATTTTCTTTTAAAATTTTTTGTTTTGCATCCGAAACCGAAAGAAATAAAGCGGTGGGAGGCAGGGCCATTGTGTTGAGTGGTGGCTCCTGGAGATTTGTGTGGCCCCACCCCTACCCCCGGCA
ACCTCAACACAGAGGCTGGGAAGGGTCGGGGGCCCCTGGAAGTAGAGCCAAGGTCCCATTCTCCTGCTTGGGTGAAGTTTGACCGGCAAGGGCGTGGCCCCCTCCATAGGGGGACGTGGCCGTCGTGGGGGACAAGGGCTGCTCTCCTGTGCCGAGTTCTCGCTCCGGGGCCCGCAGGGTTGGTGGCTGCAGTGGCAGAGCCACGGGGAAGCTGGCCACGTAGAAAACTCGGCCCAGGCGCCTGGCCACCTGCAGAGAGATGGCCTGGTGGGGGCTGTCGCACACCCACCCAGAGCTTTCTGCCCAGCTGTGGTCTGGAGACCCTGACCTCACTCCCCAGCGTCTCACCTGGGCCCGGATCTTGAGGGCGGGCCCCAGCTTCAGCCCCATGTTGGTCAGCAGGTGCTCCTCCGTCAGCAGTGGCAGGGTCTCCCCGTCGATCCCCTGCTCCCTGAAGACCTGGAAGGGGGAGGGAGGGAGGGTCAGGACCCGGGTGCTGCTCCATCCTCCAGCCCAGCAGAGCCAAGAGGAGCTGTTTGTGCAGCTCACGGAAATTTTTTAAAAACAGTGATCTGTTGGGTTTGAGTGTCAGGCAGAAAACACCTGGCAAAAAAAAAAAAAAAAAAAAAAGGTGGGGTGCACAGTCCAGTCCCTGCTGCGCACTGCTTTTGGGGAGGGAGAGGGCGTTGGGGAATGGCCCTACCCCACCATCCTTTCCAGGGAGGTAGTGACTGCCAGCCAGCTCCAGCCCCGCCCCAGGAACTGGGGCCCCCCCTTACCCGAGTGTACTCTCCACAGCCAGACAGGCCCCCCACGAAGCTGCAGACGTCATCCACGGTCCACTTGGTGACGTCCTCAGGGGCTGGGGCCTCCTCCCCATCCATGGAGAGTCCCCCTACCGCGCCTGAGTTGTGGGGCGTGTTACTAGGGCTCTGGCTGGCTTTCCCATACCCGCCCGTCTCCTGACCGTCGTGTGCCCCAATTGCTGAGGTGGGGGATGGGTGCCTAAGGGGAATTTTTCCTCCCCCCAAGACCCTTCCACAGGCGGCCTGCCAGTCCTGTGCCCTCTGGAAGGATCTAGAGTGTGGGGGTGCCCACCTGTGTGGAAGTAGGGGCTGACGGCATAAGGGAAGCCCAGGGGCAGTGTGGACCCTGGGAAAAGCCCCTTCCCCTCGGCGCCGGCCCCTCCAGCTGGAGCTTGGCCCGGAGTGGGCCCCCTGCACCCAACAGCTGCCGTCTCGGGGTCCTCTCCGTCCGAGTCTTTGGGGGGCTCGTCTTCCGAGCCATCTTGTGCCCAGAGCCTAGCCCCCGTCATCTCCTTGGACTCGCTGGGCCGCGCTGAGGCAGGGCCGGGACCCCCCTTCCGGGGGGCTCGCCGGGCAGAGTCCCGGGACGGGGTGGGGGGTCCGGAGCCCGGGGGCCCCTGGGGGGGCAGGGCCAGCAGTGGCGCCGCGCCGTGGTTCAGCACCAGCAGGGCCCCGCGCCGCTGCAGCTCCTCGGCGCCGTCGTTGGGGCGCAGGGCGGTCTCGGGCGCCAGCAGCTGTGGGCGCGCGCTCTCCAGCTCCTTCTGCCGCAGGAGGTCGGCGGGCAGCTCCAGCCTGGGCCGGGGGCAGACGCGGGTCAGCCGCCTCCCGGGCCGCGCGGCCCCGCCCGCCTCCCCGCACCTACCGGGCCAGGTTCTGCTTCCGCAGGAGCTCCTGCTGCCAGGCGAACATCTCCGCCTGCGCGGGGGGCAGGAAGCCGTAGCCTGGTGGGGAGGGGGACAACGCGGGGTCGGGGGGTCCGAGGCGGTCCGAGCCGGCCCTTTGCAATCCCCTACCCGGTCCGTCTCGGCTCCGACAGGGGCAGGGCCGGGTCCGCGAGCCTCCACGCGGGCTCCCAGGGGATGCGCACCCGCCTCCTCACCTGGGGTCTGGCACAGAGCCGAGGGCACCCCCAGGAAGGGGGGCCTGAGATGGGGGCCCAGGGCGACGTGAGGGGCATTCTGCGGCGACAGCAAG
GGGGGCGGCTGAGGCAGCTCCCTGTGGACGGCAGCGCAATGAGCGGCGTCCCCCCCGCCCCCGTTCCCGTAAAGGCCCCGGCGCCCCTCCCCCGCCGCCTCCGCTAATTGCCGGCCCCGCGGGCGGTCGATGCGCAGCTCGTTACGGCCCCAGGTCGCGCGCCATCCCCGCCCCCGCCGGCGGCGCCAATTAATCTCGGCGGCGGCGCGGAGGCGCTGACCCGGCGGGCGGCGACGGCTGCAGCGGTAATTACTGCGCGGGGCGCCGGCCCCGCCTGCTCCCCTCCCCTGCGGACCCGAGCGGCGGGGGAAGCGGGGGCGTCATTAGCCGCAATCCGGGCGGCGGTGGAGGCAGCTGCAGACGCCGGGCTCAGCGGCCGGGCAGGGACTCGGGCCTCAGGCGGCCCCTCCCTTCTCCAGGCGCCCCGCCCCGCCTCGTGTCCTGGACCCGAGAAACCCGGGTCTGTGCCGGAGGCCTTGTCTGTGCCGGGAACAGATCCGGCGCCGGCGTCTACATTTGAGCGCCAGCCAGGACCCCCCCTCCGCCTCCAGCTCTGAGTCCTCTGGCCGACCCCTCCCGACTGACCCCGGGGTTCCAGACTTGGAGCTCTCCCCTTCTCTGGCTTCTCCTCCAGGGAACCCCCTGATGGGCCTCAGCCTCCCTCACGCACCCCAGTGGCGCCCTCAACACTGCAGCCCAGATCGGTGTCACCCCCGAAACGCTCTGCCCGGACCCGAACCTAGAGCTGAACACAGTGCTTGGCACTCTCGGGGCGCGTGCTGGTGGCTGAGAGCGCGGGGGTGCGGGCGGCACCCCAGAAACAAGCAGAGAACGGCAGCCACCCCAGTACCTCTCCGAGAAGGACGGGGCGGCAGCTGGGGCGGCGCCCTCCCGGTGCTGAGCCAGGCCCTGCTTCCGACGCTGACCTGCCGTGGAGGAGGGCAGGTGGGCTTCCAGGCCACTGGGGCCCCTCAGAGCTGCAGCCGCCACCTCCTGCCGGACCCTCAGGAGATCTGGGTGGGGGACAACAGCAGTGATCCCCCCGGCTATGCGCCCTCATCCTCCCCAGCCCGCGTCCTGGCGTAGAACACTGAGGTGGGGAACTGACCACACCACTCGGCCGTGCTCGGCCCCTGGGCTGGTCTCGGGCACGGGGCAGGAGGATGGGCGGCCACAGGGCCGAGACGGCAGACAAGATGCTGGGCGAGGAAGTGTCGGGAAAAGCTCTTCCCCGAAGGGGAGCCGGGTGCCTGGAGGCTTCCGGCCTGAGAGGCGGCTGAGCCTTGCGGGGAGGGGGCTGTGCAGACGTCCCGAGGCCACAGAGCCTGCGGGGCAGCGAGAGAGTCCAGAAGTGCCAGCCAGGGGCTGACCGCCCAGGGGACCCCCTCTTCGGTGGGAGGCAAGAAAATCTGAGGTCCCCATAGTAGACGCGGCTGGCCAAAGCTTGTCACAAAGGCAAGGCAGCGCCCGCGCACGCACGCCACACTAGCACAGGGGTCGCCCAGTCCCGGCTGCGCAGGGTCACTGTCCCGCGTGGCCACACTCGCCCCAATGCGCACGCAGCGCCCGCGTCTAAGGCGCCGCGTCCCAGGTCTCTCGGCCTCTCCCTCCCGCCCGGGCCTCCCGAGTCCCCGCAGCCCCCACTATGCAGGCTCCGCCAGCTCCACCGCCGCCTCTGTTTATAAATTAATTACCCGAAAGCGGAGGTGGGTCCCGCCCCGGGCCGGGTGCCAGCCCGCAGAGCCCAGCCGGCGCGGCCGCCCGCATCGATCCCAGCGGGGCCCAGCGCCCCCGCGCTAGCGGCCGGGTTAGTTACAGGGTTATTTACGGCCGGCTCCGCGGCGGGGCGGGGGGGGGGGGCGGCGGCGCGGCCTGATTGACAGCGCGCTCCCCGGCGGCCGGCGCCCCTCCCCCGCGCGGCCAGCAGAGCGGCCCCAGGCAGAGCAGGGAAAGCAATTAAAAAGGAGGATTTTTACATTATTAACATTTGGTGAATTATT
C
>chr1 pos:881043 gene_name:NOC2L strand:-
TGTGCATATCAGTTCATGTGTGCATCTGTATGTGTGTATGCACGTGTATCCATGAATGCCTGTGTGCCTGCAGGTGTGTGCATCTGTGCGTGTGTACACCTGTGTGTATGCATGTGTGTACCTTTGCGTGTACCTTTGCGTGTGTGCACCTGTGCATGTGTCTTTGTATACCAGTGTGTACCTGTGTGTACCTGTATGCATGCACATGCGTGTGTACCTGTGTGCACCTGTCTGCATGTGTGTACCTGTGCGTGTGTGCACCTGTGTGCATGCATTTGCGTCTGCATGTGTCTACCTGTGCATGCATGAACCTTTGCATGTGTGCATCTGTGTGCATGCATGTGTGTCTGCGTGCATCTATGTACCTGTGTGCACCTATGTGCATACACACGTGTCTGTGTGCACTTGTGTGCATGCATTTGCATCTGCATGTGTGTACTGTGCGTGTGTACCTGTGCGTGTACCTGTACACTTGGGTGCATGCATGCACGTCTGCTTGTGTGTGCCTGTGCGTGTGTGCACCTGTGTGCATGTACGTGCGTCTACGTGTGTGCCTGTGCATGTGTGCGCCTGTGTGTACCTGTGTGTATGCATCTTTGCACGTGCACATGCCACTCAGGTAGGGAAAGATTGGAGTCCGAAGTTTCAACTTTCAGTGGGTGGGTTAGGCCCCACCCCGCTGTAAATTTAGGAATTCACGATCTCCACCCTGTTTATCTAATGAGTTCTCAGCCTCATGAAGGCCCAGAGTCGTGTCACAGCTGTCCTTGGGGCTGGGTCCCCAGGTTGCTGGGTCCAGAAGGTATGGAAGCCCCAGGCACGTTCTGATTCCCCTTCCACTGAGGCAGGGATGCTGAACATCTTTAGGAAGCCATGTTCACTCCCATGGCAGCCAGCAGTGGTCTCTGATTGCCCAGCCCTTGGCCTGGCCCCTGTGTCTGTGGGCCTCCAGCTCTGCTGCCCAGCTCCAGGCATGCTTTGTGTCTGTTTCCCTTGTCCAATCTCCTTGGCTACGTGCTTTCTTACTCTCTTGCAGTGTCTGTTTCTTCACTTGTGCACTGCCCTGGTTCACTGCAGCCGCACCCTGTAGGCCCCTCTCACGCAGGGATGCAGGCCTCTCCTCTCCGGAAAAAGCAAACCCTAAAAGCTAAAACAAAGCCCTCAGCTGTAGGCCGTGCCTGCCCTTCCCCGGTGCCTGGACAGGAAGCCAGTCGCCTGCCCATACTTTTGGCCCAGGCTAGAGAAGGGCAGTGTCCTCCCAGAGGTTCATCAGTACCAGGGCGTTTTCCCATCTGGACCTGAGCTCAGCTGTCTGGCAGCCACCCCTGCTGAGTGGGGTGTCTTGCTGGGGCCTCCACCCTTGGGCCCCCCATAATCTGCTTCTGTCCTCTGGTGCCCCAGCATGTACCCTGGATCTCTCTGGTTCACAGCCTGAGGGCTCCTAGTGGTTGGGGAGGGGTCACAAGACTGAGAGGCCAGGCTGACTCTTTCTCTGCTCCTCCTGGCATGTCCTACGGAGGTGCATGGCCTGTGGCTTCTGTGGAGGGTGTGGGAGGGGCCCCCCAGGCCTCCCGTGACCTCCATCTGTCCCGTCCTGTGTCTGGCACTCTTTGCTGTTGCTGCTGCGTCTTCTGGTTGCTCGGGACGGAGCCCCATGTGGCATTGCTGTGCTGAGGGCCAGGATGGGCCTCAGTGCCATGTTGTCAGGAATGGGGGCTGTCCTGGTACTCTGTGTGGCAGGGACCTCTAGGTCTCCAGACGTGGGTCCTTAGTGCTTCCCAGGATTTTGGGAGAGGGCCCGTGTTCCTGATCCTTCCCTGCTGATCAGAGCCCCACTCGGGGACACGCCAGGCTGTGTGGGGCCATGGGGCTGGGACCGTGCCTAGCTGCTTATCTCTTGTTTCGGGTTGGGTCTCCTCGTGCTGAAGCCTGAGGACCAGGGTGACCAGGGTGCAGCCAGGTGCAGGGCCAAAGGGACCAGGGGGACCAGGGTGCAGCCA
GGGTGAAGCCAGGGTGACCAGGCATGGGGCCGAGAGAGCCTGACACGGCCCTTGGGGCAGATGTTCCAGCAGGTCGACTTCAACAGGAAGCCAGGGCGCATGAGCTCCAAGCCCATCAACTTCTCCGTGATCCTGAAGCTGTCCAATGTCAACCTGCAGGAGAAGGCGTACCGGGTGAGGCTGGCCTCCTGGGGAGGGGCCTGGGCAGTTCCCAGGGTGGGGTTGGGGGTGCTGAGGTGGATGGGAGGGGGCTGGCATCCTCCAAGTTCAAGCATGGACCTTCATGGTCTCCCAGGGCTGGGGCCCAGGAGCCCCTTCCTTGCAGTCCGTGCCCTGGAGATGGCGCTGCCCTGACAGCCTGAGGGGAAGGGGCCTGAGTGCCCAGCCCCAGCCTCTCCCTGATGCACTCGGCCCTCTCTCCTGCTCTTCAGGACGGCCTGGTGGAGCAGCTGTACGACCTCACCCTGGAGTACCTGCACAGCCAGGCACACTGCATCGGCTTCCCGGAGCTGGTGCTGCCTGTGGTCCTGCAGGTGTGTGTCCTGCCCACACCGGCTCGTGGCCGACTCAGACTGTCTTATAACGGGCTTGGCTGCCCCAGGCTGGTAGGAGGTGCCCTTGTCCCGGGTACCTGGGACTGGGGTGGAACCTGACTGCTGGCAACGCGCCCCGTGAGGCCCCCTTGGAGGGGCTGATGAGGGGTTCTAAGCCATTCAGGAGGTTTGGGGGCAGGCCTGGGCAGTGGGCTGAGGACCCTGGGAGCACAACAGCCTCCTCCCCAGGTGGGAACCACAAGGCTGATCTTTGCTTCGGGGTTGGGACTGAGCATGCCGACCCTGGCTCAGGCTGGTACGCTGATCGCACACTTCCCCAGGCCATCCCGGGTGTGGGGAGTGGGTGGAGTGGCTTCTCAGGTGGCAGGAAGGCCTGGCCTGCCCCGCCAAATACCCCACATCAGCCTCATAGGAAGGCCCAGCCTGCCCCGCTAAATACCCCACGTCAGCCTCATCTTAGGCAGGAGTGGGGTGGAGGAGGGGGTTCTCCTTATCCTCAGAAGGTCCTTCTGGGCCCCCACGGGAGGTCTGTTTGCTCTCAGCCGTGATGTTTCCAGCCTCAGGGAGGCCTGTGCCTAGTGGAAGGGGTGGGGGCCCTGCTGTCTGGCCCTGGTGGCTGGAATCAGATGTGCCGACTATAGGTCTGTGCAGTGTGGGGAGGAGGGATCTGCTCACATGAGCCACAATTGGTCAGAGGCTTATCCAGATAGAGGTGTGTGCATGTGTGTGTACTCACACACGGCCACACATGTCACATGCACAGAGCCGGGACCCCCTTTCTGGGGGGCACTCACAGCAGGGGCCACAGCCTCTGTTCCTCCCATGTCCTCTTGGGTGGTGATACCTGGCATTGGGGCATCTCTGCTTCTGGACTCAAGGGCCCAGGGTCGGGGTCTGGGGTAGGGGTCAGAAAATGTTTTTGGTGAGGGGCCAAATGGTAAACGTGGCTTCCTACGGATGTTCAGCTGCACAGGCAACTGCAGAACCATGCAATGCACACGTGTGGCTGCACGTCAGCGAGACTGTATTTTATTAGTAGTAGTAGTATTGTTTGAGATGGAGTCTCGCTGTGTTGCCCAGGCTGGAGTGCAATGGTGCGATCTCGGCTCACTGCAATCTCCATCTCCTGGGCTCAAGCGATTCTCCAGCCTCAGCCTCCCAAGGAGCTGGGTTTACAGGCGCCCACCACCACACCCAGCTAATTTTGGTATTTTAGTAGAGACGGGGTTTTGCCACGTTGGCCAGGCTGGTCTTGAACTCCTGACCTCAGGTGATCCACCCGCCTCAGCCTCCCAAAGTGCTGGGATTACAGGCGAGAGCCACCACGCCTGGCATAAAACTTTATTTCCAGAGGTGAGTAGTGGGCAGGGTTGGCCCAGAGGGTGGGCTGATTCTGCCTCTGCCATCCCAGCTGCAACAGCTATGCACTTGAGCCCTGAGATGGGA
TCCATGTCCCCTCCTGGGGTATCCCCGTGGCCACCACGCGTGGTTTTGCACAGGACCTGGGCCAGCTGTGCACATGGAGCGGTCCTGGGCTTCAGTGGCTGACCCCTCCCTTCCGCAGCTGAAGTCGTTCCTCCGGGAGTGCAAGGTGGCCAACTACTGCCGGCAGGTGCAGCAGCTGCTTGGGAAGGTTCAGGAGAACTCGGCATACATCTGCAGCCGCCGCCAGAGGGTTTCCTTCGGCGTCTCTGAGCAGCAGGCAGTGGTTAGTGGGCCCTGGGGGTAGTGCCACCTGAGGGCACCTGCCAGGGTATAGCCCCAGCTACATGTGGGGGTTTGCCCAGGGTGAGGCATGACCCTGAACTCCCCCAACCCCCCAGGAAGCCTGGGAGAAGCTGACCCGGGAAGAGGGGACACCCCTGACCTTGTACTACAGCCACTGGCGCAAGCTGCGTGACCGGGAGATCCAGCTGGAGATCAGTGGCAAAGAGCGGGTGCGGCTCGGCGAGGGGACCTGGGGGTGTGTTGTGACTTCCTGGGTTTCAGATCTAGCGCACTATGACTTGAGACCAGGGCGAGGGTTTGGAAACAGTGCCAGGCGGCCAGGGCCGTGCCCGGATGATTCGACTTGGAGAGGGGGTAGGTGTTGGAGAACTGGCCAGAACCAGGCGTTTCCAGGGAGGGGAAGCCCCAGGCTGCACTAGGTTGGGGAGGCCATGCCCCCTCAGGCCTGATGGGCTGGAGGCTCCGGGCAGGTGGAGTGGCTGGACTGACCTCGTCACCCAGGCCAGTATGTGGGCACCAGGGGCCCGTGAGGAGAAGCAGGAAGGGCTCTGCCTTTGACCTTGGACATGGGATGGACAACTTGGAGGATGGCTTTGTGATTTGGGAACAGAGGGGACTAGAAATTGGCCACATGGGGCCCTGGTGGTGGGTCTGGCGATGCCTGGCCCTGCTGTGGCCGCCAGCCCCTGCCCTCTCTCACCTGAGCCCCTGGTTCTTTAGCCTTCCAGCTGGAAGACCTGAACTTCCCTGAGATCAAACGAAGGAAGATGGCTGACAGGAAGGATGAGGACAGGAAGCAATTTAAAGACCTCTTTGACCTGAACAGCTCTGAAGAGGACGACACCGAGGGATTCTCGGAGAGAGGTGGGGCCTGCGTGGTGCTCCCAGGGGAAGGGTGGGCCTGGAGGGCTCTGCTGGACTTCCCAGAGCCACGAGGGCCACCTGTACCCATCCTGCAGGGGGCTCACCAGTCTCTGGCCCAGCTGGGGCCAACCTCAGTGTTGCCAGGCTTCTGGTGCCAGCGCCTTCCCTCCTTGAAGTGAAGGCCTACTGGGATTGGTAACTCTGTCCCCAGGCCTGTGACCTCCCAGTTCCTCCCCAGGGCTCCTCTCCACCTGCTGGAAGTCAGCGGAGGGAAGGGTGTTGGGAGCCTGGCCACCCTCCTGCCCCCACTGTGACTTTGCTGGTGGACCCTGTGGGTGGGAGTCATATGGACTCTGCTTCTTGTTCCTCAGGGATACTGAGGCCCCTGAGCACTCGGCATGGGGTGGAAGACGATGAAGAGGACGAGGAGGAGGGCGAGGAGGACAGCAGCAACTCGGAGGGTGAATGGTCTTGGGGTGAGAGGGTGTGGCCCTGTGAGCCCATCTGGCGGGAGGGCAGAGCCACGTGGGCGGGGGGCGTGGGGCTCTGGGCCAGGCTTTTCCCTCCCTGGGAAGGCCAGGCCAAATGCTCTGTTCTCTGGCAGCCAGCAACAGGGATAAATTAATTAGTGCCGTGATTAATTAGTGATGAGTAACCTCTAAGGCTGGCTTCTTCCTGATAAAGCAAAATTTATGTAGCCTCCATCTCTCCCCGCAGATGGAGACCCAGACGCAGAGGCGGGGCTGGCCCCTGGGGAGCTGCAGCAGCTGGCCCAGGGGCCGGAGGACGAGCTGGAGGATCTGCAGCTCTCAGAGGACGACTGAGGCAGCCCATCTGGGGGGCCTGTAGGGGCT
GCCGGGCTGGTGGCCAGTGTTTCCACCTCCCTGGCAGTCAGGCCTAGAGGCTGGCGTCTGTGCAGTTGGGGGAGGCAGTAGACACGGGACAGGCTTTATTATTTATTTTTCAGCATGAAAGACCAAACGTATCGAGAGCTGGGCTGGGCTGGGCTGGTGTGGCTGCTGAAGCCCCACAGCTGTGGGCTGCTGAAGTCAGCTCCGCGGGGGAGCTGACCCTGACGTCAGCAGACCGAGACCAGTCCCAGTTCCAGGGGGAGGCCTGCAGGCCCCTGGCCCCTTCCACCACCTCTGCCCTCCGTCTGCAGACCTCGTCCATCTGCACCAGGCTCTGCCTTCACTCCCCCAAGTCTTTGAAAATTTGTTCCTTTCCTTTGAAGTCACATTTTCTTTTAAAATTTTTTGTTTTGCATCCGAAACCGAAAGAAATAAAGCGGTGGGAGGCAGGGCCATTGTGTTGAGTGGTGGCTCCTGGAGATTTGTGTGGCCCCACCCCTACCCCCGGCAACCTCAACACAGAGGCTGGGAAGGGTCGGGGGCCCCTGGAAGTAGAGCCAAGGTCCCATTCTCCTGCTTGGGTGAAGTTTGACCGGCAAGGGCGTGGCCCCCTCCATAGGGGGACGTGGCCGTCGTGGGGGACAAGGGCTGCTCTCCTGTGCCGAGTTCTCGCTCCGGGGCCCGCAGGGTTGGTGGCTGCAGTGGCAGAGCCACGGGGAAGCTGGCCACGTAGAAAACTCGGCCCAGGCGCCTGGCCACCTGCAGAGAGATGGCCTGGTGGGGGCTGTCGCACACCCACCCAGAGCTTTCTGCCCAGCTGTGGTCTGGAGACCCTGACCTCACTCCCCAGCGTCTCACCTGGGCCCGGATCTTGAGGGCGGGCCCCAGCTTCAGCCCCATGTTGGTCAGCAGGTGCTCCTCCGTCAGCAGTGGCAGGGTCTCCCCGTCGATCCCCTGCTCCCTGAAGACCTGGAAGGGGGAGGGAGGGAGGGTCAGGACCCGGGTGCTGCTCCATCCTCCAGCCCAGCAGAGCCAAGAGGAGCTGTTTGTGCAGCTCACGGAAATTTTTTAAAAACAGTGATCTGTTGGGTTTGAGTGTCAGGCAGAAAACACCTGGCAAAAAAAAAAAAAAAAAAAAAAGGTGGGGTGCACAGTCCAGTCCCTGCTGCGCACTGCTTTTGGGGAGGGAGAGGGCGTTGGGGAATGGCCCTACCCCACCATCCTTTCCAGGGAGGTAGTGACTGCCAGCCAGCTCCAGCCCCGCCCCAGGAACTGGGGCCCCCCCTTACCCGAGTGTACTCTCCACAGCCAGACAGGCCCCCCACGAAGCTGCAGACGTCATCCACGGTCCACTTGGTGACGTCCTCAGGGGCTGGGGCCTCCTCCCCATCCATGGAGAGTCCCCCTACCGCGCCTGAGTTGTGGGGCGTGTTACTAGGGCTCTGGCTGGCTTTCCCATACCCGCCCGTCTCCTGACCGTCGTGTGCCCCAATTGCTGAGGTGGGGGATGGGTGCCTAAGGGGAATTTTTCCTCCCCCCAAGACCCTTCCACAGGCGGCCTGCCAGTCCTGTGCCCTCTGGAAGGATCTAGAGTGTGGGGGTGCCCACCTGTGTGGAAGTAGGGGCTGACGGCATAAGGGAAGCCCAGGGGCAGTGTGGACCCTGGGAAAAGCCCCTTCCCCTCGGCGCCGGCCCCTCCAGCTGGAGCTTGGCCCGGAGTGGGCCCCCTGCACCCAACAGCTGCCGTCTCGGGGTCCTCTCCGTCCGAGTCTTTGGGGGGCTCGTCTTCCGAGCCATCTTGTGCCCAGAGCCTAGCCCCCGTCATCTCCTTGGACTCGCTGGGCCGCGCTGAGGCAGGGCCGGGACCCCCCTTCCGGGGGGCTCGCCGGGCAGAGTCCCGGGACGGGGTGGGGGGTCCGGAGCCCGGGGGCCCCTGGGGGGGCAGGGCCAGCAGTGGCGCCGCGCCGTGGTTCAGCACCAGCAGGGCCCCGCGCCGCTGCAGC
TCCTCGGCGCCGTCGTTGGGGCGCAGGGCGGTCTCGGGCGCCAGCAGCTGTGGGCGCGCGCTCTCCAGCTCCTTCTGCCGCAGGAGGTCGGCGGGCAGCTCCAGCCTGGGCCGGGGGCAGACGCGGGTCAGCCGCCTCCCGGGCCGCGCGGCCCCGCCCGCCTCCCCGCACCTACCGGGCCAGGTTCTGCTTCCGCAGGAGCTCCTGCTGCCAGGCGAACATCTCCGCCTGCGCGGGGGGCAGGAAGCCGTAGCCTGGTGGGGAGGGGGACAACGCGGGGTCGGGGGGTCCGAGGCGGTCCGAGCCGGCCCTTTGCAATCCCCTACCCGGTCCGTCTCGGCTCCGACAGGGGCAGGGCCGGGTCCGCGAGCCTCCACGCGGGCTCCCAGGGGATGCGCACCCGCCTCCTCACCTGGGGTCTGGCACAGAGCCGAGGGCACCCCCAGGAAGGGGGGCCTGAGATGGGGGCCCAGGGCGACGTGAGGGGCATTCTGCGGCGACAGCAAGGGGGGCGGCTGAGGCAGCTCCCTGTGGACGGCAGCGCAATGAGCGGCGTCCCCCCCGCCCCCGTTCCCGTAAAGGCCCCGGCGCCCCTCCCCCGCCGCCTCCGCTAATTGCCGGCCCCGCGGGCGGTCGATGCGCAGCTCGTTACGGCCCCAGGTCGCGCGCCATCCCCGCCCCCGCCGGCGGCGCCAATTAATCTCGGCGGCGGCGCGGAGGCGCTGACCCGGCGGGCGGCGACGGCTGCAGCGGTAATTACTGCGCGGGGCGCCGGCCCCGCCTGCTCCCCTCCCCTGCGGACCCGAGCGGCGGGGGAAGCGGGGGCGTCATTAGCCGCAATCCGGGCGGCGGTGGAGGCAGCTGCAGACGCCGGGCTCAGCGGCCGGGCAGGGACTCGGGCCTCAGGCGGCCCCTCCCTTCTCCAGGCGCCCCGCCCCGCCTCGTGTCCTGGACCCGAGAAACCCGGGTCTGTGCCGGAGGCCTTGTCTGTGCCGGGAACAGATCCGGCGCCGGCGTCTACATTTGAGCGCCAGCCAGGACCCCCCCTCCGCCTCCAGCTCTGAGTCCTCTGGCCGACCCCTCCCGACTGACCCCGGGGTTCCAGACTTGGAGCTCTCCCCTTCTCTGGCTTCTCCTCCAGGGAACCCCCTGATGGGCCTCAGCCTCCCTCACGCACCCCAGTGGCGCCCTCAACACTGCAGCCCAGATCGGTGTCACCCCCGAAACGCTCTGCCCGGACCCGAACCTAGAGCTGAACACAGTGCTTGGCACTCTCGGGGCGCGTGCTGGTGGCTGAGAGCGCGGGGGTGCGGGCGGCACCCCAGAAACAAGCAGAGAACGGCAGCCACCCCAGTACCTCTCCGAGAAGGACGGGGCGGCAGCTGGGGCGGCGCCCTCCCGGTGCTGAGCCAGGCCCTGCTTCCGACGCTGACCTGCCGTGGAGGAGGGCAGGTGGGCTTCCAGGCCACTGGGGCCCCTCAGAGCTGCAGCCGCCACCTCCTGCCGGACCCTCAGGAGATCTGGGTGGGGGACAACAGCAGTGATCCCCCCGGCTATGCGCCCTCATCCTCCCCAGCCCGCGTCCTGGCGTAGAACACTGAGGTGGGGAACTGACCACACCACTCGGCCGTGCTCGGCCCCTGGGCTGGTCTCGGGCACGGGGCAGGAGGATGGGCGGCCACAGGGCCGAGACGGCAGACAAGATGCTGGGCGAGGAAGTGTCGGGAAAAGCTCTTCCCCGAAGGGGAGCCGGGTGCCTGGAGGCTTCCGGCCTGAGAGGCGGCTGAGCCTTGCGGGGAGGGGGCTGTGCAGACGTCCCGAGGCCACAGAGCCTGCGGGGCAGCGAGAGAGTCCAGAAGTGCCAGCCAGGGGCTGACCGCCCAGGGGACCCCCTCTTCGGTGGGAGGCAAGAAAATCTGAGGTCCCCATAGTAGACGCGGCTGGCCAAAGCTTGTCACAAAGGCAAGGCAGCGCCCGCGCACGCACGCCACACTAGCACA
G
>chr1 pos:881044 gene_name:NOC2L strand:-
GTGTGCATATCAGTTCATGTGTGCATCTGTATGTGTGTATGCACGTGTATCCATGAATGCCTGTGTGCCTGCAGGTGTGTGCATCTGTGCGTGTGTACACCTGTGTGTATGCATGTGTGTACCTTTGCGTGTACCTTTGCGTGTGTGCACCTGTGCATGTGTCTTTGTATACCAGTGTGTACCTGTGTGTACCTGTATGCATGCACATGCGTGTGTACCTGTGTGCACCTGTCTGCATGTGTGTACCTGTGCGTGTGTGCACCTGTGTGCATGCATTTGCGTCTGCATGTGTCTACCTGTGCATGCATGAACCTTTGCATGTGTGCATCTGTGTGCATGCATGTGTGTCTGCGTGCATCTATGTACCTGTGTGCACCTATGTGCATACACACGTGTCTGTGTGCACTTGTGTGCATGCATTTGCATCTGCATGTGTGTACTGTGCGTGTGTACCTGTGCGTGTACCTGTACACTTGGGTGCATGCATGCACGTCTGCTTGTGTGTGCCTGTGCGTGTGTGCACCTGTGTGCATGTACGTGCGTCTACGTGTGTGCCTGTGCATGTGTGCGCCTGTGTGTACCTGTGTGTATGCATCTTTGCACGTGCACATGCCACTCAGGTAGGGAAAGATTGGAGTCCGAAGTTTCAACTTTCAGTGGGTGGGTTAGGCCCCACCCCGCTGTAAATTTAGGAATTCACGATCTCCACCCTGTTTATCTAATGAGTTCTCAGCCTCATGAAGGCCCAGAGTCGTGTCACAGCTGTCCTTGGGGCTGGGTCCCCAGGTTGCTGGGTCCAGAAGGTATGGAAGCCCCAGGCACGTTCTGATTCCCCTTCCACTGAGGCAGGGATGCTGAACATCTTTAGGAAGCCATGTTCACTCCCATGGCAGCCAGCAGTGGTCTCTGATTGCCCAGCCCTTGGCCTGGCCCCTGTGTCTGTGGGCCTCCAGCTCTGCTGCCCAGCTCCAGGCATGCTTTGTGTCTGTTTCCCTTGTCCAATCTCCTTGGCTACGTGCTTTCTTACTCTCTTGCAGTGTCTGTTTCTTCACTTGTGCACTGCCCTGGTTCACTGCAGCCGCACCCTGTAGGCCCCTCTCACGCAGGGATGCAGGCCTCTCCTCTCCGGAAAAAGCAAACCCTAAAAGCTAAAACAAAGCCCTCAGCTGTAGGCCGTGCCTGCCCTTCCCCGGTGCCTGGACAGGAAGCCAGTCGCCTGCCCATACTTTTGGCCCAGGCTAGAGAAGGGCAGTGTCCTCCCAGAGGTTCATCAGTACCAGGGCGTTTTCCCATCTGGACCTGAGCTCAGCTGTCTGGCAGCCACCCCTGCTGAGTGGGGTGTCTTGCTGGGGCCTCCACCCTTGGGCCCCCCATAATCTGCTTCTGTCCTCTGGTGCCCCAGCATGTACCCTGGATCTCTCTGGTTCACAGCCTGAGGGCTCCTAGTGGTTGGGGAGGGGTCACAAGACTGAGAGGCCAGGCTGACTCTTTCTCTGCTCCTCCTGGCATGTCCTACGGAGGTGCATGGCCTGTGGCTTCTGTGGAGGGTGTGGGAGGGGCCCCCCAGGCCTCCCGTGACCTCCATCTGTCCCGTCCTGTGTCTGGCACTCTTTGCTGTTGCTGCTGCGTCTTCTGGTTGCTCGGGACGGAGCCCCATGTGGCATTGCTGTGCTGAGGGCCAGGATGGGCCTCAGTGCCATGTTGTCAGGAATGGGGGCTGTCCTGGTACTCTGTGTGGCAGGGACCTCTAGGTCTCCAGACGTGGGTCCTTAGTGCTTCCCAGGATTTTGGGAGAGGGCCCGTGTTCCTGATCCTTCCCTGCTGATCAGAGCCCCACTCGGGGACACGCCAGGCTGTGTGGGGCCATGGGGCTGGGACCGTGCCTAGCTGCTTATCTCTTGTTTCGGGTTGGGTCTCCTCGTGCTGAAGCCTGAGGACCAGGGTGACCAGGGTGCAGCCAGGTGCAGGGCCAAAGGGACCAGGGGGACCAGGGTGCAGCC
AGGGTGAAGCCAGGGTGACCAGGCATGGGGCCGAGAGAGCCTGACACGGCCCTTGGGGCAGATGTTCCAGCAGGTCGACTTCAACAGGAAGCCAGGGCGCATGAGCTCCAAGCCCATCAACTTCTCCGTGATCCTGAAGCTGTCCAATGTCAACCTGCAGGAGAAGGCGTACCGGGTGAGGCTGGCCTCCTGGGGAGGGGCCTGGGCAGTTCCCAGGGTGGGGTTGGGGGTGCTGAGGTGGATGGGAGGGGGCTGGCATCCTCCAAGTTCAAGCATGGACCTTCATGGTCTCCCAGGGCTGGGGCCCAGGAGCCCCTTCCTTGCAGTCCGTGCCCTGGAGATGGCGCTGCCCTGACAGCCTGAGGGGAAGGGGCCTGAGTGCCCAGCCCCAGCCTCTCCCTGATGCACTCGGCCCTCTCTCCTGCTCTTCAGGACGGCCTGGTGGAGCAGCTGTACGACCTCACCCTGGAGTACCTGCACAGCCAGGCACACTGCATCGGCTTCCCGGAGCTGGTGCTGCCTGTGGTCCTGCAGGTGTGTGTCCTGCCCACACCGGCTCGTGGCCGACTCAGACTGTCTTATAACGGGCTTGGCTGCCCCAGGCTGGTAGGAGGTGCCCTTGTCCCGGGTACCTGGGACTGGGGTGGAACCTGACTGCTGGCAACGCGCCCCGTGAGGCCCCCTTGGAGGGGCTGATGAGGGGTTCTAAGCCATTCAGGAGGTTTGGGGGCAGGCCTGGGCAGTGGGCTGAGGACCCTGGGAGCACAACAGCCTCCTCCCCAGGTGGGAACCACAAGGCTGATCTTTGCTTCGGGGTTGGGACTGAGCATGCCGACCCTGGCTCAGGCTGGTACGCTGATCGCACACTTCCCCAGGCCATCCCGGGTGTGGGGAGTGGGTGGAGTGGCTTCTCAGGTGGCAGGAAGGCCTGGCCTGCCCCGCCAAATACCCCACATCAGCCTCATAGGAAGGCCCAGCCTGCCCCGCTAAATACCCCACGTCAGCCTCATCTTAGGCAGGAGTGGGGTGGAGGAGGGGGTTCTCCTTATCCTCAGAAGGTCCTTCTGGGCCCCCACGGGAGGTCTGTTTGCTCTCAGCCGTGATGTTTCCAGCCTCAGGGAGGCCTGTGCCTAGTGGAAGGGGTGGGGGCCCTGCTGTCTGGCCCTGGTGGCTGGAATCAGATGTGCCGACTATAGGTCTGTGCAGTGTGGGGAGGAGGGATCTGCTCACATGAGCCACAATTGGTCAGAGGCTTATCCAGATAGAGGTGTGTGCATGTGTGTGTACTCACACACGGCCACACATGTCACATGCACAGAGCCGGGACCCCCTTTCTGGGGGGCACTCACAGCAGGGGCCACAGCCTCTGTTCCTCCCATGTCCTCTTGGGTGGTGATACCTGGCATTGGGGCATCTCTGCTTCTGGACTCAAGGGCCCAGGGTCGGGGTCTGGGGTAGGGGTCAGAAAATGTTTTTGGTGAGGGGCCAAATGGTAAACGTGGCTTCCTACGGATGTTCAGCTGCACAGGCAACTGCAGAACCATGCAATGCACACGTGTGGCTGCACGTCAGCGAGACTGTATTTTATTAGTAGTAGTAGTATTGTTTGAGATGGAGTCTCGCTGTGTTGCCCAGGCTGGAGTGCAATGGTGCGATCTCGGCTCACTGCAATCTCCATCTCCTGGGCTCAAGCGATTCTCCAGCCTCAGCCTCCCAAGGAGCTGGGTTTACAGGCGCCCACCACCACACCCAGCTAATTTTGGTATTTTAGTAGAGACGGGGTTTTGCCACGTTGGCCAGGCTGGTCTTGAACTCCTGACCTCAGGTGATCCACCCGCCTCAGCCTCCCAAAGTGCTGGGATTACAGGCGAGAGCCACCACGCCTGGCATAAAACTTTATTTCCAGAGGTGAGTAGTGGGCAGGGTTGGCCCAGAGGGTGGGCTGATTCTGCCTCTGCCATCCCAGCTGCAACAGCTATGCACTTGAGCCCTGAGATGGG
ATCCATGTCCCCTCCTGGGGTATCCCCGTGGCCACCACGCGTGGTTTTGCACAGGACCTGGGCCAGCTGTGCACATGGAGCGGTCCTGGGCTTCAGTGGCTGACCCCTCCCTTCCGCAGCTGAAGTCGTTCCTCCGGGAGTGCAAGGTGGCCAACTACTGCCGGCAGGTGCAGCAGCTGCTTGGGAAGGTTCAGGAGAACTCGGCATACATCTGCAGCCGCCGCCAGAGGGTTTCCTTCGGCGTCTCTGAGCAGCAGGCAGTGGTTAGTGGGCCCTGGGGGTAGTGCCACCTGAGGGCACCTGCCAGGGTATAGCCCCAGCTACATGTGGGGGTTTGCCCAGGGTGAGGCATGACCCTGAACTCCCCCAACCCCCCAGGAAGCCTGGGAGAAGCTGACCCGGGAAGAGGGGACACCCCTGACCTTGTACTACAGCCACTGGCGCAAGCTGCGTGACCGGGAGATCCAGCTGGAGATCAGTGGCAAAGAGCGGGTGCGGCTCGGCGAGGGGACCTGGGGGTGTGTTGTGACTTCCTGGGTTTCAGATCTAGCGCACTATGACTTGAGACCAGGGCGAGGGTTTGGAAACAGTGCCAGGCGGCCAGGGCCGTGCCCGGATGATTCGACTTGGAGAGGGGGTAGGTGTTGGAGAACTGGCCAGAACCAGGCGTTTCCAGGGAGGGGAAGCCCCAGGCTGCACTAGGTTGGGGAGGCCATGCCCCCTCAGGCCTGATGGGCTGGAGGCTCCGGGCAGGTGGAGTGGCTGGACTGACCTCGTCACCCAGGCCAGTATGTGGGCACCAGGGGCCCGTGAGGAGAAGCAGGAAGGGCTCTGCCTTTGACCTTGGACATGGGATGGACAACTTGGAGGATGGCTTTGTGATTTGGGAACAGAGGGGACTAGAAATTGGCCACATGGGGCCCTGGTGGTGGGTCTGGCGATGCCTGGCCCTGCTGTGGCCGCCAGCCCCTGCCCTCTCTCACCTGAGCCCCTGGTTCTTAGGCCTTCCAGCTGGAAGACCTGAACTTCCCTGAGATCAAACGAAGGAAGATGGCTGACAGGAAGGATGAGGACAGGAAGCAATTTAAAGACCTCTTTGACCTGAACAGCTCTGAAGAGGACGACACCGAGGGATTCTCGGAGAGAGGTGGGGCCTGCGTGGTGCTCCCAGGGGAAGGGTGGGCCTGGAGGGCTCTGCTGGACTTCCCAGAGCCACGAGGGCCACCTGTACCCATCCTGCAGGGGGCTCACCAGTCTCTGGCCCAGCTGGGGCCAACCTCAGTGTTGCCAGGCTTCTGGTGCCAGCGCCTTCCCTCCTTGAAGTGAAGGCCTACTGGGATTGGTAACTCTGTCCCCAGGCCTGTGACCTCCCAGTTCCTCCCCAGGGCTCCTCTCCACCTGCTGGAAGTCAGCGGAGGGAAGGGTGTTGGGAGCCTGGCCACCCTCCTGCCCCCACTGTGACTTTGCTGGTGGACCCTGTGGGTGGGAGTCATATGGACTCTGCTTCTTGTTCCTCAGGGATACTGAGGCCCCTGAGCACTCGGCATGGGGTGGAAGACGATGAAGAGGACGAGGAGGAGGGCGAGGAGGACAGCAGCAACTCGGAGGGTGAATGGTCTTGGGGTGAGAGGGTGTGGCCCTGTGAGCCCATCTGGCGGGAGGGCAGAGCCACGTGGGCGGGGGGCGTGGGGCTCTGGGCCAGGCTTTTCCCTCCCTGGGAAGGCCAGGCCAAATGCTCTGTTCTCTGGCAGCCAGCAACAGGGATAAATTAATTAGTGCCGTGATTAATTAGTGATGAGTAACCTCTAAGGCTGGCTTCTTCCTGATAAAGCAAAATTTATGTAGCCTCCATCTCTCCCCGCAGATGGAGACCCAGACGCAGAGGCGGGGCTGGCCCCTGGGGAGCTGCAGCAGCTGGCCCAGGGGCCGGAGGACGAGCTGGAGGATCTGCAGCTCTCAGAGGACGACTGAGGCAGCCCATCTGGGGGGCCTGTAGGGGC
TGCCGGGCTGGTGGCCAGTGTTTCCACCTCCCTGGCAGTCAGGCCTAGAGGCTGGCGTCTGTGCAGTTGGGGGAGGCAGTAGACACGGGACAGGCTTTATTATTTATTTTTCAGCATGAAAGACCAAACGTATCGAGAGCTGGGCTGGGCTGGGCTGGTGTGGCTGCTGAAGCCCCACAGCTGTGGGCTGCTGAAGTCAGCTCCGCGGGGGAGCTGACCCTGACGTCAGCAGACCGAGACCAGTCCCAGTTCCAGGGGGAGGCCTGCAGGCCCCTGGCCCCTTCCACCACCTCTGCCCTCCGTCTGCAGACCTCGTCCATCTGCACCAGGCTCTGCCTTCACTCCCCCAAGTCTTTGAAAATTTGTTCCTTTCCTTTGAAGTCACATTTTCTTTTAAAATTTTTTGTTTTGCATCCGAAACCGAAAGAAATAAAGCGGTGGGAGGCAGGGCCATTGTGTTGAGTGGTGGCTCCTGGAGATTTGTGTGGCCCCACCCCTACCCCCGGCAACCTCAACACAGAGGCTGGGAAGGGTCGGGGGCCCCTGGAAGTAGAGCCAAGGTCCCATTCTCCTGCTTGGGTGAAGTTTGACCGGCAAGGGCGTGGCCCCCTCCATAGGGGGACGTGGCCGTCGTGGGGGACAAGGGCTGCTCTCCTGTGCCGAGTTCTCGCTCCGGGGCCCGCAGGGTTGGTGGCTGCAGTGGCAGAGCCACGGGGAAGCTGGCCACGTAGAAAACTCGGCCCAGGCGCCTGGCCACCTGCAGAGAGATGGCCTGGTGGGGGCTGTCGCACACCCACCCAGAGCTTTCTGCCCAGCTGTGGTCTGGAGACCCTGACCTCACTCCCCAGCGTCTCACCTGGGCCCGGATCTTGAGGGCGGGCCCCAGCTTCAGCCCCATGTTGGTCAGCAGGTGCTCCTCCGTCAGCAGTGGCAGGGTCTCCCCGTCGATCCCCTGCTCCCTGAAGACCTGGAAGGGGGAGGGAGGGAGGGTCAGGACCCGGGTGCTGCTCCATCCTCCAGCCCAGCAGAGCCAAGAGGAGCTGTTTGTGCAGCTCACGGAAATTTTTTAAAAACAGTGATCTGTTGGGTTTGAGTGTCAGGCAGAAAACACCTGGCAAAAAAAAAAAAAAAAAAAAAAGGTGGGGTGCACAGTCCAGTCCCTGCTGCGCACTGCTTTTGGGGAGGGAGAGGGCGTTGGGGAATGGCCCTACCCCACCATCCTTTCCAGGGAGGTAGTGACTGCCAGCCAGCTCCAGCCCCGCCCCAGGAACTGGGGCCCCCCCTTACCCGAGTGTACTCTCCACAGCCAGACAGGCCCCCCACGAAGCTGCAGACGTCATCCACGGTCCACTTGGTGACGTCCTCAGGGGCTGGGGCCTCCTCCCCATCCATGGAGAGTCCCCCTACCGCGCCTGAGTTGTGGGGCGTGTTACTAGGGCTCTGGCTGGCTTTCCCATACCCGCCCGTCTCCTGACCGTCGTGTGCCCCAATTGCTGAGGTGGGGGATGGGTGCCTAAGGGGAATTTTTCCTCCCCCCAAGACCCTTCCACAGGCGGCCTGCCAGTCCTGTGCCCTCTGGAAGGATCTAGAGTGTGGGGGTGCCCACCTGTGTGGAAGTAGGGGCTGACGGCATAAGGGAAGCCCAGGGGCAGTGTGGACCCTGGGAAAAGCCCCTTCCCCTCGGCGCCGGCCCCTCCAGCTGGAGCTTGGCCCGGAGTGGGCCCCCTGCACCCAACAGCTGCCGTCTCGGGGTCCTCTCCGTCCGAGTCTTTGGGGGGCTCGTCTTCCGAGCCATCTTGTGCCCAGAGCCTAGCCCCCGTCATCTCCTTGGACTCGCTGGGCCGCGCTGAGGCAGGGCCGGGACCCCCCTTCCGGGGGGCTCGCCGGGCAGAGTCCCGGGACGGGGTGGGGGGTCCGGAGCCCGGGGGCCCCTGGGGGGGCAGGGCCAGCAGTGGCGCCGCGCCGTGGTTCAGCACCAGCAGGGCCCCGCGCCGCTGCAG
CTCCTCGGCGCCGTCGTTGGGGCGCAGGGCGGTCTCGGGCGCCAGCAGCTGTGGGCGCGCGCTCTCCAGCTCCTTCTGCCGCAGGAGGTCGGCGGGCAGCTCCAGCCTGGGCCGGGGGCAGACGCGGGTCAGCCGCCTCCCGGGCCGCGCGGCCCCGCCCGCCTCCCCGCACCTACCGGGCCAGGTTCTGCTTCCGCAGGAGCTCCTGCTGCCAGGCGAACATCTCCGCCTGCGCGGGGGGCAGGAAGCCGTAGCCTGGTGGGGAGGGGGACAACGCGGGGTCGGGGGGTCCGAGGCGGTCCGAGCCGGCCCTTTGCAATCCCCTACCCGGTCCGTCTCGGCTCCGACAGGGGCAGGGCCGGGTCCGCGAGCCTCCACGCGGGCTCCCAGGGGATGCGCACCCGCCTCCTCACCTGGGGTCTGGCACAGAGCCGAGGGCACCCCCAGGAAGGGGGGCCTGAGATGGGGGCCCAGGGCGACGTGAGGGGCATTCTGCGGCGACAGCAAGGGGGGCGGCTGAGGCAGCTCCCTGTGGACGGCAGCGCAATGAGCGGCGTCCCCCCCGCCCCCGTTCCCGTAAAGGCCCCGGCGCCCCTCCCCCGCCGCCTCCGCTAATTGCCGGCCCCGCGGGCGGTCGATGCGCAGCTCGTTACGGCCCCAGGTCGCGCGCCATCCCCGCCCCCGCCGGCGGCGCCAATTAATCTCGGCGGCGGCGCGGAGGCGCTGACCCGGCGGGCGGCGACGGCTGCAGCGGTAATTACTGCGCGGGGCGCCGGCCCCGCCTGCTCCCCTCCCCTGCGGACCCGAGCGGCGGGGGAAGCGGGGGCGTCATTAGCCGCAATCCGGGCGGCGGTGGAGGCAGCTGCAGACGCCGGGCTCAGCGGCCGGGCAGGGACTCGGGCCTCAGGCGGCCCCTCCCTTCTCCAGGCGCCCCGCCCCGCCTCGTGTCCTGGACCCGAGAAACCCGGGTCTGTGCCGGAGGCCTTGTCTGTGCCGGGAACAGATCCGGCGCCGGCGTCTACATTTGAGCGCCAGCCAGGACCCCCCCTCCGCCTCCAGCTCTGAGTCCTCTGGCCGACCCCTCCCGACTGACCCCGGGGTTCCAGACTTGGAGCTCTCCCCTTCTCTGGCTTCTCCTCCAGGGAACCCCCTGATGGGCCTCAGCCTCCCTCACGCACCCCAGTGGCGCCCTCAACACTGCAGCCCAGATCGGTGTCACCCCCGAAACGCTCTGCCCGGACCCGAACCTAGAGCTGAACACAGTGCTTGGCACTCTCGGGGCGCGTGCTGGTGGCTGAGAGCGCGGGGGTGCGGGCGGCACCCCAGAAACAAGCAGAGAACGGCAGCCACCCCAGTACCTCTCCGAGAAGGACGGGGCGGCAGCTGGGGCGGCGCCCTCCCGGTGCTGAGCCAGGCCCTGCTTCCGACGCTGACCTGCCGTGGAGGAGGGCAGGTGGGCTTCCAGGCCACTGGGGCCCCTCAGAGCTGCAGCCGCCACCTCCTGCCGGACCCTCAGGAGATCTGGGTGGGGGACAACAGCAGTGATCCCCCCGGCTATGCGCCCTCATCCTCCCCAGCCCGCGTCCTGGCGTAGAACACTGAGGTGGGGAACTGACCACACCACTCGGCCGTGCTCGGCCCCTGGGCTGGTCTCGGGCACGGGGCAGGAGGATGGGCGGCCACAGGGCCGAGACGGCAGACAAGATGCTGGGCGAGGAAGTGTCGGGAAAAGCTCTTCCCCGAAGGGGAGCCGGGTGCCTGGAGGCTTCCGGCCTGAGAGGCGGCTGAGCCTTGCGGGGAGGGGGCTGTGCAGACGTCCCGAGGCCACAGAGCCTGCGGGGCAGCGAGAGAGTCCAGAAGTGCCAGCCAGGGGCTGACCGCCCAGGGGACCCCCTCTTCGGTGGGAGGCAAGAAAATCTGAGGTCCCCATAGTAGACGCGGCTGGCCAAAGCTTGTCACAAAGGCAAGGCAGCGCCCGCGCACGCACGCCACACTAGCAC
A

Number of lines

wc -l AG.fa
252122 AG.fa

Number of sequence lines

grep -v ">" AG.fa | wc -l
126061
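
The two counts above (252122 lines, 126061 of them sequence lines) suggest one header and one sequence line per record. A minimal Python sketch of the same check, which also reports sequence lengths, might look like the following. The function name `fasta_summary` and the returned keys are hypothetical and not part of GimmeMotifs. It also skips blank lines, which `grep -v ">"` would count.

```python
from typing import Dict, Iterable, Optional, Union


def fasta_summary(lines: Iterable[str]) -> Dict[str, Union[int, bool, Optional[int]]]:
    """Summarise a FASTA file given as an iterable of text lines.

    Hypothetical helper for illustration only; not a GimmeMotifs function.
    """
    records = 0          # number of ">" header lines
    sequence_lines = 0   # number of non-empty, non-header lines
    lengths = []         # total sequence length per record

    for raw in lines:
        line = raw.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            records += 1
            lengths.append(0)
        else:
            sequence_lines += 1
            if lengths:  # ignore sequence text that precedes any header
                lengths[-1] += len(line)

    return {
        "records": records,
        "sequence_lines": sequence_lines,
        # True when every record has exactly one sequence line on average,
        # i.e. the file is not line-wrapped
        "single_line_records": records == sequence_lines,
        "min_length": min(lengths) if lengths else None,
        "max_length": max(lengths) if lengths else None,
    }
```

It could be run on the input as `with open("AG.fa") as fh: print(fasta_summary(fh))`. The minimum and maximum lengths make it easy to compare the input regions against the `size: 200` setting shown in the log.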

gimmemotifs.log for Random background

2020-02-12 20:39:42,345 - gimme.config - DEBUG - Using multiprocessing
2020-02-12 20:39:42,345 - gimme.config - DEBUG - Parameters:
2020-02-12 20:39:42,346 - gimme.config - DEBUG -   fraction: 0.2
2020-02-12 20:39:42,346 - gimme.config - DEBUG -   use_strand: False
2020-02-12 20:39:42,346 - gimme.config - DEBUG -   abs_max: 1000
2020-02-12 20:39:42,346 - gimme.config - DEBUG -   analysis: xl
2020-02-12 20:39:42,346 - gimme.config - DEBUG -   enrichment: 1.5
2020-02-12 20:39:42,347 - gimme.config - DEBUG -   size: 200
2020-02-12 20:39:42,347 - gimme.config - DEBUG -   lsize: 500
2020-02-12 20:39:42,347 - gimme.config - DEBUG -   background: ['random']
2020-02-12 20:39:42,347 - gimme.config - DEBUG -   cluster_threshold: 0.95
2020-02-12 20:39:42,347 - gimme.config - DEBUG -   scan_cutoff: 0.9
2020-02-12 20:39:42,347 - gimme.config - DEBUG -   available_tools: MDmodule,MEME,MEMEW,DREME,Weeder,GADEM,MotifSampler,Trawler,Improbizer,BioProspector,Posmo,ChIPMunk,AMD,HMS,Homer,XXmotif,ProSampler,DiNAMO
2020-02-12 20:39:42,347 - gimme.config - DEBUG -   tools: MEME,Homer,BioProspector
2020-02-12 20:39:42,348 - gimme.config - DEBUG -   pvalue: 0.001
2020-02-12 20:39:42,348 - gimme.config - DEBUG -   max_time: -1
2020-02-12 20:39:42,348 - gimme.config - DEBUG -   ncpus: 12
2020-02-12 20:39:42,348 - gimme.config - DEBUG -   motif_db: gimme.vertebrate.v5.0.pfm
2020-02-12 20:39:42,348 - gimme.config - DEBUG -   use_cache: False
2020-02-12 20:39:42,348 - gimme.config - DEBUG -   custom_background: /scratch/generate/combined/motifs_0.99-1_out/test/generated_background.random.fa
2020-02-12 20:39:42,348 - gimme.config - DEBUG -   genome: hg19
2020-02-12 20:39:42,349 - gimme.config - DEBUG - No time limit for motif prediction
2020-02-12 20:39:42,349 - gimme.denovo - INFO - starting full motif analysis
2020-02-12 20:39:42,350 - gimme.denovo - DEBUG - Using temporary directory /tmp/gimmemotifs.265933.wih86yf9
2020-02-12 20:39:42,352 - gimme.denovo - INFO - using size of 200, set size to 0 to use original region size
2020-02-12 20:39:42,352 - gimme.denovo - INFO - preparing input from FASTA
2020-02-12 20:39:42,352 - gimme.denovo - INFO - preparing input (FASTA)
2020-02-12 20:39:42,353 - gimme.denovo - DEBUG - Splitting AG.fa into prediction set (/scratch/generate/combined/motifs_0.99-1_out/test/intermediate/prediction.fa) and validation set (/scratch/generate/combined/motifs_0.99-1_out/test/intermediate/validation.fa)
2020-02-12 20:39:49,584 - gimme.denovo - DEBUG - Random background: /scratch/generate/combined/motifs_0.99-1_out/test/intermediate/prediction.bg.fa
2020-02-12 20:40:15,442 - gimme.denovo - DEBUG - Random background: /scratch/generate/combined/motifs_0.99-1_out/test/intermediate/bg.random.fa
2020-02-12 20:40:15,550 - gimme.prediction - INFO - starting motif prediction (xl)
2020-02-12 20:40:15,550 - gimme.prediction - INFO - tools: MEME, BioProspector, Homer
2020-02-12 20:40:16,366 - gimme.prediction - DEBUG - Skipping AMD
2020-02-12 20:40:16,366 - gimme.prediction - DEBUG - Skipping GADEM
2020-02-12 20:40:16,366 - gimme.prediction - DEBUG - Skipping Improbizer
2020-02-12 20:40:16,366 - gimme.prediction - DEBUG - Skipping JASPAR
2020-02-12 20:40:16,366 - gimme.prediction - DEBUG - Skipping MEMEW
2020-02-12 20:40:16,366 - gimme.prediction - DEBUG - Skipping ProSampler
2020-02-12 20:40:16,367 - gimme.prediction - DEBUG - Skipping RPMCMC
2020-02-12 20:40:16,367 - gimme.prediction - DEBUG - Skipping trawler
2020-02-12 20:40:16,367 - gimme.prediction - DEBUG - Skipping Weeder
2020-02-12 20:40:16,367 - gimme.prediction - DEBUG - Skipping XXmotif
2020-02-12 20:40:16,367 - gimme.prediction - DEBUG - Starting BioProspector job, width 6
2020-02-12 20:40:16,367 - gimme.prediction - DEBUG - Starting BioProspector job, width 8
2020-02-12 20:40:16,367 - gimme.prediction - DEBUG - Starting BioProspector job, width 10
2020-02-12 20:40:16,371 - gimme.prediction - DEBUG - Starting BioProspector job, width 12
2020-02-12 20:40:16,372 - gimme.prediction - DEBUG - Starting BioProspector job, width 14
2020-02-12 20:40:16,372 - gimme.prediction - DEBUG - Starting BioProspector job, width 16
2020-02-12 20:40:16,372 - gimme.prediction - DEBUG - Starting BioProspector job, width 18
2020-02-12 20:40:16,373 - gimme.prediction - DEBUG - Starting BioProspector job, width 20
2020-02-12 20:40:16,376 - gimme.prediction - INFO - BioProspector_width_6 finished, found 0 motifs
2020-02-12 20:40:16,377 - gimme.prediction - DEBUG - Skipping ChIPMunk
2020-02-12 20:40:16,377 - gimme.prediction - DEBUG - stdout BioProspector_width_6: 
2020-02-12 20:40:16,377 - gimme.prediction - DEBUG - Skipping DiNAMO
2020-02-12 20:40:16,378 - gimme.prediction - DEBUG - stdout BioProspector_width_6: BioProspector_width_6 failed to run: BioProspector is not configured
2020-02-12 20:40:16,378 - gimme.prediction - DEBUG - Skipping DREME
2020-02-12 20:40:16,378 - gimme.prediction - DEBUG - Skipping HMS
2020-02-12 20:40:16,378 - gimme.prediction - INFO - BioProspector_width_8 finished, found 0 motifs
2020-02-12 20:40:16,379 - gimme.prediction - DEBUG - Starting Homer job, width 6
2020-02-12 20:40:16,379 - gimme.prediction - DEBUG - stdout BioProspector_width_8: 
2020-02-12 20:40:16,379 - gimme.prediction - DEBUG - Starting Homer job, width 8
2020-02-12 20:40:16,380 - gimme.prediction - DEBUG - stdout BioProspector_width_8: BioProspector_width_8 failed to run: BioProspector is not configured
2020-02-12 20:40:16,380 - gimme.prediction - DEBUG - Starting Homer job, width 10
2020-02-12 20:40:16,381 - gimme.prediction - DEBUG - Starting Homer job, width 12
2020-02-12 20:40:16,381 - gimme.prediction - INFO - BioProspector_width_10 finished, found 0 motifs
2020-02-12 20:40:16,381 - gimme.prediction - DEBUG - Starting Homer job, width 14
2020-02-12 20:40:16,381 - gimme.prediction - DEBUG - stdout BioProspector_width_10: 
2020-02-12 20:40:16,381 - gimme.prediction - DEBUG - Starting Homer job, width 16
2020-02-12 20:40:16,382 - gimme.prediction - DEBUG - stdout BioProspector_width_10: BioProspector_width_10 failed to run: BioProspector is not configured
2020-02-12 20:40:16,382 - gimme.prediction - DEBUG - Starting Homer job, width 18
2020-02-12 20:40:16,382 - gimme.prediction - INFO - BioProspector_width_12 finished, found 0 motifs
2020-02-12 20:40:16,383 - gimme.prediction - DEBUG - Starting Homer job, width 20
2020-02-12 20:40:16,383 - gimme.prediction - DEBUG - stdout BioProspector_width_12: 
2020-02-12 20:40:16,383 - gimme.prediction - DEBUG - Skipping MDmodule
2020-02-12 20:40:16,383 - gimme.prediction - DEBUG - stdout BioProspector_width_12: BioProspector_width_12 failed to run: BioProspector is not configured
2020-02-12 20:40:16,384 - gimme.prediction - DEBUG - Starting MEME job, width 6
2020-02-12 20:40:16,384 - gimme.prediction - DEBUG - Starting MEME job, width 8
2020-02-12 20:40:16,384 - gimme.prediction - INFO - BioProspector_width_14 finished, found 0 motifs
2020-02-12 20:40:16,384 - gimme.prediction - DEBUG - Starting MEME job, width 10
2020-02-12 20:40:16,385 - gimme.prediction - DEBUG - stdout BioProspector_width_14: 
2020-02-12 20:40:16,385 - gimme.prediction - DEBUG - Starting MEME job, width 12
2020-02-12 20:40:16,385 - gimme.prediction - DEBUG - stdout BioProspector_width_14: BioProspector_width_14 failed to run: BioProspector is not configured
2020-02-12 20:40:16,385 - gimme.prediction - DEBUG - Starting MEME job, width 14
2020-02-12 20:40:16,386 - gimme.prediction - INFO - BioProspector_width_16 finished, found 0 motifs
2020-02-12 20:40:16,386 - gimme.prediction - DEBUG - Starting MEME job, width 16
2020-02-12 20:40:16,386 - gimme.prediction - DEBUG - stdout BioProspector_width_16: 
2020-02-12 20:40:16,386 - gimme.prediction - DEBUG - Starting MEME job, width 18
2020-02-12 20:40:16,387 - gimme.prediction - DEBUG - stdout BioProspector_width_16: BioProspector_width_16 failed to run: BioProspector is not configured
2020-02-12 20:40:16,387 - gimme.prediction - DEBUG - Starting MEME job, width 20
2020-02-12 20:40:16,387 - gimme.prediction - DEBUG - Skipping MotifSampler
2020-02-12 20:40:16,387 - gimme.prediction - INFO - BioProspector_width_18 finished, found 0 motifs
2020-02-12 20:40:16,388 - gimme.prediction - DEBUG - Skipping Posmo
2020-02-12 20:40:16,388 - gimme.prediction - DEBUG - stdout BioProspector_width_18: 
2020-02-12 20:40:16,388 - gimme.prediction - DEBUG - Skipping YAMDA
2020-02-12 20:40:16,388 - gimme.prediction - DEBUG - stdout BioProspector_width_18: BioProspector_width_18 failed to run: BioProspector is not configured
2020-02-12 20:40:16,388 - gimme.prediction - INFO - all jobs submitted
2020-02-12 20:40:16,389 - gimme.prediction - INFO - BioProspector_width_20 finished, found 0 motifs
2020-02-12 20:40:16,389 - gimme.prediction - DEBUG - stdout BioProspector_width_20: 
2020-02-12 20:40:16,390 - gimme.prediction - DEBUG - stdout BioProspector_width_20: BioProspector_width_20 failed to run: BioProspector is not configured
2020-02-12 20:40:16,390 - gimme.prediction - INFO - Homer_width_6 finished, found 0 motifs
2020-02-12 20:40:16,390 - gimme.prediction - DEBUG - stdout Homer_width_6: 
2020-02-12 20:40:16,390 - gimme.prediction - DEBUG - stdout Homer_width_6: Homer_width_6 failed to run: Homer is not configured
2020-02-12 20:40:16,390 - gimme.prediction - INFO - Homer_width_8 finished, found 0 motifs
2020-02-12 20:40:16,391 - gimme.prediction - DEBUG - stdout Homer_width_8: 
2020-02-12 20:40:16,391 - gimme.prediction - DEBUG - stdout Homer_width_8: Homer_width_8 failed to run: Homer is not configured
2020-02-12 20:40:16,391 - gimme.prediction - INFO - Homer_width_14 finished, found 0 motifs
2020-02-12 20:40:16,391 - gimme.prediction - DEBUG - stdout Homer_width_14: 
2020-02-12 20:40:16,392 - gimme.prediction - DEBUG - stdout Homer_width_14: Homer_width_14 failed to run: Homer is not configured
2020-02-12 20:40:16,392 - gimme.prediction - INFO - Homer_width_12 finished, found 0 motifs
2020-02-12 20:40:16,392 - gimme.prediction - DEBUG - stdout Homer_width_12: 
2020-02-12 20:40:16,392 - gimme.prediction - DEBUG - stdout Homer_width_12: Homer_width_12 failed to run: Homer is not configured
2020-02-12 20:40:16,393 - gimme.prediction - INFO - Homer_width_10 finished, found 0 motifs
2020-02-12 20:40:16,393 - gimme.prediction - DEBUG - stdout Homer_width_10: 
2020-02-12 20:40:16,393 - gimme.prediction - DEBUG - stdout Homer_width_10: Homer_width_10 failed to run: Homer is not configured
2020-02-12 20:40:16,393 - gimme.prediction - INFO - Homer_width_16 finished, found 0 motifs
2020-02-12 20:40:16,394 - gimme.prediction - DEBUG - stdout Homer_width_16: 
2020-02-12 20:40:16,394 - gimme.prediction - DEBUG - stdout Homer_width_16: Homer_width_16 failed to run: Homer is not configured
2020-02-12 20:40:16,394 - gimme.prediction - INFO - Homer_width_18 finished, found 0 motifs
2020-02-12 20:40:16,394 - gimme.prediction - DEBUG - stdout Homer_width_18: 
2020-02-12 20:40:16,394 - gimme.prediction - DEBUG - stdout Homer_width_18: Homer_width_18 failed to run: Homer is not configured
2020-02-12 20:40:16,395 - gimme.prediction - INFO - Homer_width_20 finished, found 0 motifs
2020-02-12 20:40:16,395 - gimme.prediction - DEBUG - stdout Homer_width_20: 
2020-02-12 20:40:16,395 - gimme.prediction - DEBUG - stdout Homer_width_20: Homer_width_20 failed to run: Homer is not configured
2020-02-12 20:40:16,395 - gimme.prediction - INFO - MEME_width_6 finished, found 0 motifs
2020-02-12 20:40:16,396 - gimme.prediction - DEBUG - stdout MEME_width_6: 
2020-02-12 20:40:16,396 - gimme.prediction - DEBUG - stdout MEME_width_6: MEME_width_6 failed to run: MEME is not configured
2020-02-12 20:40:16,396 - gimme.prediction - INFO - MEME_width_8 finished, found 0 motifs
2020-02-12 20:40:16,396 - gimme.prediction - DEBUG - stdout MEME_width_8: 
2020-02-12 20:40:16,397 - gimme.prediction - DEBUG - stdout MEME_width_8: MEME_width_8 failed to run: MEME is not configured
2020-02-12 20:40:16,397 - gimme.prediction - INFO - MEME_width_10 finished, found 0 motifs
2020-02-12 20:40:16,397 - gimme.prediction - DEBUG - stdout MEME_width_10: 
2020-02-12 20:40:16,397 - gimme.prediction - DEBUG - stdout MEME_width_10: MEME_width_10 failed to run: MEME is not configured
2020-02-12 20:40:16,398 - gimme.prediction - INFO - MEME_width_12 finished, found 0 motifs
2020-02-12 20:40:16,398 - gimme.prediction - DEBUG - stdout MEME_width_12: 
2020-02-12 20:40:16,398 - gimme.prediction - DEBUG - stdout MEME_width_12: MEME_width_12 failed to run: MEME is not configured
2020-02-12 20:40:16,398 - gimme.prediction - INFO - MEME_width_14 finished, found 0 motifs
2020-02-12 20:40:16,399 - gimme.prediction - DEBUG - stdout MEME_width_14: 
2020-02-12 20:40:16,399 - gimme.prediction - DEBUG - stdout MEME_width_14: MEME_width_14 failed to run: MEME is not configured
2020-02-12 20:40:16,399 - gimme.prediction - INFO - MEME_width_16 finished, found 0 motifs
2020-02-12 20:40:16,399 - gimme.prediction - DEBUG - stdout MEME_width_16: 
2020-02-12 20:40:16,399 - gimme.prediction - DEBUG - stdout MEME_width_16: MEME_width_16 failed to run: MEME is not configured
2020-02-12 20:40:16,400 - gimme.prediction - INFO - MEME_width_18 finished, found 0 motifs
2020-02-12 20:40:16,400 - gimme.prediction - DEBUG - stdout MEME_width_18: 
2020-02-12 20:40:16,400 - gimme.prediction - DEBUG - stdout MEME_width_18: MEME_width_18 failed to run: MEME is not configured
2020-02-12 20:40:16,400 - gimme.prediction - INFO - MEME_width_20 finished, found 0 motifs
2020-02-12 20:40:16,401 - gimme.prediction - DEBUG - stdout MEME_width_20: 
2020-02-12 20:40:16,401 - gimme.prediction - DEBUG - stdout MEME_width_20: MEME_width_20 failed to run: MEME is not configured
2020-02-12 20:40:16,401 - gimme.prediction - DEBUG - waiting for statistics to finish
2020-02-12 20:40:18,404 - gimme.prediction - INFO - predicted 0 motifs
2020-02-12 20:40:18,406 - gimme.prediction - DEBUG - written to /scratch/generate/combined/motifs_0.99-1_out/test/intermediate/all_motifs.pfm
2020-02-12 20:40:18,406 - gimme.prediction - INFO - no motifs found
2020-02-12 20:40:18,406 - gimme.denovo - INFO - finished
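
For readers skimming the log above: the repeated "failed to run: ... is not configured" entries can be tallied per tool with a few lines of Python. This is only a sketch. The function name unconfigured_tools and the sample list are made up for illustration, and the pattern assumes the message wording shown in this log.

```python
import re
from collections import Counter

# Assumed message format, copied from the DEBUG lines in the log above:
# "<job> failed to run: <tool> is not configured"
NOT_CONFIGURED = re.compile(r"failed to run: (\w+) is not configured")


def unconfigured_tools(log_lines):
    """Count, per tool, how many jobs failed because the tool was not configured."""
    counts = Counter()
    for line in log_lines:
        match = NOT_CONFIGURED.search(line)
        if match:
            counts[match.group(1)] += 1
    return counts


# A few lines taken from the log above (hypothetical sample for the sketch).
sample = [
    "2020-02-12 20:40:16,378 - gimme.prediction - DEBUG - stdout BioProspector_width_6: "
    "BioProspector_width_6 failed to run: BioProspector is not configured",
    "2020-02-12 20:40:16,390 - gimme.prediction - DEBUG - stdout Homer_width_6: "
    "Homer_width_6 failed to run: Homer is not configured",
    "2020-02-12 20:40:16,396 - gimme.prediction - DEBUG - stdout MEME_width_6: "
    "MEME_width_6 failed to run: MEME is not configured",
    "2020-02-12 20:40:18,404 - gimme.prediction - INFO - predicted 0 motifs",
]

for tool, n_jobs in sorted(unconfigured_tools(sample).items()):
    print(tool, n_jobs)
```

To run it on a real file, pass an open file handle for gimmemotifs.log instead of the sample list.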

gimmemotifs.log for GC background

No gimmemotifs.log file

simonvh commented 4 years ago

Ok, two points that I see.

1) There seems to be something wrong with the configuration of the motif prediction tools. Normally gimme initializes this on the first run. You are now running in a conda environment, right? Can you try to delete ~/.config/gimmemotifs/gimmemotifs.cfg and then run gimme motifs again? Can you also download this file and run gimme motifs on it to see if it works?

2) I'm afraid your input sequence set is too large for de novo motif prediction, both in the size of the regions and in their number. Motif prediction works best on smaller regions, say 100-1000 bp. Beyond that, performance usually deteriorates quickly (and running time increases). Second, the number of regions is quite large. By default, gimme motifs selects only 1,000 regions for de novo motif prediction, but it calculates statistics and enrichment on all of them, which will take a very long time. If possible, I would try to get your input set down to 10,000 regions of at most 500 bp each.
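
One way to get an input set down to that size is sketched below in plain Python. It is not part of GimmeMotifs: the helper names read_fasta, trim_center and shrink are made up for illustration. It assumes that keeping the first 10,000 records is acceptable (a ranked or random subset may suit your data better) and that trimming around the midpoint of each sequence is reasonable. Headers are kept unchanged, so any coordinates in them will no longer match the trimmed sequence.

```python
def read_fasta(lines):
    """Yield (header, sequence) pairs from an iterable of FASTA lines."""
    header, chunks = None, []
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        else:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def trim_center(seq, max_len=500):
    """Keep at most max_len bases around the midpoint of seq."""
    if len(seq) <= max_len:
        return seq
    start = (len(seq) - max_len) // 2
    return seq[start:start + max_len]


def shrink(records, max_regions=10000, max_len=500):
    """Keep the first max_regions records, each trimmed to max_len bases."""
    kept = []
    for i, (header, seq) in enumerate(records):
        if i >= max_regions:
            break
        kept.append((header, trim_center(seq, max_len)))
    return kept


# Tiny in-memory example; for a real file, pass an open file handle to read_fasta.
example = [">a", "ACGT", "ACGT", ">b", "TT"]
for header, seq in shrink(read_fasta(example), max_regions=10000, max_len=4):
    print(">" + header)
    print(seq)
```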

mikecormier commented 4 years ago

Hi @simonvh,

After recreating the gimmemotifs.cfg file I am now getting a new error:

2020-02-18 02:42:09,693 - INFO - starting motif prediction (xl)
2020-02-18 02:42:09,700 - INFO - tools: MEME, BioProspector, Homer
2020-02-18 02:49:13,507 - INFO - all jobs submitted
2020-02-18 02:51:34,360 - INFO - Homer_width_6 finished, found 5 motifs
2020-02-18 02:52:25,617 - INFO - Homer_width_8 finished, found 5 motifs
2020-02-18 02:55:04,288 - INFO - Homer_width_10 finished, found 5 motifs
2020-02-18 03:32:14,956 - INFO - Homer_width_16 finished, found 5 motifs
2020-02-18 03:37:49,282 - INFO - Homer_width_12 finished, found 5 motifs
2020-02-18 03:38:21,142 - INFO - Homer_width_14 finished, found 5 motifs
2020-02-18 03:44:24,751 - INFO - MEME_width_6 finished, found 10 motifs
2020-02-18 03:44:57,824 - INFO - MEME_width_8 finished, found 10 motifs
2020-02-18 03:50:54,469 - INFO - MEME_width_10 finished, found 10 motifs
2020-02-18 03:51:15,927 - INFO - MEME_width_12 finished, found 10 motifs
2020-02-18 03:57:17,854 - INFO - MEME_width_14 finished, found 10 motifs
2020-02-18 03:57:43,490 - INFO - MEME_width_16 finished, found 10 motifs
2020-02-18 04:03:37,277 - INFO - MEME_width_18 finished, found 10 motifs
2020-02-18 04:04:00,668 - INFO - MEME_width_20 finished, found 10 motifs
2020-02-18 04:15:10,326 - INFO - Homer_width_18 finished, found 5 motifs
2020-02-18 04:25:16,733 - INFO - BioProspector_width_6 finished, found 5 motifs
2020-02-18 04:38:13,157 - INFO - BioProspector_width_8 finished, found 5 motifs
2020-02-18 04:53:53,012 - INFO - BioProspector_width_10 finished, found 5 motifs
2020-02-18 05:08:57,453 - INFO - BioProspector_width_12 finished, found 5 motifs
2020-02-18 05:21:18,184 - INFO - BioProspector_width_14 finished, found 5 motifs
2020-02-18 05:34:35,552 - INFO - BioProspector_width_16 finished, found 5 motifs
2020-02-18 05:49:17,098 - INFO - BioProspector_width_18 finished, found 5 motifs
2020-02-18 06:02:10,619 - INFO - BioProspector_width_20 finished, found 5 motifs
2020-02-18 07:00:52,947 - INFO - Homer_width_20 finished, found 5 motifs
Traceback (most recent call last):
  File "/scratch/miniconda3/envs/gimme/bin/gimme", line 11, in <module>
    cli(sys.argv[1:])
  File "/scratch/miniconda3/envs/gimme/lib/python3.6/site-packages/gimmemotifs/cli.py", line 625, in cli  
    args.func(args)
  File "/scratch/miniconda3/envs/gimme/lib/python3.6/site-packages/gimmemotifs/commands/motifs.py", line 94, in motifs
    "size": args.size,
  File "/scratch/miniconda3/envs/gimme/lib/python3.6/site-packages/gimmemotifs/denovo.py", line 619, in gimme_motifs
    stats_bg=background,
  File "/scratch/miniconda3/envs/gimme/lib/python3.6/site-packages/gimmemotifs/prediction.py", line 372, in predict_motifs
    stats_bg=stats_bg,
  File "/scratch/miniconda3/envs/gimme/lib/python3.6/site-packages/gimmemotifs/prediction.py", line 320, in pp_predict_motifs
    result.wait_for_stats()
  File "/scratch/miniconda3/envs/gimme/lib/python3.6/site-packages/gimmemotifs/prediction.py", line 178, in wait_for_stats
    job.get()
  File "/scratch/miniconda3/envs/gimme/lib/python3.6/multiprocessing/pool.py", line 670, in get
    raise self._value
  File "/scratch/miniconda3/envs/gimme/lib/python3.6/multiprocessing/pool.py", line 450, in _handle_tasks 
    put(task)
  File "/scratch/miniconda3/envs/gimme/lib/python3.6/multiprocessing/connection.py", line 206, in send
    self._send_bytes(_ForkingPickler.dumps(obj))
  File "/scratch/miniconda3/envs/gimme/lib/python3.6/multiprocessing/connection.py", line 393, in _send_bytes
    header = struct.pack("!i", n)
struct.error: 'i' format requires -2147483648 <= number <= 2147483647

simonvh commented 4 years ago

This means that you should decrease the number of input sequences. It is a bug that should be solved with a newer version of Python. However, it is probably best anyway to use a limited set of sequences (fewer than ~100k) as input.
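
For context, the traceback above ends in multiprocessing/connection.py, where the length of the pickled message is packed with struct.pack("!i", n), that is, as a signed 32-bit integer. Anything larger than 2147483647 bytes cannot be represented and raises the struct.error shown. A minimal sketch of that limit (the helper name fits_in_int32_header is made up for illustration and is not part of Python or GimmeMotifs):

```python
import struct

INT32_MAX = 2**31 - 1  # 2147483647, the upper bound quoted in the error message


def fits_in_int32_header(n_bytes):
    """Return True if a message of n_bytes can be described by a '!i' header."""
    try:
        struct.pack("!i", n_bytes)
        return True
    except struct.error:
        return False


print(fits_in_int32_header(INT32_MAX))
print(fits_in_int32_header(INT32_MAX + 1))
```

Reducing the number of input sequences keeps the objects passed between worker processes below this size.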

andrewbcaldwell commented 4 years ago

I am getting a similar error when running gimme motifs on my own BED or FASTA files, so I tried the following:

genomepy install hg38 UCSC --annotation
gimme motifs --known -g hg38 TAp73alpha.fa ./test

yet I still get the following error:

2020-05-07 16:30:52,415 - INFO - No config found.
2020-05-07 16:30:52,416 - INFO - Creating new config.
2020-05-07 16:30:52,429 - INFO - Using included version of MDmodule.
2020-05-07 16:30:52,440 - INFO - Using system version of MEME.
2020-05-07 16:30:52,446 - INFO - Using system version of MEMEW.
2020-05-07 16:30:52,452 - INFO - Using system version of DREME.
2020-05-07 16:30:52,457 - INFO - Using system version of Weeder.
2020-05-07 16:30:52,463 - INFO - Using system version of GADEM.
2020-05-07 16:30:52,463 - INFO - Using included version of MotifSampler.
2020-05-07 16:30:52,468 - INFO - Using system version of Trawler.
2020-05-07 16:30:52,468 - INFO - Using included version of Improbizer.
2020-05-07 16:30:52,469 - INFO - Using included version of BioProspector.
2020-05-07 16:30:52,469 - INFO - Using included version of Posmo.
2020-05-07 16:30:52,470 - INFO - Using included version of ChIPMunk.
2020-05-07 16:30:52,470 - INFO - Using included version of AMD.
2020-05-07 16:30:52,470 - INFO - Using included version of HMS.
2020-05-07 16:30:52,477 - INFO - Using system version of Homer.
2020-05-07 16:30:52,486 - INFO - Using system version of XXmotif.
2020-05-07 16:30:52,494 - INFO - Using system version of ProSampler.
2020-05-07 16:30:52,494 - WARNING - Yamda not in config
2020-05-07 16:30:52,500 - INFO - Using system version of DiNAMO.
2020-05-07 16:30:52,513 - WARNING - RPMCMC not found. To include it you will have to install it.
2020-05-07 16:30:52,546 - INFO - Configuration file: /home/abcaldwe/.config/gimmemotifs/gimmemotifs.cfg
2020-05-07 16:30:53,004 - INFO - creating background (matched GC%)
2020-05-07 16:30:53,051 - INFO - Creating index for genomic GC frequencies.
Traceback (most recent call last):
  File "/home/abcaldwe/anaconda3/envs/gimmemotifs/bin/gimme", line 11, in <module>
    cli(sys.argv[1:])
  File "/home/abcaldwe/anaconda3/envs/gimmemotifs/lib/python3.7/site-packages/gimmemotifs/cli.py", line 625, in cli
    args.func(args)
  File "/home/abcaldwe/anaconda3/envs/gimmemotifs/lib/python3.7/site-packages/gimmemotifs/commands/motifs.py", line 75, in motifs
    number=10000,
  File "/home/abcaldwe/anaconda3/envs/gimmemotifs/lib/python3.7/site-packages/gimmemotifs/background.py", line 122, in create_background_file
    m = MatchedGcFasta(inputfile, genome, number=number, size=size)
  File "/home/abcaldwe/anaconda3/envs/gimmemotifs/lib/python3.7/site-packages/gimmemotifs/background.py", line 559, in __init__
    matched_gc_bedfile(tmpbed, matchfile, genome, number, size=size)
  File "/home/abcaldwe/anaconda3/envs/gimmemotifs/lib/python3.7/site-packages/gimmemotifs/background.py", line 523, in matched_gc_bedfile
    min_bin_size=min_bin_size,
  File "/home/abcaldwe/anaconda3/envs/gimmemotifs/lib/python3.7/site-packages/gimmemotifs/background.py", line 398, in gc_bin_bedfile
    create_gc_bin_index(genome, fname, min_bin_size=min_bin_size)
  File "/home/abcaldwe/anaconda3/envs/gimmemotifs/lib/python3.7/site-packages/gimmemotifs/background.py", line 363, in create_gc_bin_index
    df.reset_index()[cols].to_feather(fname)
  File "/home/abcaldwe/.local/lib/python3.7/site-packages/pandas/util/_decorators.py", line 214, in wrapper
    return func(*args, **kwargs)
  File "/home/abcaldwe/.local/lib/python3.7/site-packages/pandas/core/frame.py", line 1994, in to_feather
    to_feather(self, path)
  File "/home/abcaldwe/.local/lib/python3.7/site-packages/pandas/io/feather_format.py", line 64, in to_feather
    feather.write_feather(df, path)
  File "/home/abcaldwe/anaconda3/envs/gimmemotifs/lib/python3.7/site-packages/pyarrow/feather.py", line 180, in write_feather
    writer.write(df)
  File "/home/abcaldwe/anaconda3/envs/gimmemotifs/lib/python3.7/site-packages/pyarrow/feather.py", line 91, in write
    table = Table.from_pandas(df, preserve_index=False)
  File "pyarrow/table.pxi", line 1139, in pyarrow.lib.Table.from_pandas
  File "/home/abcaldwe/anaconda3/envs/gimmemotifs/lib/python3.7/site-packages/pyarrow/pandas_compat.py", line 474, in dataframe_to_arrays
    convert_types))
  File "/home/abcaldwe/anaconda3/envs/gimmemotifs/lib/python3.7/concurrent/futures/_base.py", line 598, in result_iterator
    yield fs.pop().result()
  File "/home/abcaldwe/anaconda3/envs/gimmemotifs/lib/python3.7/concurrent/futures/_base.py", line 428, in result
    return self.__get_result()
  File "/home/abcaldwe/anaconda3/envs/gimmemotifs/lib/python3.7/concurrent/futures/_base.py", line 384, in __get_result
    raise self._exception
  File "/home/abcaldwe/anaconda3/envs/gimmemotifs/lib/python3.7/concurrent/futures/thread.py", line 57, in run
    result = self.fn(*self.args, **self.kwargs)
  File "/home/abcaldwe/anaconda3/envs/gimmemotifs/lib/python3.7/site-packages/pyarrow/pandas_compat.py", line 463, in convert_column
    raise e
  File "/home/abcaldwe/anaconda3/envs/gimmemotifs/lib/python3.7/site-packages/pyarrow/pandas_compat.py", line 457, in convert_column
    return pa.array(col, type=ty, from_pandas=True, safe=safe)
  File "pyarrow/array.pxi", line 169, in pyarrow.lib.array
  File "pyarrow/array.pxi", line 74, in pyarrow.lib._ndarray_to_array
  File "pyarrow/array.pxi", line 62, in pyarrow.lib._ndarray_to_type
  File "pyarrow/error.pxi", line 91, in pyarrow.lib.check_status
pyarrow.lib.ArrowTypeError: ('Did not pass numpy.dtype object', 'Conversion failed for column chrom with type string')

I suspected that installing the hg38 genome annotation with genomepy might have failed, so I downloaded hg38.fa and the chrom.sizes file from UCSC directly, but I get the same error.

siebrenf commented 4 years ago

@andrewbcaldwell, I encountered your error before. It was solved by updating pyarrow in the gimmemotifs environment.

Try conda update pyarrow

For me, pyarrow 0.13.0 was installed originally. Updating it to 0.16.0 got me past that error.
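
To see which pyarrow version an environment actually has, a quick check along these lines may help. It is a sketch only: the helper names version_tuple and pyarrow_is_new_enough are made up for illustration, and the 0.16.0 threshold is simply the version reported to work in this thread, not an officially documented minimum.

```python
import re

MIN_PYARROW = (0, 16, 0)  # version reported in this thread to get past the error


def version_tuple(version):
    """Turn a version string such as '0.16.0' into a tuple of up to three ints."""
    return tuple(int(number) for number in re.findall(r"\d+", version)[:3])


def pyarrow_is_new_enough(version, minimum=MIN_PYARROW):
    """Compare a version string against the assumed minimum."""
    return version_tuple(version) >= minimum


try:
    import pyarrow
    installed = pyarrow.__version__
except ImportError:
    installed = None  # pyarrow is not installed in this environment

if installed is None:
    print("pyarrow is not installed")
elif pyarrow_is_new_enough(installed):
    print("pyarrow", installed, "is at least 0.16.0")
else:
    print("pyarrow", installed, "is older than 0.16.0")
```

Run it with the Python interpreter of the gimmemotifs environment, since the traceback above shows pandas being loaded from ~/.local while pyarrow comes from the conda environment.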

andrewbcaldwell commented 4 years ago

@siebrenf Thanks for the tip! I had tried conda update pyarrow earlier to no avail, but forcing the update to version 0.16.0 with conda update pyarrow=0.16.0 solved the issue.

simonvh commented 4 years ago

Thanks for reporting this @andrewbcaldwell and for the fix @siebrenf. I'll update the conda package to reflect this dependency!