hoffmangroup / genomedata

The Genomedata format for storing large-scale functional genomics data.
https://genomedata.hoffmanlab.org/
GNU General Public License v2.0
2 stars 1 forks source link

Empty archives when created from an assembly #22

Closed EricR86 closed 5 years ago

EricR86 commented 8 years ago

Original report (archived issue) by Mickaël Mendez (Bitbucket: Mickael Mendez).

The original report had attachments: concatenated_agp_file.agp.gz


Creating an archive from an assembly results in an almost empty archive.

I tried with the genomedata-load and the four steps commands but the archives are always the same size of 65M.

command used:

genomedata-load -s ./concatenated_agp_file.agp -t my_track=my_signal_file.bg --assembly my_archive

The genomedata exists but it seems that the assembly file could not be parsed properly and the open_chromosome field contains the name of the assembly file instead of the chromosome names:

>>> print g.open_chromosomes
{'concatenated_agp_file': <Chromosome 'concatenated_agp_file', file='genomedata/my_archive/concatenated_agp_file.genomedata'>}

Note: the assembly file (attached) was generated by concatenating the reference agp files from the ncbi ftp, after filtering out the comment lines starting with the character '#'

EricR86 commented 6 years ago

Original comment by Mickaël Mendez (Bitbucket: Mickael Mendez).


@ericr86 I think that this was solved in one of the recent PR, could you confirm that?

EricR86 commented 5 years ago

Original comment by Eric Roberts (Bitbucket: ericr86, GitHub: ericr86).


This should have been resolved in the 1.4.0 release