Open Cheryn-A opened 4 years ago
@Cheryn-A hello and sorry for the very late reply. I have notifications setup for this repository and to my surprise, I did not get one for this issue.
It looks like you are having issues with Genomedata creation. Notably, at some point in your steps, there seems to be a lack of data. "Empty windows" typically means there's no data to work with. The error can't tie one track in multiple groups
I believe might indicate that you're accidentally using the same trackname in each of your genomedata archives.
With genomedata-open-data
, the tracknames
option should be trackname
with no space between it and the hyphen and the positional arguments should come after the options (though I don't really know if it makes a difference):
genomedata-open-data --trackname s1_norm s1_norm.genomedata
.
The rest of your commands look correct. I would also ensure that you choose a different trackname across your archives so Segway can uniquely determine your datasets.
Most of the commands also come with a verbosity option that I would highly recommend using when debugging these issues. For example, it would help verify how much data is being loaded.
For general troubleshooting on Segway issues I would highly recommend e-mailing to the mailing list: segway-l@listserv.utoronto.ca to reach a larger audience potentially.
Hello, I am facing the same issue "ValueError: Set of training windows is empty". I did take into account what you mentioned about the genomedata file creation in your response, but that still does not help. Any help with this would help. Thanks.
@varsha090597 could please put in your segway train command?
Although it is hard to tell, the most likely situation is that there is actually no data in your genomedata archives. Perhaps try creating them again with the --verbose option to verify data is being loaded in.
Hello, I am having troubles running segway, here is my situation. I wish to use segway with 6 ChIP-seq tracks. I started from raw data in fastq format of each signal and its associated input (s1.fastq , s1_input.fastq, ... , s6.fastq , s6_input.fastq)
In here https://www.biorxiv.org/content/10.1101/080382v1.full you mention at some point "The preferred input for Segway is the “fold change over control” bigWig signal file, because it is already processed and normalized." so here is the full pipeline I followed :
1-Pre-processing each fastq file:
2-Generating bw files with deepTools-3.0.2:
(Output: s1_norm.bw ... s6_norm.bw)
3-Converting bigwig into bedGraph
(Output: s1_norm.bedGraph ... s6_norm.bedGraph)
4-Generating genomedata files with genomedata-1.4.4:
(Output: s1_norm.genomedata ... s6_norm.genomedata)
5-Running segway
This gave me the following error:
So I tried to follow the exemple in https://segway.readthedocs.io/en/latest/quick.html#acquiring-data and to run it with less parameters and on one genomedata file:
And I got this error:
When I try to run it with the test.genomedata given here https://segway.readthedocs.io/en/latest/quick.html#acquiring-data it works well.
So I am a bit confused by the error messages and I do not know how to solve this.
Could you please help me? Did I do something wrong while generating my genomedata? Do I correctly use the command line to run segway?
I thank you in advance for your answers.
PS: I am working on Ubuntu 18.04.5