Closed 6br closed 4 years ago
I ran into the same issue yesterday. I suspect that you did not use the current master of odgi
to build the graph?
After that, all went fine. @josiahseaman replaced the median nucleotide position with the actual starting and ending positions. And I suspect you used the version which still outputs the median nucleotide position. Please report back.
Thank you, @subwaystation . I used https://github.com/graph-genome/odgi.git
in graph-genome/pipeline. Now I switch to https://github.com/vgteam/odgi.git
.
Another question: What does it mean bin-size in component_segmentation? in other words, what is better number on bin-size? I used to set it as the same as -bin-width (in Schematize).
So the idea is that we present Schematize the bin width in the file output of cs
(component_segmentation). Therefore, the user should enter this as an argument.
Yes, it is the same as --bin-width in Schematize and odgi. But in the future, we will read this in.
Maybe we can have a short chat today, so I can update you on our plans ;)
I confused that because the option name is different between odgi (bin-width) and cs
(bin-size). But now I completely understand. Thank you!
I tried it on vgteam/odgi master HEAD, but the same error is shown.
Currently taking a look.
@6br Component_segmentation is made to work with the latest odgi with first_nucleotide and last_nucleotide as you requested. Here's our fork: https://github.com/graph-genome/odgi
It was merged to vgteam master: https://github.com/vgteam/odgi/pull/79
So if your code is older than Feb 20 you might not have gotten it. https://github.com/vgteam/odgi/commits/master
@6br was using our latest pulls (odgi
, component_segmentation
) on pg2. So it should have worked. I can reproduce the error.
Please take a look at /home/ubuntu/sh/test_data3_ty
and the scripts in there to reproduce.
If I understand it right, this error occurs during the calculation of the links.
Interestingly, this problem was not present when munching the phage data.
Actually, this is my fault. There's an edge case that only comes up in TY's data.
these_comp = self.components[cut:end_cut]
if these_comp: # when we're out of data because the last component is 1 wide
partitions.append(
PangenomeSchematic(JSON_VERSION,
self.bin_size,
these_comp[0].first_bin,
these_comp[-1].last_bin,
these_comp, self.path_names, self.total_nr_files))
bin2file_mapping.append({"first_bin": these_comp[0].first_bin, "file": self.filename(i)})
I run component_segmentation of the latest master HEAD on data3.gfa (located on /home/ubuntu/ty/test-pipeline/)