Closed zhangyixing3 closed 3 months ago
Your steps look okay. If I try to do something similar on the yeast test data included in cactus
cactus-pangenome ./js ./examples/yeastPangenome.txt --outDir yeast-filter-test --outName yeast --reference S288C --gfa clip filter
cd yeast-filter-test
zcat yeast.gfa.gz | vg clip -d 2 - -P S288C -m 1000 | vg clip -d 1 -P S288C - | vg clip -sS - -P S288C > yeast.myd2.gfa
zcat yeast.d2.gfa.gz > yeast.d2.gfa
Then the two files have equivalent stats
vg stats -lz yeast.d2.gfa 2> /dev/null
nodes 452349
edges 551752
length 12417384
vg stats -lz yeast.myd2.gfa 2> /dev/null
nodes 452349
edges 551752
length 12417384
along with path names and lengths
vg paths -Ex yeast.d2.gfa 2> /dev/null | sort > yeast.d2.paths
vg paths -Ex yeast.myd2.gfa 2> /dev/null | sort > yeast.myd2.paths
diff -s yeast.d2.paths yeast.myd2.paths
Files yeast.d2.paths and yeast.myd2.paths are identical
Thanks for your quickly reply,I'll check my steps.
You were right, 97samples.d2.gfa and 97samples.d2.vgclip.gfa do have equivalent stats. I just forgot to sort the paths. And yesterday, when I was using odgi, I completely forgot to convert the GFA 1.1 to GFA 1.0. I feel like such an idiot. Thank you !
Dear Sir, I compared the d2 index obtained by default from minigraph cactus with the d2 index I built using vg, but my graph pangenome results seem incorrect.
The above result is inconsistent with the default output of minigraph cactus
The differences between 97samples.d2.gfa and 97samples.d2.vgclip.gfa derived from the same clip pangenome are significant. Am I missing some steps in creating the filtered pangenome? I found that the filtered pangenome index I created cannot be operated on by odgi (neither odgi paths nor odgi stats work), while the default d2 pangenome works fine.