Closed skunklem closed 2 years ago
Hi! we're currently heavily editing and improving the analysis pipeline.
We have a proof of concept inside our V-pipe pipeline: signature.smk (for now it's still hard-coded, it's not yet plugged into our usually schema-based configuration).
Step are:
Note:
Further polishing of the proof of concept is going to happen quickly over the comming couple of weeks, so stay in touch. Priority will be to kill bugs in LolliPop, properly integrate the V-pipe proof-of-concept and make it configurable (no more hard-coded values) and release updated version of Cojac (dev into 0.3) and Lollipop (current main into 0.2).
Later, more widespread format will be integrated (by version 0.3 of LolliPop).
We also plan making a wrapper for Galaxy to show possible integration into other pipelines. (initially using 0.2 + smallgenomeutilities' aln2basecount, eventually standard formats using 0.3).
Again, sorry for the much messy aspect of the wastewater (compared to our usual work on V-pipe) there was a lot of rush to prepare the Recomb paper before the deadline, and to finish the proof-of-concept during last week's biohackathon.
Thanks for such a quick and detailed response. I understand the occasional need for speed over clean code. I'll see what I can do with all the information you shared and look forward to the finished product.
Update on the situation:
aln2basecnt
is shown (exactly as V-pipe does it internally)
I would like to run your notebook WwSmoothingKernel to analyze some wastewater samples, but I'm curious how you produced the tally file: tallymut_line_full.tsv. I can imagine using mutlist-full.txt as a kind of key for parsing .vcf files, but I wanted to make sure. Also, if you already have some method for creating the tally file, there's no point in me recreating it. Thanks for the help.