DEploid-dev / dEploidPaper

0 stars 0 forks source link

reviewer 3 comments #13

Closed shajoezhu closed 7 years ago

shajoezhu commented 7 years ago

Reviewer: 3

Comments to the Author Review of Zhi et al "Deconvolution of multiple infections in Plasmodium falciparum from high throughput sequencing data"

This paper describes how to infer the mixture decomposition of multiple strains of haploid organisms when multiple, related strains may be present in the same sample. This is an important problem in bacterial genetics, as argued by the authors, and they present a workable solution to this. The solution used, to use a copying model and perform markov-chain monte carlo analysis to extract out the appropriate details for the copying model, is an interesting novel application of these methods. To the best of my understanding it is correctly implemented and performs a useful job.

Minor comments:

jalmagro commented 7 years ago

"So I'm generally positive about this paper. I don't have major concerns, but as it stands it is not very easy to read. It is laid out in the classic mathematical style, which is to say to get to the results the reader has to slog through a lot of complex descriptions of mcmc updates, which have not been given any context or intuition. The writing is not bad but the ms would benefit hugely from a) a reorganisation to hide the gore from an interested biological-minded reader, and b) some effort to explain the details in intuitive terms. Some specific suggestions are listed below."

I think he/she has a point here. A broad discussion of the algorithm, step by step, and moving the math into the supp. material would make the paper more appealing (and easy to read).

jalmagro commented 7 years ago

"More technically, I found the technical details to be slightly unsatisfactorally explored. Specific concerns were the arbitrary value of G=20 (page 4) which scales the recombination rate. This is pretty unconvincing. I agree that the model usually allows for some misspecification of the recombination rate but something much better could be done. Either do the right thing (inference of G by EM or analogously) or show that it is insensitive."

A fair point although my experience with Pf is that the painting model tends to be very robust unless extreme values of recombination are used (tried with a set of ranges, for instance, for the inbreeding analysis). We can rerun the model with different scaling factors and show this or go for the EM run, but I would avoid implementing anything new at this point.

jalmagro commented 7 years ago

"I also disliked the anecdotalaity of Figure 2 - I was not clear what the general takehome message was meant to be, and the plot with its many black bars is quite confusing."

We need a different representation for haplotypes, maybe just rendering differences.