Closed whedon closed 4 years ago
Hello human, I'm @whedon, a robot that can help you with some common editorial tasks. @jlincbio, @VivekTodur it looks like you're currently assigned to review this paper :tada:.
:star: Important :star:
If you haven't already, you should seriously consider unsubscribing from GitHub notifications for this (https://github.com/openjournals/joss-reviews) repository. As a reviewer, you're probably currently watching this repository which means for GitHub's default behaviour you will receive notifications (emails) for all reviews 😿
To fix this do the following two things:
For a list of things I can do to help you, just type:
@whedon commands
For example, to regenerate the paper pdf after making changes in the paper's md or bib files, type:
@whedon generate pdf
Attempting PDF compilation. Reticulating splines etc...
@jlincbio, @VivekTodur - this is where the review happens! Instructions are available in the post above - don't hesitate to ping me if you have questions!
@RafalUrniaz - please take a look at the checklists above and make sure that all required items are available in your submission. If not, you can already add them now, to facilitate the review process.
@whedon check references
Attempting to check references...
OK DOIs
- 10.1016/0022-2836(70)90057-4 is OK
- 10.1016/0022-2836(81)90087-5 is OK
- 10.1186/1472-6785-11-11 is OK
MISSING DOIs
- None
INVALID DOIs
- None
I have some comments about the software and documentation - can I just paste them here?
@csoneson, @RafalUrniaz: here are some of my comments so far:
devtools
package in R is not typically installed by default;R CMD INSTALL
devtools
.tcR
and rDNAse
require way too many other dependencies that are unrelated to the core function of this package. Looking at the R code it seems like only two functions rDNAse::twoSeqSim()
and tCR::generate.kmers()
are invoked, and neither of which require calls to features not included in core R and Biostrings
; perhaps incorporate these functions into the package and cite the two packages instead?
Sources: rDNAse::twoSeqSim
and tcR::generate.kmers
tcR
dependency igraph
about missing gfortran; this is more or less carelessness on my end but also a side effect of requiring unnecessary dependencies.While the documentation describes the purpose of the tool as to "calculate similarity score matrix for DNA k-mers," there is no description on what problems this tool is designed to solve or the target audience.
There is no documentation or tutorial on how to export results from the R package to use with the companion Python program (heatmap4kmers), even though Figure 1 is actually generated from the Python program and the R package includes an export function; from my perspective it seems like a core feature that is left undocumented.
The documentation includes a clear walkthrough, but it would be nice to have a more real-world scenario use case than GATTACA.
This is missing in the documentation but can be fixed relatively quickly.
since the introduction discusses a layman’s description of DNA, so technically it satisfies the requirement that it describes the tool to a "diverse, non-specialist audience"; however, it needs a little more discussion on the purpose of this tool in the introduction section in my opinion.
The paper describes the purpose behind creating such a tool to generate similarity score matrices but does not describe the sort of research problem this tool can help address; while similarity scores are calculated to compare sequences, from time to time this can be done without using this tool, so I suggest that the paper should try to address the purpose of this tool better. Since this is technically an academic paper, it should adhere to JOSS submission requirements that require that "the software should have an obvious research application" and "be a significant contribution to the available open source software."
This is entirely missing in the paper. A discussion on why someone would bother to calculate these scores to compare sequence similarities would be a starting point to illustrate the purpose of this tool.
It is a bit unclear as it stands what the purpose and use cases of this tool are. The manuscript describes what some the basic keywords mean, but fails to address the scientific idea. For instance, the author mentions a list of PAM and BLOSUM matrices frequently used in bioinformatics, and while the different use cases for these matrices are too fine and beyond the scope of this manuscript, merely listing them and not discussing what they are clutters the writing.
The paper cities Wikipedia and PLoSWiki entries; this I find a bit problematic for archival purposes. Additionally, the NCBI citations are not archived either.
heatmap()
function or ggplots
(one of the dependencies installed by tcR
), seeing how the paper includes this figure and in a way counts the ability to generate data for this tool as part of the feature set. Without this, the package seems a bit bare bones, since it can be construed as a wrapper for Biostrings::pairwiseAlignment
; this technically goes against the requirement that minor utility packages being unacceptable for submission.Hi all, just checking in on how the reviews are progressing here - @jlincbio, thanks for your comments above! @VivekTodur - do you think you will have the chance to provide your review in the coming week? Thanks!
@csoneson thanks for checking in. Please let me know when I need to follow up. I have a few things on the agenda for the next two weeks but quick checks are not a problem.
Yes, I am almost done with the review. I can revert back by this Sunday...
Thanks
On Thu, 3 Oct 2019, 12:27 am Charlotte Soneson, notifications@github.com wrote:
Hi all, just checking in on how the reviews are progressing here - @jlincbio https://github.com/jlincbio, thanks for your comments above! @VivekTodur https://github.com/VivekTodur - do you think you will have the chance to provide your review in the coming week? Thanks!
— You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub https://github.com/openjournals/joss-reviews/issues/1744?email_source=notifications&email_token=AC4CFXQCYXXGTJFACGSMGVTQMTVHXA5CNFSM4IXZM3LKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOEAFZ4AY#issuecomment-537632259, or mute the thread https://github.com/notifications/unsubscribe-auth/AC4CFXSIQE3WOE45VAEY23TQMTVHXANCNFSM4IXZM3LA .
Yes, I am almost done with the review. I can revert back by this Sunday... Thanks
@VivekTodur - any updates on your review?
@csoneson Thanks for the opportunity to review this manuscript and my apologies for the delay. Below are my comments,
Functionality Installation Installation is quick and simple as described in the documentation on Ubuntu 19.04. Shouldn't be a problem in other versions of Linux computers that are prepared for biological sequence analysis.
Documentation Statement of need Manual for the tool "kmeRs: K-Mers Similarity Score Matrix" is not clearly describing the general purpose and requirement of the tool, A quick note is required.
Functionality documentation Various functions of the tool are neatly illustrated with relevant examples, but it lacks documentation on how to export the results and it's output format. Also, manuscript containings Kmer Heatmap image generated from heatmap4kmers, A quick note on this is very useful.
Example usage The documentation includes detailed instructions on the How-To-Use. As mentioned earlier output export method is missing
Community guidelines General guidelines are missing, should be added.
Software paper Summary The tool is developed to address problems related to biological sequence similarity searches. In this, very focused context author has neatly explained the purpose.
Statement of need Paper titled "K-Mers Similarity Score Matrix" describes the purpose and requirement of the tool in layman terms. The author needs to describe the research object, a real-world requirement and how kmeRs is outperforming the other tools.
State of the field An author should include this section as it is missing. Also, it would be nice to see a quick note on how this tool can be integrated into existing other pipelines.
Quality of writing There are plenty of tools/algorithms already available to solve the sequence similarity search problem. The author needs to explain the exact purpose of kmeRs, and how it is different from the existing one in terms of functionalities. A computational benchmark [CPU vs Memory] would add more value to the manuscript. Empirical formula to calculate KmeR Score is also missing in the manuscript.
References Considering this is software paper, including citations from the scientific journal, are sufficient. Those Wikipedia citations can be omitted.
@VivekTodur - thanks for your review! @RafalUrniaz - please go through the comments from the reviewers above, and report back here when you are ready for them to take another look. Of course, don't hesitate to ping me or the reviewers if you have questions.
Hi @RafalUrniaz - I just wanted to make sure that you have seen the comments above from the reviewers, and check whether you would have an estimate of when you might be able to have a new version for them to consider. Please let us know if you have questions.
Ping @RafalUrniaz. Could you please let us know whether you are working on a revision of your submission? If you expect that it will take a long time to get back to this, also please let us know, and we can pause this issue. Thanks.
@RafalUrniaz - I'm going to pause this for now. Please let us know when you have an updated version and are ready for the review to resume.
👋 @csoneson - since it looks like you haven't heard back from the author, you may need to ping them via email at this point
I have been in contact with @RafalUrniaz via email, and he would like to withdraw this submission since he will not have time to get back to it in the near future.
@openjournals/joss-eics - I will close this issue and add the 'withdrawn' label, is there anything else that should be done from my side?
@jlincbio, @VivekTodur - thanks a lot for your reviews, really appreciate your time and input.
We have to withdraw the submission on the AEIC interface, which I just did now.
@csoneson Thanks for the notice. If there are other manuscripts needing review please do not hesitate to let me know.
Submitting author: @RafalUrniaz (Rafal Urniaz) Repository: https://github.com/RafalUrniaz/kmeRs Version: 1.1.0 Editor: @csoneson Reviewer: @jlincbio, @VivekTodur Archive: Pending
Status
Status badge code:
Reviewers and authors:
Please avoid lengthy details of difficulties in the review thread. Instead, please create a new issue in the target repository and link to those issues (especially acceptance-blockers) by leaving comments in the review thread below. (For completists: if the target issue tracker is also on GitHub, linking the review thread in the issue or vice versa will create corresponding breadcrumb trails in the link target.)
Reviewer instructions & questions
@jlincbio & @VivekTodur, please carry out your review in this issue by updating the checklist below. If you cannot edit the checklist please:
The reviewer guidelines are available here: https://joss.readthedocs.io/en/latest/reviewer_guidelines.html. Any questions/concerns please let @csoneson know.
✨ Please try and complete your review in the next two weeks ✨
Review checklist for @jlincbio
Conflict of interest
Code of Conduct
General checks
Functionality
Documentation
Software paper
Review checklist for @VivekTodur
Conflict of interest
Code of Conduct
General checks
Functionality
Documentation
Software paper