Closed whedon closed 7 years ago
Hello human, I'm @whedon. I'm here to help you with some common editorial tasks for JOSS. @genomematt it looks like you're currently assigned as the reviewer for this paper :tada:.
:star: Important :star:
If you haven't already, you should seriously consider unsubscribing from GitHub notifications for this (https://github.com/openjournals/joss-reviews) repository. As as reviewer, you're probably currently watching this repository which means for GitHub's default behaviour you will receive notifications (emails) for all JOSS reviews 😿
To fix this do the following two things:
For a list of things I can do to help you, just type:
@whedon commands
@genomematt Where are we at with this review? Please let us know.
I am still working my way through this, but some initial comments @camillescott.
I think that you need a more from first principles statement of need that outlines the sequence matching problem that CRBH is addressing (remembering that this is a generalist scientific computing journal) so that readers don't need to chase a reference to get a basic idea of what the software does.
You need contribution, bug report and support guidelines - check out some other JOSS articles for what this should look like.
A real world (ish) worked example, ideally as a jupyter notebook or doctest file would greatly enhance, and would help with explaining the statement of need.
Very pleased to see coverage as part of your testing. Your exclusions all look very sensible on first pass. You may wish to add #pragma no cover
directives marking the code you don't test. This makes it easier to track in your code and makes full coverage 100%. (suggestion not a requirement)
As you make a claim of improved performance (speed) it would be good to quantify that improvement.
I had an install issue but I am pretty sure it was related to a prior install not yours. I will redo on a clean VM to confirm.
Thanks @genomematt. I'll add a section to the paper describing the approach in a bit more detail. I can easily add a figure with performance comps as well.
Update: I have added a performance comparison figure, a section describing the method in a bit more detail, and contribution guidelines.
Ok @camillescott the added bits are good. My install issue was a conda problem and all was fine on a clean vm.
Given that there are multiple dependencies I think there should be a test step at the end of your install procedure. This would not need to be comprehensive testing of code, just a 'if you have installed it correctly this input will give this output.'
I am giving you a scraping pass on the real world application documentation, but an additional notebook that does not hide the actual running in a script like the performance notebook, showing real transcripts against a real db and then interpreting the csv to answer the biological question would enhance. As it stands the documentation is good for anyone who has done this sort of procedure before, and not as easy for someone who has a set of transcripts and is asking 'what is the best way to match these to the most closely related model organism'. e.g. if I have 10 transcripts will this program work, will the results be valid? I really care about gene family X, how do I interpret the output to find my family X transcripts?
Neither of those would stop me recommending accepting, but the complete lack of community guidelines does.
Community guidelines: Are there clear guidelines for third parties wishing to 1) Contribute to the software 2) Report issues or problems with the software 3) Seek support
What is expected for guidelines? I did add a CONTRIBUTING.md: https://github.com/camillescott/shmlast/blob/master/CONTRIBUTING.md
I followed the example of several other accepted projects on that -- should I add more?
Sorry @camillescott I missed the CONTRIBUTING.md. I suspect a lot of other people especially those with bugs or support questions would as well. At least link it from the readme, and personally I would be inclined to put such a short section just in the readme.
Ok @biorelated I consider this ready to accept.
Copy that @genomematt -- added it to the readme :)
Many thanks @genomematt.
@arfon can we prepare this work for publication please?
@camillescott - At this point could you make an archive of the reviewed software in Zenodo/figshare/other service and update this thread with the DOI of the archive? I can then move forward with accepting the submission.
@arfon: I've archived on Zenodo; here is the DOI: http://doi.org/10.5281/zenodo.260437
note that the final published version after the requested changes is v1.1, not v1.0.2 as in Whedon's first post in this issue
@whedon set 10.5281/zenodo.260437 as archive
OK. 10.5281/zenodo.260437 is the archive.
Many thanks for your review @genomematt and thanks for editing this one @biorelated ✨
@camillescott - your paper is now accepted into JOSS and your DOI is http://dx.doi.org/10.21105/joss.00142 ⚡️ 🚀 💥
Submitting author: @camillescott (Camille Scott) Repository: https://github.com/camillescott/shmlast Version: v1.1 Editor: @biorelated Reviewer: @genomematt Archive: 10.5281/zenodo.260437
Status
Status badge code:
Reviewers and authors:
Please avoid lengthy details of difficulties in the review thread. Instead, please create a new issue in the target repository and link to those issues (especially acceptance-blockers) in the review thread below. (For completists: if the target issue tracker is also on GitHub, linking the review thread in the issue or vice versa will create corresponding breadcrumb trails in the link target.)
Reviewer questions
Conflict of interest
General checks
Functionality
Documentation
Software paper
paper.md
file include a list of authors with their affiliations?