openjournals / joss-reviews

Reviews for the Journal of Open Source Software
Creative Commons Zero v1.0 Universal
708 stars 38 forks source link

[PRE REVIEW]: target-methylseq-qc: a lightweight pipeline for collecting metrics from targeted sequence mapping files. #7238

Open editorialbot opened 1 week ago

editorialbot commented 1 week ago

Submitting author: !--author-handle-->@abhi18av<!--end-author-handle-- (Abhinav Sharma) Repository: https://github.com/wal-yan/target-methylseq-qc Branch with paper.md (empty if default branch): master Version: v2.0.0 Editor: Pending Reviewers: Pending Managing EiC: Kevin M. Moerman

Status

status

Status badge code:

HTML: <a href="https://joss.theoj.org/papers/220574eb92e0a5c64f58eb092dfd399a"><img src="https://joss.theoj.org/papers/220574eb92e0a5c64f58eb092dfd399a/status.svg"></a>
Markdown: [![status](https://joss.theoj.org/papers/220574eb92e0a5c64f58eb092dfd399a/status.svg)](https://joss.theoj.org/papers/220574eb92e0a5c64f58eb092dfd399a)

Author instructions

Thanks for submitting your paper to JOSS @abhi18av. Currently, there isn't a JOSS editor assigned to your paper.

@abhi18av if you have any suggestions for potential reviewers then please mention them here in this thread (without tagging them with an @). You can search the list of people that have already agreed to review and may be suitable for this submission.

Editor instructions

The JOSS submission bot @editorialbot is here to help you find and assign reviewers and start the main review. To find out what @editorialbot can do for you type:

@editorialbot commands
editorialbot commented 1 week ago

Hello human, I'm @editorialbot, a robot that can help you with some common editorial tasks.

For a list of things I can do to help you, just type:

@editorialbot commands

For example, to regenerate the paper pdf after making changes in the paper's md or bib files, type:

@editorialbot generate pdf
editorialbot commented 1 week ago
Reference check summary (note 'MISSING' DOIs are suggestions that need verification):

βœ… OK DOIs

- 10.1101/2023.04.29.23289314 is OK
- 10.1038/nbt.3820 is OK
- 10.1038/s41592-018-0046-7 is OK
- 10.1093/bioinformatics/btx192 is OK
- 10.5281/zenodo.10463781 is OK
- 10.1093/gigascience/giab008 is OK
- 10.1101/gr.107524.110 is OK
- 10.1038/nbt.3820 is OK
- 10.1093/bioinformatics/btw354 is OK
- 10.1038/s41587-020-0439-x is OK
- 10.1093/bioinformatics/btq033 is OK
- 10.5281/zenodo.13147688 is OK
- 10.5281/zenodo.13601364 is OK

🟑 SKIP DOIs

- No DOI given, and none found for title: The nf-core framework for community-curated bioinf...
- No DOI given, and none found for title: CreateSequenceDictionary (Picard)
- No DOI given, and none found for title: Picard toolkit
- No DOI given, and none found for title: CollectHsMetrics (Picard)
- No DOI given, and none found for title: CollectMultipleMetrics (Picard)
- No DOI given, and none found for title: HTS format specifications
- No DOI given, and none found for title: Babraham Bioinformatics - FastQC A Quality Control...
- No DOI given, and none found for title: Twist Methylome
- No DOI given, and none found for title: Twist Methylome
- No DOI given, and none found for title: target-methylseq-qc website

❌ MISSING DOIs

- None

❌ INVALID DOIs

- None
editorialbot commented 1 week ago

Software report:

github.com/AlDanial/cloc v 1.90  T=0.05 s (1532.4 files/s, 244651.6 lines/s)
-------------------------------------------------------------------------------
Language                     files          blank        comment           code
-------------------------------------------------------------------------------
CSS                              5             39             20           2238
JavaScript                      11            235            226           2112
SVG                              3              3              3           2081
HTML                             4             53             10           1537
YAML                            27             74             30            905
JSON                             7              2              0            635
XML                              2              0              0            518
Markdown                         9            295              0            494
Groovy                           4             76            103            354
TeX                              1             31              0            339
Python                           2             61             90            183
CSV                              3              0              0             10
TOML                             1              1              2              7
Bourne Shell                     1              0              0              5
-------------------------------------------------------------------------------
SUM:                            80            870            484          11418
-------------------------------------------------------------------------------

Commit count by author:

   111  Abhinav Sharma
     1  Patricia
     1  t4ly4
editorialbot commented 1 week ago

Paper file info:

πŸ“„ Wordcount for paper.md is 1753

βœ… The paper includes a Statement of need section

editorialbot commented 1 week ago

License info:

βœ… License found: MIT License (Valid open source OSI approved license)

editorialbot commented 1 week ago

:point_right::page_facing_up: Download article proof :page_facing_up: View article proof on GitHub :page_facing_up: :point_left:

editorialbot commented 1 week ago

Five most similar historical JOSS papers:

Acanthophis: a comprehensive plant hologenomics pipeline Submitting author: @kdm9 Handling editor: @marcosvital (Active) Reviewers: @bricoletc, @gbouras13, @abhishektiwari Similarity score: 0.7228

MetaGenePipe: An Automated, Portable Pipeline for Contig-based Functional and Taxonomic Analysis Submitting author: @ParkvilleData Handling editor: @jmschrei (Active) Reviewers: @Ebedthan, @rjorton Similarity score: 0.7062

nf-gwas-pipeline: A Nextflow Genome-Wide Association Study Pipeline Submitting author: @ZeyuanSong Handling editor: @lpantano (Active) Reviewers: @preetida, @rspirgel Similarity score: 0.6955

CheckQC: Quick quality control of Illumina sequencing runs Submitting author: @johandahlberg Handling editor: @pjotrp (Retired) Reviewers: @brainstorm Similarity score: 0.6866

Koverage: Read-coverage analysis for massive (meta)genomics datasets Submitting author: @beardymcjohnface Handling editor: @csoneson (Active) Reviewers: @lparsons, @telatin Similarity score: 0.6764

⚠️ Note to editors: If these papers look like they might be a good match, click through to the review issue for that paper and invite one or more of the authors before considering asking the reviewers of these papers to review again for JOSS.

abhi18av commented 5 days ago

CC @agudeloromero @t4ly4

Kevin-Mattheus-Moerman commented 3 days ago

@abhi18av Dear author, thanks for this submission. I am the AEiC on this track and here to help process the initial steps. Before we proceed, please can you have a look at the following points:

Kevin-Mattheus-Moerman commented 3 days ago

@editorialbot invite @csoneson as editor

editorialbot commented 3 days ago

Invitation to edit this submission sent!

csoneson commented 15 hours ago

In principle I'm happy to edit this, but would like to first wait for the author's responses to @Kevin-Mattheus-Moerman's queries above.

abhi18av commented 14 hours ago

@abhi18av Dear author, thanks for this submission. I am the AEiC on this track and here to help process the initial steps. Before we proceed, please can you have a look at the following points:

Dear @Kevin-Mattheus-Moerman and @csoneson , thank you for your time to evaluate this manuscript!

I have addressed the comments inline.

  • βœ… Please study the above reference check ☝️ and see if you can address any of the reported potential DOI issues. You can add/amend DOI entries in your .bib file, and call @editorialbot check references here to check them again.

Sure, I have updated the DOI for a few more citations, however as some entries don't have an associated publication (such as picard) or there's no consensus on how to cite (such as HTS specification https://github.com/samtools/hts-specs/issues/179 ), I have simply used the @online bib resource annotation for those.

If there's a better way or a JOSS convention to address this, please let us know and we will be happy to accommodate.

  • βœ… I may have missed it, but can you confirm this project features (automated) testing? If so it may be good to link to this in the README.

Ah yes, there are a bunch of Github actions in the repo which are triggered upon relevant events.

In addition, I have added an explanation in the REAMDE for the bundled test dataset which we provide to users for quick testing https://github.com/wal-yan/target-methylseq-qc?tab=readme-ov-file#testing .

  • βœ… Could you help me understand the above code report, and potentially add to it in terms of nextflow contributions? What aspects of the report would you say is your core achievement/new contribution? In addition, can you help estimate the lines of code number for the nextflow work? We ask this as some tools exist e.g. to automatically generate JavaScript GUI related code for instance. So any help to judge the "weight/size" of this submission would be appreciated.

In terms of the cloc report from https://github.com/openjournals/joss-reviews/issues/7238#issuecomment-2352844762, I must say that numbers hide the overall big picture, but thank you for raising this.

The principle changes regarding the implementation logic are of course in the Nextflow/Groovy layer, however as Nextflow is just the DSL for the orchestration of tasks, we have worked on other layers/languages as well.

The samplesheet check (written in Python) is specific to this pipeline and checks for the overall validity of the samplesheet as a pre-flight check, in addition to the test samplesheet files in CSV format (assets/test_samplesheet_bed_filter.csv and assets/test_samplesheet_picard_profiler.csv.

Furthermore, once the analysis is done, the generated results are merged and pushed to MultiQC which relies on a customized YAML file (assets/multiqc_config.yml) in order to present the principal summary report.

Finally in terms of the UI for Nextflow Schema renderers, the JSON format has been customized to reflect the principal parameters of the pipeline corresponding to different modes.

I must also highlight that the creation of test_* profiles is done in *`conf/config** scripts which are alsoGroovy/Nextflowscripts but __do not__ get picked up bycloc` as any language in the overall counts.

Therefore, kindly take this into consideration πŸ™

image
abhi18av commented 14 hours ago

@editorialbot check references

abhi18av commented 14 hours ago

@editorialbot generate pdf

editorialbot commented 14 hours ago
Reference check summary (note 'MISSING' DOIs are suggestions that need verification):

βœ… OK DOIs

- 10.1101/2023.04.29.23289314 is OK
- 10.1038/nbt.3820 is OK
- 10.1038/s41587-020-0439-x is OK
- 10.1038/s41592-018-0046-7 is OK
- 10.1093/bioinformatics/btx192 is OK
- 10.5281/zenodo.10463781 is OK
- 10.1093/gigascience/giab008 is OK
- 10.1101/gr.107524.110 is OK
- 10.1038/nbt.3820 is OK
- 10.1093/bioinformatics/btw354 is OK
- 10.1038/s41587-020-0439-x is OK
- 10.1093/bioinformatics/btq033 is OK
- 10.5281/zenodo.8251379 is OK
- 10.5281/zenodo.13597863 is OK

🟑 SKIP DOIs

- No DOI given, and none found for title: CreateSequenceDictionary (Picard)
- No DOI given, and none found for title: Picard toolkit
- No DOI given, and none found for title: CollectHsMetrics (Picard)
- No DOI given, and none found for title: CollectMultipleMetrics (Picard)
- No DOI given, and none found for title: HTS format specifications
- No DOI given, and none found for title: Babraham Bioinformatics - FastQC A Quality Control...
- No DOI given, and none found for title: Twist Methylome
- No DOI given, and none found for title: Twist Methylome
- No DOI given, and none found for title: target-methylseq-qc website

❌ MISSING DOIs

- None

❌ INVALID DOIs

- None
editorialbot commented 14 hours ago

:point_right::page_facing_up: Download article proof :page_facing_up: View article proof on GitHub :page_facing_up: :point_left:

editorialbot commented 14 hours ago

Five most similar historical JOSS papers:

Acanthophis: a comprehensive plant hologenomics pipeline Submitting author: @kdm9 Handling editor: @marcosvital (Active) Reviewers: @bricoletc, @gbouras13, @abhishektiwari Similarity score: 0.7247

MetaGenePipe: An Automated, Portable Pipeline for Contig-based Functional and Taxonomic Analysis Submitting author: @ParkvilleData Handling editor: @jmschrei (Active) Reviewers: @Ebedthan, @rjorton Similarity score: 0.7067

nf-gwas-pipeline: A Nextflow Genome-Wide Association Study Pipeline Submitting author: @ZeyuanSong Handling editor: @lpantano (Active) Reviewers: @preetida, @rspirgel Similarity score: 0.6974

CheckQC: Quick quality control of Illumina sequencing runs Submitting author: @johandahlberg Handling editor: @pjotrp (Retired) Reviewers: @brainstorm Similarity score: 0.6879

RNAsik: A Pipeline for complete and reproducible RNA-seq analysis that runs anywhere with speed and ease Submitting author: @serine Handling editor: @pjotrp (Retired) Reviewers: @andrewyatz Similarity score: 0.6763

⚠️ Note to editors: If these papers look like they might be a good match, click through to the review issue for that paper and invite one or more of the authors before considering asking the reviewers of these papers to review again for JOSS.

Kevin-Mattheus-Moerman commented 13 hours ago

@editorialbot query scope

editorialbot commented 13 hours ago

Submission flagged for editorial review.

Kevin-Mattheus-Moerman commented 13 hours ago

@abhi18av thanks for providing those additional details. I have just flagged this submission for a scope review by our editorial board. This is because I need some help to determine if this work is in scope, and if the pipeline/workflow you present meets our substantial scholarly effort criterion.

The scope review should take about 2 weeks to complete.