pyOpenSci / software-submission

Submit your package for review by pyOpenSci here! If you have questions please post them here: https://pyopensci.discourse.group/
94 stars 35 forks source link

QuadratiK Submission #180

Open rmj3197 opened 6 months ago

rmj3197 commented 6 months ago

Submitting Author: Raktim Mukhopadhyay (@rmj3197) All current maintainers: @giovsaraceno Package Name: QuadratiK One-Line Description of Package: QuadratiK includes test for multivariate normality, test for uniformity on the sphere, non-parametric two- and k-sample tests, random generation of points from the Poisson kernel-based density and clustering algorithm for spherical data. Repository Link: https://github.com/rmj3197/QuadratiK Version submitted: 1.1.0 EIC: @Batalex Editor: @isabelizimm Reviewer 1: @acolum Reviewer 2: @ab93 Archive: TBD JOSS DOI: TBD Version accepted: TBD Date accepted (month/day/year): TBD


Code of Conduct & Commitment to Maintain Package

Description

We introduce the QuadratiK package that incorporates innovative data analysis methodologies. The presented software, implemented in both R and Python, offers a comprehensive set of novel goodness-of-fit tests and clustering techniques using kernel-based quadratic distances. Our software implements one, two and k-sample tests for goodness of fit, providing an efficient and mathematically sound way to assess the fit of probability distributions. Expanded capabilities of our software include supporting tests for uniformity on the $d$-dimensional Sphere based on Poisson kernel densities, and algorithms for generating random samples from Poisson kernel densities. Particularly noteworthy is the incorporation of a unique clustering algorithm specifically tailored for spherical data that leverages a mixture of Poisson kernel-based densities on the sphere. Alongside this, our software includes additional graphical functions, aiding the users in validating, as well as visualizing and representing clustering results. This enhances interpretability and usability of the analysis. In summary, our R and Python packages serve as a powerful suite of tools, offering researchers and practitioners the means to delve deeper into their data, draw robust inference, and conduct potentially impactful analyses and inference across a wide array of disciplines.

Scope

Domain Specific

Community Partnerships

If your package is associated with an existing community please check below:

[^1]: Please fill out a pre-submission inquiry before submitting a data visualization package.

Technical checks

For details about the pyOpenSci packaging requirements, see our packaging guide. Confirm each of the following by checking the box. This package:

Publication Options

JOSS Checks - [ ] The package has an **obvious research application** according to JOSS's definition in their [submission requirements][JossSubmissionRequirements]. Be aware that completing the pyOpenSci review process **does not** guarantee acceptance to JOSS. Be sure to read their submission requirements (linked above) if you are interested in submitting to JOSS. - [ ] The package is not a "minor utility" as defined by JOSS's [submission requirements][JossSubmissionRequirements]: "Minor ‘utility’ packages, including ‘thin’ API clients, are not acceptable." pyOpenSci welcomes these packages under "Data Retrieval", but JOSS has slightly different criteria. - [ ] The package contains a `paper.md` matching [JOSS's requirements][JossPaperRequirements] with a high-level description in the package root or in `inst/`. - [ ] The package is deposited in a long-term repository with the DOI: *Note: JOSS accepts our review as theirs. You will NOT need to go through another full review. JOSS will only review your paper.md file. Be sure to link to this pyOpenSci issue when a JOSS issue is opened for your package. Also be sure to tell the JOSS editor that this is a pyOpenSci reviewed package once you reach this step.*

Are you OK with Reviewers Submitting Issues and/or pull requests to your Repo Directly?

This option will allow reviewers to open smaller issues that can then be linked to PR's rather than submitting a more dense text based review. It will also allow you to demonstrate addressing the issue via PR links.

Confirm each of the following by checking the box.

Please fill out our survey

P.S. Have feedback/comments about our review process? Leave a comment here

Editor and Review Templates

The editor template can be found here.

The review template can be found here.

Batalex commented 6 months ago

Editor in Chief checks

Hi there! Thank you for submitting your package for pyOpenSci review. Below are the basic checks that your package needs to pass to begin our review. If some of these are missing, we will ask you to work on them before the review process begins.

Please check our Python packaging guide for more information on the elements below.



Editor comments

Nice submission, I'll get started on finding the perfect editor for Quadratik!

Batalex commented 5 months ago

Hey @rmj3197, I am super excited to introduce @isabelizimm as the editor for this submission! Isabel will be your privileged point of contact from now on, though you are welcome to ask me anything during the process. Please note that she will not get started until the week after June 7th.

Happy reviewing!

isabelizimm commented 5 months ago

Hello there! Happy to be ushering this package through 👋 I'm going to go ahead and start looking for reviewers; I'll plan to touch base when I have reviewers lined up OR in 2 weeks (say, June 24), whichever comes first.

rmj3197 commented 5 months ago

Hello @isabelizimm,

Thank you so much for the update and for taking the time to review our package. I look forward to hearing from you soon.

isabelizimm commented 5 months ago

Checking in! I have one reviewer ready (yay!) and have reached out to some possibilities for a second. I'll keep you updated when I know more 👍

rmj3197 commented 5 months ago

Hello @isabelizimm , thank you very much for the update!

isabelizimm commented 4 months ago

Welcome welcome to our fearless reviewers: @acolum and @ab93 👋 Thank you SO MUCH for volunteering to review for pyOpenSci! You are two people with awesome math-y, stats-y, ML-y, Python-y backgrounds, which is perfect for this package, and I am looking forward to learning from you through this review process 🌻

Please fill out our pre-review survey

Before beginning your review, please fill out our pre-review survey. This helps us improve all aspects of our review and better understand our community. No personal data will be shared from this survey - it will only be used in an aggregated format by our Executive Director to improve our processes and programs.

The following resources will help you complete your review:

  1. Here is the reviewers guide. This guide contains all of the steps and information needed to complete your review.
  2. Here is the review template that you will need to fill out and submit here as a comment, once your review is complete. You can look at other issues in this repository for examples of what this might look like.

Please get in touch with any questions or concerns! ~Your review is due 3 weeks from now, which is July 19.~ New review date: Aug 2. Please let me know if this date does not work for you.

tldr;

Reviewers: @acolum and @ab93 Due date [NOTE: deadline extended]: August 2

acolum commented 4 months ago

Package Review

Please check off boxes as applicable, and elaborate in comments below. Your review is not limited to these topics, as described in the reviewer guide

Documentation

The package includes all the following forms of documentation:

Readme file requirements The package meets the readme requirements below:

The README should include, from top to bottom:

NOTE: If the README has many more badges, you might want to consider using a table for badges: see this example. Such a table should be more wide than high. (Note that the a badge for pyOpenSci peer-review will be provided upon acceptance.)

Usability

Reviewers are encouraged to submit suggestions (or pull requests) that will improve the usability of the package as a whole. Package structure should follow general community best-practices. In general please consider whether:

Functionality

For packages also submitting to JOSS

Note: Be sure to check this carefully, as JOSS's submission requirements and scope differ from pyOpenSci's in terms of what types of packages are accepted.

The package contains a paper.md matching JOSS's requirements with:

Final approval (post-review)

Estimated hours spent reviewing: 2.5


Review Comments

Overall, this submission was well done and followed most Python package development and documentation best practices. I found no major issues with the package's documentation, usability, and functionality, but I've outlined a few minor issues below.

Potential issues that could be fixed:

Minor issues that need fixing:

ab93 commented 3 months ago

Package Review

Please check off boxes as applicable, and elaborate in comments below. Your review is not limited to these topics, as described in the reviewer guide

Documentation

The package includes all the following forms of documentation:

Readme file requirements The package meets the readme requirements below:

The README should include, from top to bottom:

NOTE: If the README has many more badges, you might want to consider using a table for badges: see this example. Such a table should be more wide than high. (Note that the a badge for pyOpenSci peer-review will be provided upon acceptance.)

Usability

Reviewers are encouraged to submit suggestions (or pull requests) that will improve the usability of the package as a whole. Package structure should follow general community best-practices. In general please consider whether:

Functionality

For packages also submitting to JOSS

Note: Be sure to check this carefully, as JOSS's submission requirements and scope differ from pyOpenSci's in terms of what types of packages are accepted.

The package contains a paper.md matching JOSS's requirements with:

Final approval (post-review)

Estimated hours spent reviewing: 3 hours


Review Comments

Great submission overall. Documentation is good, and I like the user guide as well. There are a few tweaks, suggestions and adjustments that I can add here.

Packaging and CI

Code Practices, which again can be identified using a linter like Ruff

isabelizimm commented 3 months ago

Thank you so much to our reviewers @acolum and @ab93 for your thoughts on QuadratiK!🌷 The next step here is for the author to implement the changes suggested by reviewers. This piece can involve a bit of back and forth, @rmj3197, please let us know in this thread if you have questions about the review. Otherwise, post here when the reviews have been addressed and the reviewers will look over the updates and give their final approval!

.rst file extension instead of a .md file extension.

This is okay! As long as there is a README file there, we are good to go 😄

rmj3197 commented 3 months ago

Thank you @acolum and @ab93 for your valuable suggestions and comments. Thank you @isabelizimm for your help and communication. I will address the changes and update you once they are completed. Thank you all for your time.

rmj3197 commented 2 months ago

Dear @acolum, @ab93, and @isabelizimm,

Thank you very much for your insightful review. We apologize for the delay in our response. We have tried to address the points raised in the review below:

Comments from @acolum

We have added a list of relevant packages in R and Python in the README file.

We have added the repo status badge and organized the various other badges according to the categories specified in the example package.

We have now linked all vignettes in the README file.

We have added a CITATION.cff file in the repository. Additionally, we have also included the BibTex entry in the README.

This was clarified by @isabelizimm that a .rst file is fine.

Comments from @ab93

We have now updated the development guide with commands and instructions on using Poetry. Please see the README file for the updated guide.

This has been updated in the pyproject.toml file with [python = ">=3.9, !=3.9.7, <3.13"].

The black format CI check is included now. The github action can be found at - https://github.com/rmj3197/QuadratiK/actions/workflows/black_check.yml.

We have now implemented the Ruff linting. The github action can be found at - https://github.com/rmj3197/QuadratiK/actions/workflows/ruff_linting.yml.

We have now included Python typing. The hints are now being shown in our updated documentation (https://quadratik.readthedocs.io/en/latest/api_reference/index.html).

All instance variables are now first defined in the __init__() method.

__slots__ are now included. An example is - https://github.com/rmj3197/QuadratiK/blob/master/QuadratiK/kernel_test/_kernel_test.py

The base Exception class is not raised anymore. We now raise relevant errors.

ab93 commented 1 month ago

Thank you for the changes, @rmj3197. It looks good now. I have approved it!

isabelizimm commented 3 weeks ago

Hi all--apologies for the late response! @acolum, are you able to check if the changes made addressed the comments in your review? If so, please check the box in your review that states The author has responded to my review and made changes to my satisfaction. I recommend approving this package. ✔️ and let us know you have approved it!

acolum commented 3 weeks ago

Thanks for the reminder, @isabelizimm! I've checked the box and approved the package.

rmj3197 commented 3 weeks ago

Thank you @ab93 and @acolum for your feedback in improving the package and approving the changes.

@isabelizimm, thanks for facilitating the process. Please let us know what are the next steps! Thank you very much for your time.