UBC-MDS / software-review

MDS Software Peer Review of MDS-created packages
1 stars 0 forks source link

Submission: altairexpress (Python) #3

Open tejasph opened 4 years ago

tejasph commented 4 years ago

Submitting Authors: Tejas Phaterpekar(@tejasph), Lesley Miller (@aromatic-toast), Jack Tan (@thejacktan), Wenjiao Zou(@zouwenjiao) Package Name: altairexpress One-Line Description of Package: Provides efficient EDA plots in Altair Repository Link: https://github.com/UBC-MDS/altairexpress Version submitted: V1.1.16 Editor: Varada Kolhatkar (@kvarada) Reviewer 1: Evhen Dytyniak (@evhend) Reviewer 2: Katherine Birchard (@katieb1) Archive: TBD
Version accepted: TBD


Description

This package simplifies the process of conducting Exploratory Data Analysis (EDA) on new datasets. It is designed to allow the user to explore the data graphically as well as obtain some basic summary statistics, all by writing only one line of code. Plots are produced using the Altair package under the hood. As Jenny Bryan once said: “Someone has to write for-loops, but it doesn’t have to be you!”. This sentiment has been implemented here for EDA analysis. The user is able to spend more time on analyzing the dataset and less time on configuring complex Altair plot settings.

Scope

* Please fill out a pre-submission inquiry before submitting a data visualization package. For more info, see this section of our guidebook.

Our package provides users with the ability to efficiently visualize their data with scatterplots, histograms, time-series, and Fourier transforms. These plots are also accompanied with axis transformation options and summary statistics.

altairexpress is meant for machine learning data scientists of all levels who want convenient tools to visualize their data both effectively and efficiently. The goal is to enable them to allocate more time to model planning/engineering.

From our knowledge, there has yet to be a popular, well-known Altair EDA package. There may be other packages out there, but our package offers a variety of plots that can fit different data needs.

Technical checks

For details about the pyOpenSci packaging requirements, see our packaging guide. Confirm each of the following by checking the box. This package:

Publication options

No

Are you OK with Reviewers Submitting Issues and/or pull requests to your Repo Directly?

This option will allow reviewers to open smaller issues that can then be linked to PR's rather than submitting a more dense text based review. It will also allow you to demonstrate addressing the issue via PR links.

Code of conduct

P.S. Have feedback/comments about our review process? Leave a comment here

Editor and Review Templates

Editor and review templates can be found here

katieb1 commented 4 years ago

Package Review

Please check off boxes as applicable, and elaborate in comments below. Your review is not limited to these topics, as described in the reviewer guide

Documentation

The package includes all the following forms of documentation:

Readme requirements The package meets the readme requirements below:

The README should include, from top to bottom:

Functionality

For packages co-submitting to JOSS

Note: Be sure to check this carefully, as JOSS's submission requirements and scope differ from pyOpenSci's in terms of what types of packages are accepted.

The package contains a paper.md matching JOSS's requirements with:

Final approval (post-review)

Estimated hours spent reviewing: ~ 3 hours


Review Comments

Overall, very interesting idea for a package and really well done! Just a few notes below:

Documentation
Functionality
evhend commented 4 years ago

Package Review

Please check off boxes as applicable, and elaborate in comments below. Your review is not limited to these topics, as described in the reviewer guide

Documentation

The package includes all the following forms of documentation:

Readme requirements The package meets the readme requirements below:

The README should include, from top to bottom:

Functionality

For packages co-submitting to JOSS

Note: Be sure to check this carefully, as JOSS's submission requirements and scope differ from pyOpenSci's in terms of what types of packages are accepted.

The package contains a paper.md matching JOSS's requirements with:

Final approval (post-review)

Estimated hours spent reviewing: 3


Review Comments

Overall, I enjoyed working my way through your package; I think there are some improvements that can be made, some of which are suggested, but the practical use of the package is evident.

tejasph commented 4 years ago

Thanks for the insightful feedback @katieb1 and @evhend. We will work through all your suggestions.

tejasph commented 4 years ago

From Evhen's comments I have implement the following changes in our latest version:

  1. Corrected the hist() import call in the Readme Package Walkthrough

  2. Corrected the make_scatter import call in the Readme Package Walkthrough Section

  3. In Read the docs, corrected a miss-specified in the make_scatter function. The defined variables now line up accurately with variables found inside the function.

  4. Corrected incorrect import, changing ts to ts_alt

  5. fixed installation command; the install still fails for me though with the following errors:

ERROR: Could not find a version that satisfies the requirement datetools<2.0,>=1.1 (from altairexpress) (from versions: none)
ERROR: No matching distribution found for datetools<2.0,>=1.1 (from altairexpress)

According to someone on Slack, we can't account for these errors unless we have a setup.py or a requirements.txt file. Instead, the user must upgrade the packages that are being referred to in the error. We are still trying to find a fix for this.

And the following is our latest release which has addressed the feedback above: v 1.1.24

aromatic-toast commented 4 years ago

All comments that have been checked off under Katie's review that has been addressed. Please see her review comments on details of the action items.