JOSS Review - Githubissues

adithirgis commented 4 years ago

Hello, Thanks for submitting this to JOSS and inviting me to review. I found the software very useful. Very good work. Congratulations. I am adding a few comments for this below:

I am not able to see the page correctly there is a overlap between the mobile/drag and drop menu in the right and the table area for the Data Loading tab. (I am using a Windows system). Any thoughts about this? Can the table to be placed a little lower?
I tried to run this app without the any data upload : It shows me this error for all combinations dropdown items selected and running the Descriptive Table analysis in Data Exploration tab. Am I selecting the wrong options?
Let me know if I am doing anything wrong here: I am trying to run GAM and LOESS for two numeric variables I get this error.
Please add Community Guidelines to the README.md file: stating how to 1) Contribute to the software 2) Report issues or problems with the software 3) Seek support.
I see that the description and walk-through is well written inside the application interface. It would be very helpful for other users not in the medical/clinical research to use this package for their own research. A descriptive README.md containing example usage and functionality documentation would be very helpful. I strongly suggest to write a descriptive README.md in your repository. Also, this might be a general comment: Is there a possibility to add description of the pre-loaded data base columns to understand the working of the package better.
Are there any other software similar to StatAid existing already? It will be useful to add a comparison here.
You can also list the academic papers in the API Documentation/ README.md where StatAid has been cited. As stated in the paper.md: "It has already been used by multiple researchers and students for their PhD or medical doctors for their thesis."
I think the good practice of writing cleaner consistent code can be implemented here using these wonderful resources (I too found out during my paper review). Implementing 80 or 120 characters in a line and spacing consistency, see http://adv-r.had.co.nz/Style.html and https://devguide.ropensci.org/building.html#code-style.
I would also suggest to do a check using this command- devtools::check(). I came across three warnings(). To solve those you can add some of the build ignore files into this file. There is also a import from namespaces issue/warning which can be probably solved using this.

A minor addition in the installation here would be more clear:

# install.packages("remotes")
# remotes::install_github("VincentAlcazer/StatAid")
# StatAid::run_app()

I would recommend to more software usage in the paper.md file.
I would also suggest to add a few more automated tests other than the recommended one in golem. Please let me know if they are already added and if I skipped it.
I downloaded the data sets and found that the IRIS data set was not giving desired results I was not able to select any numeric column for regression and correlation or run lm/gam/loess. Are there any other checks to be done before we enter new data into the software other than those mentioned in the software Introduction tab.

Let me know if you have any further questions. Thanks again, Stay Safe and Regards Adithi

VincentAlcazer commented 4 years ago

Dear @adithirgis, thank you very much for your review and your comments. I will take the time to answer point by point and correct the issues you raised.

VincentAlcazer commented 4 years ago

Dear @adithirgis ,

Thank you for your review and your positive feedback. Your comments helped me to improve the software with a new version I just uploaded to GitHub (1.01). Unfortunately, I did not manage to reproduce some errors you get. Here is a point-by-point answer. I hope we will manage to find where these errors came from.

Thank you again for your time.

Best regards,

Vincent

The table area have been moved down as suggested, with the addition of a “frequent issue” section.
This is working for me both in local and on the website (see picture for expected behavior). I am trying to see where this error came from: Have you tried to perform the analysis on a fresh launch? Did you load another dataset before loading again the base dataset? Are your packages up to date?
Same comment as for the 2nd point (see picture for expected result).
Additional description and guidelines for these points have been added at the end of the Readme.md.
Thank you for this suggestion. This is indeed an important point and I am currently considering writing a complete tutorial for the software. A short description of the dataset has been added.
To my knowledge, no free open-source software for data analysis exist. Of course, a lot of alternative solutions exist but they are else private software or shareware, or partial webtools with few options. I made a quick additional search on the web and did not manage to find direct concurrent in the same format as StatAid (if you have anything in mind please let me know so I can compare them).
Few papers where StatAid was useful have been published yet. The only one would be Clin Lung Cancer. 2020 May 8:S1525-7304(20)30136-4. doi: 10.1016/j.cllc.2020.04.013, where the software has been used for data exploration but not cited (I was added as an author). The PhD and medical thesis for which StatAid was more thoroughly used are else not going to be published, or with an article in preparation and I hope StatAid will be citable before the submission of these latter works.
The styler package has been used to reformat the whole code in a cleaner way.
I already run this tool before submission and ran it again with no warnings (only 3 notes). I wonder if a location issue could cause this and be potentially related to the 2nd and 3rd issue. Could you please paste me the warnings you had?
The recommended correction has been added.
From what I understood, the paper.md has to be short and should not contain API functionality (from JOSS recommendations: “Given this format, a “full length” paper is not permitted, and software documentation such as API (Application Programming Interface) functionality should not be in the paper and instead should be outlined in the software documentation.”). This point could probably be related to 5, and I am currently working on a more complete tutorial/manual for the software.
The rhub::check_for_cran() test has also been run, with 4 notes and no warnings. Do you have a particular test in mind you would like to see here?
The decimal separator for the Iris dataset is a period (instead of a comma for the original dataset). You have to make the change in the data loading menu so numerical variables are not considered as categorical ones. I added a section in “frequent issues” for this as this could be a regular issue.

adithirgis commented 4 years ago

Hello @VincentAlcazer !

Thanks. Sorry for the late response.

I see that 1, 4, 10, and 13 have been implemented. Thanks!
For 2 and 3, I tried with three Windows 10 system and RStudio 3.5, 3.6 and 4.0 with all updated packages (for packages with specific version please add that in your DESCRIPTION), new session with just the example data to be tested. I still find the same error, yes the online version works perfectly well. (though comment 9 has been resolved)
```
Warning in chisq.test(table) :
Chi-squared approximation may be incorrect
Warning: Error in : Tibble columns must have compatible sizes.
```
Size 8: Existing data.

Size 4: Column ELN2017. i Only values of size one are recycled. 146: geom_smooth() using formula 'y ~ x' Warning in predict.gam(x, newdata, type = type.predict) : not all required variables have been supplied in newdata! Warning: Error in eval: object 'Age' not found 176:


I also tried the [RStudio cloud](https://rstudio.cloud/) to test this, I get the same errors 2 and 3, pasting 2 and error from RStudio cloud -
![image](https://user-images.githubusercontent.com/63432077/92633932-25984e00-f2f1-11ea-94a9-fcd300e46d22.png)
![image](https://user-images.githubusercontent.com/63432077/92634164-7f007d00-f2f1-11ea-9edd-7afb828dee80.png)
I would like to know what is wrong here so that other users do not face the same issue.

For 5, I would wait for the README.md to complete to wrap up the review process.
Please add 6 in the paper.md that author is not aware of any similar free, open-source software like StatAid (of course in the right way).
For 7, so StatAid was used but not cited. I wanted to know to verify it. Thanks.
For 9, I used the new version you uploaded (updated the packages, and started a new session), now I find just three notes. Great! Sorry, I did not do that earlier.
For 12, I would suggest testthat / shinytest apart from what Nistara mentioned for testing shiny apps, I do see a tests folder made using testthat with the default test file. Please add test unit for automatically testing your application. Tests for functions will also be required. Examples from JOSS - https://github.com/michaellevy/gwdegree/tree/master/tests/testthat, https://github.com/PsyChiLin/EFAshiny/tree/master/tests/testthat, https://github.com/OttaviaE/DscoreApp/tree/master/tests, https://github.com/pmcharrison/psychTestR/tree/master/tests/testthat.

Please treat the following as minor comments / optional-

For 8, I would still recommend using the resources shared before since some of the code cleaning does not happen using that particular project as seen here. (the characters in a line seem way more than 120 characters).
For 11, I was wondering if a brief descriptive explanation of analysis that StatAid can achieve (Since I see a lot of analysis can be done using this.) It would help get more visibility.
I think, I too agree with Nistara's view about the colour palette. It will be good step.
If there are options to export table it would be great. I could save the graphs but I could not export the tables (if this helps: https://stackoverflow.com/questions/45270312/downloading-multiple-reactive-objects-containing-data-tables).

Feel free to let me know if you have questions. All the best! I will surely try to use this for air quality data! Thanks a lot. Regards and Stay safe Adithi

VincentAlcazer commented 4 years ago

Dear @adithirgis,

Thank you for your answer and your patience. I just uploaded a new version (1.02) integrating @nistara comments and some of your remaining reported issues. Three big points are still pending (the GAM/LOESS issue that I tried to correct, the guide/tutorial and the need for more test that are still work in progress) and I hope I will manage to provide a full correction soon.

Thank you again for your time and your help improving StatAid.

Best regards,

Vincent

2-3. For the GAM/LOESS issue, I tried to force the loading of the mgcv package at the app start: could you please tell me if you still have the issue?

The readme file with the tutorial/guide is still work in progress. I will let you know when it will be ready.
A sentence has been added in the paper.md: “To my knowledge, no free open-source software directly designed for researchers with an intuitive interface and a collaborative/evolving environment has been proposed yet.”
I removed the statement of previous use as I have no citations/proof to provide.
The testing part has also been commented by @nistara’s review. As I explained, I hardly see what kind of test I could add to my functions for the moment in the way I designed them (i.e. each function take the dataframe and the columns as input to produce the results, making the dataframe loading the limiting step). I am currently working on testing solutions and will try to propose relevant test with testthat for the next corrections.

Minor comments:

I will make a full review of the style according to your resources once major corrections will be done, using styler as a base and controlling each file manually.
11 will be integrated in the tutorial/guide.
A download button has been added for data tables.

adithirgis commented 4 years ago

Hi @VincentAlcazer!

Thanks for the response! I will take a look again and wrap the review process once the tutorial is updated.

Take care, Adithi

VincentAlcazer commented 4 years ago

Dear @adithirgis ,

I hope your are doing well.

I wanted to inform you that I managed to correct the three remaining points of the review. You will find in the updated v1.03:

A correction of the GAM/LOESS and descriptive table issue
A quick-start user guide with usage example (can be found in the readme or here: https://github.com/VincentAlcazer/StatAid/blob/master/STATAID_QUICK_START_USER_GUIDE.pdf)
Implemented test for each function of the package.

Thanks again for your time and comments. I'm looking forward to your feedback.

Best regards,

Vincent

adithirgis commented 4 years ago

Hi @VincentAlcazer!

Sorry for getting back late.

Thanks again for all the changes, the app has been improved a lot. I see that all the comments have been implemented. I will wrap the review process soon.

Could you please share what changes did you make to fix the GAM/LOESS analysis (it works on both Cloud and local system now!)? Thanks for such a wonderful user guide! Tests also seem to be added for functions! Please continue adding the tests to the test folder to improve the application time and again.

Regards, Take care! Adithi

adithirgis commented 4 years ago

Hi @VincentAlcazer!

Congratulations for the amazing application. I will wrap the review process now.

All the best and take care! Adithi

VincentAlcazer commented 4 years ago

Hi @VincentAlcazer!

Sorry for getting back late.

Thanks again for all the changes, the app has been improved a lot. I see that all the comments have been implemented. I will wrap the review process soon.

Could you please share what changes did you make to fix the GAM/LOESS analysis (it works on both Cloud and local system now!)? Thanks for such a wonderful user guide! Tests also seem to be added for functions! Please continue adding the tests to the test folder to improve the application time and again.

Regards, Take care! Adithi

The broom::augment function was not supporting the fit object from GAM so I had to manually recode the augment dataframe output by my function. However, I still do not understand why it worked on shinyapps.io and on my local computer...! Thanks for providing me the Rcloud resource that helped me fix this!

VincentAlcazer / StatAid

JOSS Review #1