UBC-MDS / data-analysis-review-2022

0 stars 1 forks source link

Submission: GROUP 11: Horror Movie Revenue and Ratings #22

Open AguilarRaul opened 1 year ago

AguilarRaul commented 1 year ago

Submitting authors: J99thoms, Lorraine97, AguilarRaul, Hongjian-Sam-Li

Repository: https://github.com/UBC-MDS/horror_movies Report link: https://github.com/UBC-MDS/horror_movies/blob/main/notebooks/EDA_keys.ipynb Abstract/executive summary:

inferential research question is whether 'high' rated horror movies have a larger median revenue than 'low' rated horror movies (among those with non-zero revenue).

Considering only horror movies with non-zero revenue, let $R_h$ be the population median revenue (in USD) of horror movies with average ratings greater than the median average rating of horror movies, let $R_l$ be the population median revenue (in USD) of horror movies with average ratings no greater than the median average rating of horror movies, and let $\delta = R_h - R_l$ be the difference in population median revenues. Then our hypotheses are:

$\text{H}_0:\ \delta = 0$ and $\text{H}_a:\ \delta > 0.$

Our significance level will be the standard $\alpha = 0.05$.

Our test statistic will be the difference in sample median revenues, $\delta^* = \hat{R}_h - \hat{R}_l$.

Since we are doing inference about the median, a CLT-based approach is not applicable here. Thus we will be using the simulation-based approach for this hypothesis test. In particular, we will use a permutation test. This makes the assumption that our sample is a good representative sample of our population of interest.

Editor: @flor14 Reviewer: Roan Rain, Ritisha Sharma, Gaoxiang Wang

ritisha2000 commented 1 year ago

Data analysis review checklist

Reviewer: ritisha2000

Conflict of interest

Code of Conduct

General checks

Documentation

Code quality

Reproducibility

Analysis report

Estimated hours spent reviewing: 1 hour

Review Comments:

Please provide more detailed feedback here on what was done particularly well, and what could be improved. It is especially important to elaborate on items that you were not able to check off in the list above.

  1. The documentation is great, the project is easy to follow and the objective of the project is clear. There are plenty of comments in the scripts which makes the analysis easy to follow. There is a good variety of plots for the EDA.
  2. Some minor edits you can make include changing the names of certain files for clarity. For example, you can change “down_data.R” to “download_data.R”, and change the name of "EDA_keys" so it is clear that it is the report. The x-axis of the plot in the EDA and report files have an axis that is cramped together so it makes it a bit difficult to read.
  3. There is a folder for images, I wondered if that belonged in the results. Also maybe rename the "notebooks" folder to "docs"
  4. The analysis report could be more thorough. You could explain why it would be important to answer this research question in the real-world context.
  5. The tables and plots are great but the results could be more clearly stated. I see the hypotheses but not the conclusion based on the conclusion.

Overall, it is a very well-done and interesting project!

Attribution

This was derived from the JOSE review checklist and the ROpenSci review checklist.

snesunil commented 1 year ago

Data analysis review checklist

Reviewer: snesunil

Conflict of interest

Code of Conduct

General checks

Documentation

Code quality

Reproducibility

Analysis report

Estimated hours spent reviewing: 1.5 hours

Review Comments:

Things that went well :

Suggestions for improvements :

Overall, it is a well-explained and very interesting project. Great work!

Attribution

This was derived from the JOSE review checklist and the ROpenSci review checklist.

louiewang820 commented 1 year ago

Data analysis review checklist

Reviewer: Gaoxiang Wang

Conflict of interest

Code of Conduct

General checks

Documentation

Code quality

Reproducibility

Analysis report

Estimated hours spent reviewing: 1.5 hours

Review Comments:

Overall well done! The code is divided into sections in long scripts that is easy to follow and I have few minor sugesstions:

This is an interesting topic. Good job!

Attribution

This was derived from the JOSE review checklist and the ROpenSci review checklist.

roanraina commented 1 year ago

Data analysis review checklist

Reviewer: @roanraina

Conflict of interest

Code of Conduct

General checks

Documentation

Code quality

Reproducibility

Analysis report

Estimated hours spent reviewing: 1

Review Comments:

Attribution

This was derived from the JOSE review checklist and the ROpenSci review checklist.

AguilarRaul commented 1 year ago

Five pieces of feedback that have been implemented:

  1. [Peer review feedback] In the EDA script, more comments and documentation could be added so that it is easy for the reader to understand the script. Commit URL: https://github.com/UBC-MDS/horror_movies/commit/b95931e7f50162474832746202418d392fbee0b5 https://github.com/UBC-MDS/horror_movies/commit/95a0645f5373fde8ef731929bb4ed8ca21bae7b4 File Changed: src/eda_horror.R

  2. [TA feedback] Figure captions missing Commit URL: https://github.com/UBC-MDS/horror_movies/commit/bfd30bba27fbe09b092ee469cd1ac11537376606 File Changed: src/inference_horror.R

  3. [TA feedback] The proposal.md file has been created and moved to the doc directory -2 mechanics Commit URL: https://github.com/UBC-MDS/horror_movies/commit/b21a94378f89f94f32fc0732053b754036361c6b File Changed: doc/proposal.md

  4. [TA feedback] Plots suffer from one or more severe problems. For instance, overplotting, missing legend, small text or no axis labels Commit URL: https://github.com/UBC-MDS/horror_movies/commit/b99dbe0c7fd5642c1db521d8155fc955bfc2f64a File Changed: notebooks/Horror_movies_attributes_and_revenue_EDA.ipynb, src/eda_horror.R

  5. [Peer review] The final report was named EDA_keys.ipynb, which made it hard for me to understand which was the final report. The naming of the report file could be improved. (eg : {name of the project}_report.ipynb) Commit URL: https://github.com/UBC-MDS/horror_movies/commit/094d7534728707b4d908465dc50ff674dc55596f File Changed: report.ipynb