DSCI-310-2024 / DSCI_310_Group_9_NY-airbnb-analysis

Other
4 stars 0 forks source link

Feedback addressed #66

Closed rashiselarka closed 7 months ago

rashiselarka commented 7 months ago
          ## Data analysis review checklist

Reviewer:

Conflict of interest

Code of Conduct

General checks

Documentation

Code quality

Reproducibility

Analysis report

Estimated hours spent reviewing: 3.5

Review Comments:

Please provide more detailed feedback here on what was done particularly well, and what could be improved. It is especially important to elaborate on items that you were not able to check off in the list above.

1) Enhancing Readability and Efficiency Repository Cleanup: I would consider removing unnecessary files/folders like .local and .ipython, and possibly reorganizing or removing the eda_files folder for clearer navigation. This aligns with making the repository more streamlined and user-friendly. I suggest to split complex functions into simpler, single-responsibility functions for better maintainability. For example, rank_correlations could be divided into functions focused on matrix flattening and duplicate name removal. Additionally, there's a call for more comprehensive testing, including cases of erroneous inputs, to ensure robust error handling. We should always update the documentation to reflect current repository structure and instructions for using Docker environments. Highlighting the need for a detailed CONTRIBUTION guide and clear instructions on accessing and using the Docker container to facilitate contributions and project setup.

2) Prioritizing user experience and reproducibility and enhance visual clarity: Consider include relevant visualizations in the README with descriptive captions to improve understanding and engagement. This suggests that while the project's visual aspects are strong, their integration and presentation can be optimized for better impact. Docker Utilization and Guidance: Having detailed Docker instructions, particularly for newcomers to Docker is really important. This includes comprehensive steps for setting up, accessing, and using the Docker environment, which is critical for ensuring that all users, regardless of their familiarity with Docker, can easily reproduce the analysis and navigate the project.

3) Reproducibility and Hard-coded Values: There’s the issue of hard-coded values in reports, we need dynamic value references to enhance reproducibility and ease of updates. Please try to avoid redundant code through proper imports and maintaining an updated issue tracker to reflect the current status of the project accurately. This not only aids in project organization but also in the efficiency and clarity of collaboration.

Attribution

This was derived from the JOSE review checklist and the ROpenSci review checklist.

Originally posted by @lucyliuyihong in https://github.com/DSCI-310-2024/data-analysis-review-2024/issues/9#issuecomment-2041982221