Response to the reviews

showcy commented 2 years ago

I am not sure whether we need to write a summary of the changes we made based on the reviews, but here is my draft. You can add details, and then we can reply to the reviewers on that issue page.

Thank you for all of your comments! We appreciate them, and we agreed with and implemented several of them.

About report

From: @ytz and @nrao944

Useful to briefly mention the number of data points on your data set, under the 'Data' subheading.

Is it possible to provide dimensions of the data (total number of observations) for each of the groups in the README and the Data Section of the Report? It is noted that you have this available in your discussion.

Our implementation:

Add the data dimensions to the "Data" section. @93402f8

From: @adrianne-l

In the report Results & Discussion section, would it be a good idea to include some sub-sections to summarise the interim findings to better navigate and follow your flow of result interpretation?

Our implementation:

Add more details to help the reader follow our flow of interpretation. @93402f8

From: @nrao944

Your alternative hypothesis in the report should insert "not" equal, else it exactly matches the null hypothesis.

In your report, the number of repeats, appears as N_REPEATS, and not a number.

Our implementation:

Fix the typos. @9c3adc6 and @b6f3a8c

About data visualization

From: @ytz

Not 100% sure whether the use of 'confidence interval' is correct in "...we noted the large overlap in the confidence intervals between the two groups"

For Figure 2, consider using a log scale on the x-axis to make the box-plots more prominent

Since the focus is on the indigenous group, you could use a monotone colour for the non-indigenous group, and a primary colour like red or blue for the indigenous group. That will make it easier for the reader to interpret the chart

Our implementation:

Change the wording to "we noted the large overlap in the quantiles between the two groups". @93402f8
Change the x-axis of the box plot to a log scale. @9eb3585
Change the color of the box plots to highlight the Indigenous group. @4257f08
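
To make the reworded "overlap in the quantiles" statement concrete, here is a minimal sketch of how the overlap between two groups' interquartile ranges could be measured. It is not the code used in the report: the helper names `quartiles` and `iqr_overlap` are hypothetical, and the numbers are made up purely for illustration.

```python
from statistics import quantiles


def quartiles(values):
    """Return (Q1, median, Q3) of a sample, using the inclusive method."""
    q1, q2, q3 = quantiles(values, n=4, method="inclusive")
    return q1, q2, q3


def iqr_overlap(a, b):
    """Length of the overlap between the interquartile ranges of a and b.

    Returns 0.0 when the two ranges do not overlap at all.
    """
    a_q1, _, a_q3 = quartiles(a)
    b_q1, _, b_q3 = quartiles(b)
    return max(0.0, min(a_q3, b_q3) - max(a_q1, b_q1))


# Made-up values for illustration only; these are NOT the project's data.
group_a = [1, 2, 3, 4, 5, 6, 7, 8, 9]
group_b = [3, 4, 5, 6, 7, 8, 9, 10, 11]

print(quartiles(group_a))
print(quartiles(group_b))
print(iqr_overlap(group_a, group_b))
```

The box of a box plot spans Q1 to Q3, so this is the overlap a reader sees between the two boxes; unlike a confidence interval, it describes the spread of the data rather than the uncertainty of an estimate.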

Review of Milestone 1 from TA @Ivyqiuhan

This line can't run: df_init = pd.read_csv('../data/offender_profile.csv', sep=r'\s,\s', header=0, encoding='ascii', engine='python') because your file is at this path '../data/RAW/offender_profile.csv'

I can't run your code; it raises KeyError: 'Sentence Type' at cell 10

Our implementation:

Fix with Makefile in Milestone 3. - @943094d
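
For reference, both failures above (a wrong file path and a missing 'Sentence Type' column) can be caught early with a small defensive loader. This is only a sketch under stated assumptions, not what the Makefile fix does: `load_rows` is a hypothetical helper, and it uses the standard-library `csv` module instead of the `pd.read_csv` call quoted in the review, so its parsing of whitespace around commas is not identical.

```python
import csv
from pathlib import Path


def load_rows(csv_path, required_columns):
    """Read a CSV file into a list of dicts, failing early with clear errors.

    Hypothetical helper for illustration; not part of the project.
    """
    path = Path(csv_path)
    if not path.is_file():
        # Report the exact path that was tried, so a wrong folder is obvious.
        raise FileNotFoundError(f"Expected data file at {path}")

    with path.open(newline="", encoding="ascii") as handle:
        reader = csv.DictReader(handle, skipinitialspace=True)
        # Strip stray whitespace so 'Sentence Type ' matches 'Sentence Type'.
        header = [name.strip() for name in (reader.fieldnames or [])]
        missing = [col for col in required_columns if col not in header]
        if missing:
            raise KeyError(f"Missing columns {missing}; found {header}")
        reader.fieldnames = header
        return [dict(row) for row in reader]


# Example call, using the path and column name quoted in the review:
# rows = load_rows("../data/RAW/offender_profile.csv", ["Sentence Type"])
```

Checking the path and the header up front turns a KeyError several cells later into an error message that names the missing file or column directly.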

If figure captions are not provided the plot should be clearly explained in the text. I would recommend using figure captions.

Our implementation:

Add detailed captions. - @fbfc939

Need to add some explanation to the plot and your code

Our implementation:

Try to add more explanation throughout all commits.

Review of Milestone 2 from TA @Ivyqiuhan

You should create an environment.yaml file to contain all your dependencies

Our implementation:

Add the environment.yaml. - @9f728ce

In usage, should write how to run each of your scripts, not just "make all" and "make clean"

Our implementation:

Put our old step-by-step usage back into the README. - @8ae2880

Ivyqiuhan commented 2 years ago

Looks good to me, thanks!

AraiYuno commented 2 years ago

Thanks @showcy !

UBC-MDS / inference_on_indigenous_vs_non_indigenous_sentence_length_differences