UBC-MDS / DSCI-522_City_of_A-Stars_310

MIT License
0 stars 3 forks source link

To Love or Not to Love: Project Proposal

About

With this project, we attempt to run inferential analysis to determine if there is any significant difference between the total grades of secondary school students who are in a relationship and those who are not.
The analysis is conducted on the Student Performance Dataset from the UCI Machine Learning Repository (https://archive.ics.uci.edu/ml/datasets/student+performance)

This data set details student performance indicators (grades) of secondary school students for two courses in the form of two data sets, one for Maths and one for Portugese, along with 30 features spanning information pertaining to school activities, social behaviour and family background. The data set has been compiled using school reports and questionnaires answered by secondary school students in Portugal.

References:

Research Questions

The analysis will focus on answering the following inferential research questions:

Report

The final report can be found here

Usage

There are two ways in which this analysis can be replicated:

Option 1: Using Docker

  1. Install Docker
  2. Clone this GitHub repository
  3. Navigate to the root directory of the project in terminal
  4. Run docker run --rm -v /$(pwd):/usr/src/project manu2856/mds_workflows_g310 make -C /usr/src/project clean at the terminal
  5. Run docker run --rm -v /$(pwd):/usr/src/project manu2856/mds_workflows_g310 make -C /usr/src/project all at the terminal

note: Windows users can run the following commands in Bash in place of steps (4) and (5) above docker run --rm -v ${PWD}:/usr/src/project manu2856/mds_workflows_g310 bash -c "make -C /usr/src/project clean" docker run --rm -v ${PWD}:/usr/src/project manu2856/mds_workflows_g310 bash -c "make -C /usr/src/project all"

Option 2: Without using Docker

  1. Clone this GitHub repository
  2. Install the dependencies listed below
  3. Navigate to the root directory of the project in terminal
  4. Run make all at the command line/terminal
  5. To reset the repo to a clean state, with no intermediate or results files, run the make clean at the command line/terminal

Makefile Dependency Diagram

Following is a map of how each file in project directory is connected:

Dependencies

Python 3.7.3 and Python packages:

R version 3.6.1 and R packages: