UtopianYoungChung / census06-21

0 stars 0 forks source link

Data mining #2

Closed UtopianYoungChung closed 9 months ago

UtopianYoungChung commented 9 months ago

This is a reminder that we are now in the data mining stage of the project. You are doing 2021, and I am responsible for the 06 data. Our 8 variables are:

  1. Number of People and Household
  2. Age Structure
  3. Household Type and Composition
  4. Visible Minority and Ethnic Composition
  5. Income and Poverty Levels
  6. Educational Attainment
  7. Labor Force Characteristics
  8. Housing Stock Composition

After you finish the mining, please upload your data under the data folder of the repository. Good luck, and have fun~!

UtopianYoungChung commented 9 months ago

Look at these Excel files, which I downloaded from the course page. I am currently doing data cleaning on these files. I will let you know once I am done cleaning and if we can just stick with these files. Otherwise, we have to follow the instructions for week 2 lab; the link to the lab is below: https://q.utoronto.ca/courses/310759/discussion_topics/2305493

Also, I have attached the instruction document for the Census data from lab 2, so take a look at the document while I am working on the cleaning. census06_data.csv census21_data.csv Fall 2023-Lab 2 - Accessing Census Data.docx

UtopianYoungChung commented 9 months ago

Based on today's lecture, we need to adjust our methods on assignment 1 (note: new due date Oct 10).

Below is the link to the table for the weight measurement of the Census of 06-21: https://github.com/jamaps/CLTD/blob/master/crosswalk_tables/ct_2006_2021.csv

Accordingly, we need manual labor to generate data that we can work with, whether on R or Excel. Let's discuss more tomorrow morning; however, in the meantime, I will continuously work on data mining to get the data we need to work with; hence, you don't need to worry about the data mining until our discussion tomorrow.

UtopianYoungChung commented 9 months ago

Let's work on creating stories from the data

census21.xlsx

CT0012_06.xlsx

UtopianYoungChung commented 9 months ago

Here, I am adding three csv files where you can find the information and fill in the data where necessary:

  1. This is where you need to add your findings: mined_data.xlsx

  2. 2006 ct_data06.xlsx

  3. 2021 census21.xlsx