itep-data / R-questions

:question: :hand: Ask your R questions here
0 stars 0 forks source link

Lost on what to do #4

Open User29293 opened 2 years ago

User29293 commented 2 years ago

(Q1) Cluster the U.S. cities based on the occupational employment composition. You can determine the number of clusters. If needed, drop all observations with missing values.

(Q2) Create a scatter plot with Production Occupations on the x axis and Management Occupations on the y axis. Color the observations based on the clusters revealed in the previous question. Hint: When you specify a variable name with more than one word, it should be enclosed with ` (not ' ') (e.g.,Management Occupations`).

(Q3) Add title to the plot and data labels (city names) only to NC cities (whose state is either "NC" or "NC-SC"). Your name should be included in the plot title. See Canvas for an example. Note that visual details will not be accounted for grading.

(Q4) Create more scatter plots (at least two) for other occupation groups than management and production as in Q2 and Q3. Briefly discuss the characteristics of occupational structures of NC cities below (Must use the comments).

Link to data: https://www.bls.gov/oes/tables.htm

May 2019 | Metropolitan and nonmetropolitan area | XLS is the file

inside the zip file there are 3 files the one for these questions is named MSA_M2019_dl

Direct Download link: https://www.bls.gov/oes/special.requests/oesm19ma.zip