jsoma / playfair-projects

Common repository of projects for Playfair
6 stars 32 forks source link

Racial bias in supporting students #235

Closed gcgruen closed 8 years ago

gcgruen commented 8 years ago

Scope:

Collaborating with @kromreig and @HarshaDevulapalli , I would like to provide visuals for some of the claims made by the Black Lives Matter movement:

"Black students are

  1. less likely to be placed in gifted and talented programs
  2. less likely to attend schools that offer advanced coursework
  3. more likely to attend schools with less qualified educators, and employ law enforcement officers but no counselors."

    (Ugly) Proof of viability:

To see whether I can get sth out of the data, I focused on 1) the enrolment in gifted and talented programs Male Students: gt_bm gt_wm

Female Students: gt_b_f gt_w_f

Data:

The dataset is huge (more than 95,000 rows), so only working with a relevant subset of columns The overall dataset can be found here

Headlines:

gt_wb_m

That's why I was looking around for alternatives. In the description for hexagonal bin plots it says that it is convenient for very dense datapoints -- so this case, I guess? How do people feel about this kind of graph? Maybe not with a sequential scale (as the example in the documentation, but a diverging one)

Story checklist:

My pitch issue was: #219

kbennion commented 8 years ago

Not having been at the lecture with Richard, hexagonal graphs are kind of tough for the layperson to understand at an entry level -- I think it depends on who your target audience is for these vizes. Would it be possible to somehow narrow your dataset further, like for whom the data is bad, or somehow aggregate by income level or school size or region or something? Maybe make your dots smaller?

Does the dataset identify schools, or is it anonymous? (It might be interesting to investigate some of the outliers if so. Maybe for another project.)

miaomiaorepo commented 8 years ago

It's a great idea to show black students support in education equality. Since the scatter points overlap a lot, it might be better to make bar charts which show the total number of enrolled white/black students. If you would like to describe the enrolment ratio, you may want to calculate first, then make a line graph about the white/black ratio variation in the past decade(whatever period you want). I'm also very curious about the outliers.

miaomiaorepo commented 8 years ago

http://www.nytimes.com/interactive/2016/04/29/upshot/money-race-and-success-how-your-school-district-compares.html?_r=0

Hope it will help.

gcgruen commented 8 years ago

Thank you @shuyao1201 :)

gcgruen commented 8 years ago

Update: First graphic gt-960-01

ghost commented 8 years ago

I love how the visuals have taken a really different turn as we have understood this dataset better! Hurrah for figuring out what works. Overall I think it's great, and it really clearly communicates the patterns you found.

I dislike the bolded words in the annotations, I think it's distracting and doesn't add to the clarity of the explanation. The ticked lines seem a little odd. Are they separated, or -*- or somesuch thing? They just look weird to me :/

gcgruen commented 8 years ago

Thanks for your feedback! I changed most of the boldness and also the stroked line weight. That really made it better, so thanks for pointing this out! gt-960-03

gcgruen commented 8 years ago

Another form of graphical representation of the same data -- slight difference: includes the absolute counts of schools. Disadvantage: less statistical metrics. Go with both? gt-histogram-01

gcgruen commented 8 years ago

Updated existing graphics with corrected description and adding further graphics on advanced coursework boxplot_gifted_talented-960-02 gt-histogram-02 boxplot_advanced_coursework-960-01 ac-histogram-960-01

ghost commented 8 years ago

(more data on police in schools, and how the decision is made to place an officer in a school)

From the Department of Justice:

School Safety Data Sources There are a number of national sources of school safety data. Data are often broken into categories, such as urban/rural; age groups; male/female. These can be helpful in identifying where a school stands compared to other schools with similar characteristics. National data sources: • National Crime Victimization Survey (NCVS) • School Crime Supplement (SCS) to the National Crime Victimization Survey • School Survey on Crime and Safety (SSOCS) • Schools and Staffing Survey (SASS) • Youth Risk Behavior Surveillance System (YRBSS) For state data, the state attorney general or child services agency can often provide information. Locally, school districts, law enforcement, social service agencies, and colleges and universities can be useful sources.

gcgruen commented 8 years ago

Thanks for the input. I think I'll leave it to a future project to combine this dataset with another dataset ;) Also, as I understand from the documentation, the next release with the data of 2015/2016 will include the actual count of law enforcement officers plus the count for security guards.

gcgruen commented 8 years ago

Adding last graphic on law enforcement officers in relation to racial breakdown of student population. lawofficers-960-01

playfairbot commented 8 years ago

Closing since pull request #265 has been accepted