kferris10 / Stat-575-writing-project

A GitHub repository for my writing project at Montana State
http://kferris10.github.io/Stat-575-writing-project/
1 stars 1 forks source link

Parts of Paper to Update #2

Open kferris10 opened 9 years ago

kferris10 commented 9 years ago

Things I still need to update

  1. Explanation of Data Cleaning
  2. Explanation of Multiple Comparisons
kferris10 commented 9 years ago

Things to add about data cleaning

  1. Removed any rows where pitch locations are missing
  2. Regular season only
  3. Removed innings where a batter batted twice (i.e. the team hit around). This happened about 1% of the time in 2014. Did this because I couldn't figure out how else to link the batter's outcome to his on base status later in the inning. See lines 34-50 of code/clean-pitchfx-data/2-clean-pitchfx.R
  4. Only using rows where somebody is on base
  5. Removing rows with a pinch runner (?)