ryanvmenezes / tiepredict

Forecasting win probability in European club football
https://ryanvmenezes.github.io/tiepredict/
MIT License
0 stars 0 forks source link

all places in R code where the data has been filtered #34

Closed ryanvmenezes closed 3 years ago

ryanvmenezes commented 3 years ago

the latest fbref batch update includes a lot of matches going back to the early 90s. not adding them now as it would take too long. need to try adding one year incrementally (and comparing against odds which go back to about 2004).

in scrape of game urls: data-get/fbref/03_scrape-games.R

# downloading

legshtml = legs %>%
  filter(szn >= '2014-2015') %>% 
  filter(!is.na(url)) %>%
  arrange(desc(szn), desc(stagecode)) %>% 
  mutate(html = map(url, getorretrieve.games))

in assembling of team names to join/clean up against oddsportal: data-get/assemble/teams/01_reconcile-team-names.R

summaries %>% 
  filter(szn > '2014-2015') %>% 
  select(starts_with('team')) %>% 
  ...
ryanvmenezes commented 3 years ago

taking care of this on the front end now