davescroggs / footyStats

Standardise column data across both tables #4

Open davescroggs opened 2 years ago

davescroggs commented 2 years ago

Standardise both values and data types, e.g. round should be a character/integer.

davescroggs commented 2 years ago

In AFL Tables the date is provided in two columns, one for date and one for time. It may also use a different reference from the AFL data.
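
One way to bring the two AFL Tables columns into a single value is sketched below in Python. The function name `combine_date_time` and both format strings are assumptions for illustration only; the actual column formats are not stated in this thread and would need checking against the data. The result is a naive datetime, so any difference in time reference between the two sources is deliberately left unresolved.

```python
from datetime import datetime


def combine_date_time(date_str, time_str,
                      date_fmt="%d-%b-%Y", time_fmt="%I:%M %p"):
    """Merge separate date and time strings into one naive datetime.

    The default formats (e.g. "18-Mar-2021" and "7:25 PM") are assumed,
    not taken from the data. No timezone is attached.
    """
    d = datetime.strptime(date_str.strip(), date_fmt).date()
    t = datetime.strptime(time_str.strip(), time_fmt).time()
    return datetime.combine(d, t)


kickoff = combine_date_time("18-Mar-2021", "7:25 PM")
print(kickoff.isoformat())
```

If the two sources turn out to use different time references, the conversion would belong after this step, once both are single datetime columns.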

davescroggs commented 2 years ago

Will use AFL data as the standard and change all others to match. For team names, the mapping between the two tables is below.

| teamAflData | teamAflTbls |
| --- | --- |
| Adelaide Crows | Adelaide |
| Brisbane Lions | Brisbane Lions |
| Carlton | Carlton |
| Collingwood | Collingwood |
| Essendon | Essendon |
| Fremantle | Fremantle |
| Geelong Cats | Geelong |
| Gold Coast Suns | Gold Coast |
| GWS Giants | Greater Western Sydney |
| Hawthorn | Hawthorn |
| Melbourne | Melbourne |
| North Melbourne | North Melbourne |
| Port Adelaide | Port Adelaide |
| Richmond | Richmond |
| St Kilda | St Kilda |
| Sydney Swans | Sydney |
| West Coast Eagles | West Coast |
| Western Bulldogs | Western Bulldogs |

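
The team-name mapping can be applied as a plain lookup. A minimal sketch in Python, converting AFL Tables names to the AFL data names used as the standard; the names `AFLTABLES_TO_AFLDATA` and `standardise_team` are made up for illustration, and raising on unknown names is a design choice rather than anything decided in this issue.

```python
# AFL Tables name -> AFL data name (AFL data is the standard).
AFLTABLES_TO_AFLDATA = {
    "Adelaide": "Adelaide Crows",
    "Brisbane Lions": "Brisbane Lions",
    "Carlton": "Carlton",
    "Collingwood": "Collingwood",
    "Essendon": "Essendon",
    "Fremantle": "Fremantle",
    "Geelong": "Geelong Cats",
    "Gold Coast": "Gold Coast Suns",
    "Greater Western Sydney": "GWS Giants",
    "Hawthorn": "Hawthorn",
    "Melbourne": "Melbourne",
    "North Melbourne": "North Melbourne",
    "Port Adelaide": "Port Adelaide",
    "Richmond": "Richmond",
    "St Kilda": "St Kilda",
    "Sydney": "Sydney Swans",
    "West Coast": "West Coast Eagles",
    "Western Bulldogs": "Western Bulldogs",
}


def standardise_team(name):
    """Return the AFL data name for an AFL Tables team name.

    Raises KeyError for an unmapped name, so a new or misspelt team
    is noticed instead of passing through unchanged.
    """
    return AFLTABLES_TO_AFLDATA[name.strip()]


print(standardise_team("Greater Western Sydney"))
```
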
davescroggs commented 2 years ago

Round data is tricky because of the COVID-affected year: GF is round 22 in AFL data. Round numbers and round names do not match one-to-one across all years. The best approach at this stage is an ordinal character column, so that greater-than filtering still works.
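
To illustrate why the order has to be explicit: as plain strings, "10" sorts before "9", so a character column needs a defined ordering before greater-than filtering behaves. A rough sketch in Python, where `MAX_HOME_AND_AWAY`, `ROUND_ORDER` and `rounds_after` are hypothetical names and the count of 23 home-and-away rounds is an assumption that varies by season:

```python
# Assumed upper bound on home-and-away rounds; varies by season.
MAX_HOME_AND_AWAY = 23

# Rounds as character labels in their intended order:
# home-and-away rounds first, then the AFL data finals names.
ROUND_ORDER = [str(n) for n in range(1, MAX_HOME_AND_AWAY + 1)] + [
    "Finals Week 1",
    "Semi Finals",
    "Preliminary Finals",
    "Grand Final",
]
ROUND_RANK = {label: i for i, label in enumerate(ROUND_ORDER)}


def rounds_after(rows, threshold):
    """Keep rows whose round comes strictly after `threshold` in ROUND_ORDER."""
    cutoff = ROUND_RANK[threshold]
    return [row for row in rows if ROUND_RANK[row["round"]] > cutoff]


matches = [{"round": "2"}, {"round": "9"}, {"round": "10"}, {"round": "Grand Final"}]
print(rounds_after(matches, "9"))
```

Because the comparison uses position in the ordering, not the label text, a Grand Final still ranks last in a shortened season even though its round number differs.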

davescroggs commented 2 years ago

Mapping rounds as follows.

| AFLTables | AFLData |
| --- | --- |
| EF | Finals Week 1 |
| QF | Finals Week 1 |
| SF | Semi Finals |
| PF | Preliminary Finals |
| GF | Grand Final |

I don't particularly like losing the implicit information in elimination and qualifying finals, which get lumped into one round name. Maybe something to look at later.
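
One possible way to keep that information, sketched in Python: apply the round mapping but return the original AFL Tables code alongside the standardised name. The names `AFLTABLES_TO_AFLDATA_ROUND` and `standardise_round`, the extra finals-code value, and passing non-finals labels through unchanged are all assumptions for illustration, not something settled in this issue.

```python
# AFL Tables finals code -> AFL data round name.
AFLTABLES_TO_AFLDATA_ROUND = {
    "EF": "Finals Week 1",
    "QF": "Finals Week 1",
    "SF": "Semi Finals",
    "PF": "Preliminary Finals",
    "GF": "Grand Final",
}


def standardise_round(code):
    """Return (round_name, finals_code) for an AFL Tables round label.

    finals_code keeps the original code (e.g. "EF" vs "QF") so the two
    week-one finals types stay distinguishable. Labels that are not
    finals codes are passed through unchanged with finals_code None
    (an assumption about how home-and-away rounds are labelled).
    """
    code = code.strip()
    if code in AFLTABLES_TO_AFLDATA_ROUND:
        return AFLTABLES_TO_AFLDATA_ROUND[code], code
    return code, None


print(standardise_round("EF"))
print(standardise_round("QF"))
```

Stored as a second column, the finals code would leave the round name matching AFL data while still allowing a filter on elimination or qualifying finals specifically.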