We have to establish what we want our dataset to actually contain.
I have listed a number of potential data sources in the project's README.md.
I think it is important that we rely on types of data that are up to date and/or can be easily collected in the future, e.g. weather information, or score progression during a match, like the left-side panel here, which shows play-by-play actions. For historical and live betting odds I propose this website, for which there already seem to be scrapers we can take inspiration from.
So far I have identified these types of data:
Betting odds
Match scores
Play-by-play series
Weather data (since it might affect each player differently)
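To make the discussion more concrete, here is a rough sketch of how these four data types could hang together in a single record per match. Every class and field name below is a placeholder I made up for illustration, not an agreed schema, and the units (decimal odds, Celsius, km/h) are assumptions as well.

```python
from dataclasses import dataclass, field
from datetime import datetime
from typing import List, Optional


@dataclass
class OddsSnapshot:
    """One bookmaker quote at a point in time (hypothetical layout)."""
    timestamp: datetime
    bookmaker: str
    odds_a: float  # assumed decimal odds for player A
    odds_b: float  # assumed decimal odds for player B


@dataclass
class PlayEvent:
    """One play-by-play step: who scored and the running score after it."""
    sequence: int
    scorer: str  # "A" or "B"
    score_a: int
    score_b: int


@dataclass
class Weather:
    """Conditions at the venue (units are assumptions)."""
    temperature_c: float
    wind_kph: float
    humidity_pct: float


@dataclass
class MatchRecord:
    """All data collected for a single match."""
    match_id: str
    player_a: str
    player_b: str
    final_score: str
    odds: List[OddsSnapshot] = field(default_factory=list)
    plays: List[PlayEvent] = field(default_factory=list)
    weather: Optional[Weather] = None


def missing_sources(record: MatchRecord) -> List[str]:
    """List which optional data sources are still empty for a match."""
    missing = []
    if not record.odds:
        missing.append("odds")
    if not record.plays:
        missing.append("play_by_play")
    if record.weather is None:
        missing.append("weather")
    return missing
```

A helper like `missing_sources` would let us see at a glance which matches still lack odds, play-by-play or weather data while the scrapers are being built.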