rogers1000 / cyclingchaos

Cycling Data Package

Paralympics (Basic) Results #37

Open rogers1000 opened 4 months ago

rogers1000 commented 4 months ago

Need to put in Paralympics results from 2000 onwards, aligned to the same format as the First Cycling Road results df.

rogers1000 commented 4 months ago

Ingestion has been done.

Transformation is complex but will be one of the next priorities.

rogers1000 commented 4 months ago

The way the source tables are built is really odd. Most of it is logical, in the sense that if the data-label is not one thing then it's the other. However, for riders who aren't counted in the results, the data-label turns into "". That missing label is then breaking the df creation.

I'm wondering if the solution could be to work out which column has data filled in and then scrape that column, rather than relying on every cell having the same data-label. Will take some thought but should be possible.
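A rough sketch of that idea, assuming each scraped row is already a list of `(data_label, text)` pairs in column order. The `Cell` shape and the function names `column_labels` and `rows_to_records` are hypothetical and not part of the package. Each column position is named by the first non-empty data-label seen in it, and values are then read by position, so a row whose labels are all "" still lands in the right columns.

```python
from typing import Dict, List, Tuple

# Hypothetical cell shape: (data_label, text) as scraped from one table cell.
Cell = Tuple[str, str]


def column_labels(rows: List[List[Cell]]) -> List[str]:
    """Name each column position by the first non-empty data-label seen there."""
    width = max((len(row) for row in rows), default=0)
    labels = [""] * width
    for row in rows:
        for i, (label, _text) in enumerate(row):
            if label and not labels[i]:
                labels[i] = label
    # Fall back to a positional name if a column never carries a label.
    return [label or f"col_{i}" for i, label in enumerate(labels)]


def rows_to_records(rows: List[List[Cell]]) -> List[Dict[str, str]]:
    """Build one record per row, reading values by position instead of by label."""
    labels = column_labels(rows)
    records = []
    for row in rows:
        # Start with every column present so all records have the same keys.
        record = {label: "" for label in labels}
        for i, (_label, text) in enumerate(row):
            record[labels[i]] = text.strip()
        records.append(record)
    return records


# Example: the second rider is not counted in the results, so every label is "".
example_rows = [
    [("Rank", "1"), ("Rider", "Rider A"), ("Time", "1:02:03")],
    [("", "DNF"), ("", "Rider B"), ("", "")],
]
example_records = rows_to_records(example_rows)
```

Because every record has the same keys, the list can go straight into a df. This assumes the column order is the same in every row and that no two columns share a data-label.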

rogers1000 commented 4 months ago

The info column is not working...

Need to create a list of the cells where the data-label is Info or note, filter it to only include numbers, and then take the value at the first index.
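One way this could look, again assuming a row is a list of `(data_label, text)` pairs. The function name `first_number_from_info` is hypothetical, and treating "numbers" as runs of digits is an assumption about what the Info/note cells contain.

```python
import re
from typing import List, Optional, Tuple


def first_number_from_info(cells: List[Tuple[str, str]]) -> Optional[str]:
    """Return the first run of digits found in cells labelled Info or note."""
    wanted = {"info", "note"}
    # Keep only the text of cells whose data-label is Info or note.
    texts = [text for label, text in cells if label.strip().lower() in wanted]
    # Filter down to the numbers, preserving the order they appear in.
    numbers = [match for text in texts for match in re.findall(r"\d+", text)]
    # Take the first index, or None when no number is present.
    return numbers[0] if numbers else None


example_cells = [("Rider", "Rider A 7"), ("Info", "Stage 3 of 5"), ("note", "12")]
example_value = first_number_from_info(example_cells)
```

Returning `None` rather than raising keeps rows without an Info or note number from breaking the df creation.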

rogers1000 commented 3 months ago

Need to put results into one field, transform stage_number, transform the data to match the FirstCycling results, and then union.
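The union step could be sketched as below, assuming both sets of results are lists of records (dicts). The function name `align_and_union` and the column names in the example are hypothetical; the real FirstCycling column list would need to be substituted. The stage_number transformation is not covered here.

```python
from typing import Dict, List, Optional, Sequence

Record = Dict[str, Optional[str]]


def align_and_union(target_columns: Sequence[str], *tables: List[Record]) -> List[Record]:
    """Reshape every record to the target columns, then stack the tables."""
    combined: List[Record] = []
    for table in tables:
        for record in table:
            # Missing columns become None; columns not in the target are dropped.
            combined.append({col: record.get(col) for col in target_columns})
    return combined


# Hypothetical column names, for illustration only.
road_results = [{"rider": "Rider B", "rank": "2", "stage_number": "3"}]
paralympics_results = [{"rider": "Rider A", "rank": "1", "extra": "x"}]
all_results = align_and_union(
    ["rider", "rank", "stage_number"], road_results, paralympics_results
)
```

Aligning to an explicit column list before stacking means a column missing from the Paralympics data shows up as empty values instead of breaking the union.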

rogers1000 commented 3 months ago

Stage Number and Race Results Notes need transforming.

rogers1000 commented 3 months ago

Basic output now uploaded. Stage Number is still broken, but I need to have a proper think about how to solve it.