beoutbreakprepared / nCoV2019

Location for summaries and analysis of data related to n-CoV 2019, first reported in Wuhan, China
MIT License

better performance of the dedupe func #68

Closed attwad closed 4 years ago

attwad commented 4 years ago

`df.append` copies the whole DataFrame and returns a new one, which means we were previously copying the entire Europe data once for every aggregate row.

Also fixes an off-by-one error: rows with "1" as their aggregate number are no longer concatenated, since duplicating them is useless.

With this change, the Europe sheet, which previously hung, now finishes in just a few seconds on my machine.
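
For readers who want to see the pattern in isolation, here is a minimal sketch of the idea described above, not the actual code from this PR. The helper name `expand_aggregated_rows`, the column name `aggregate_count`, and the sample data are all hypothetical. It collects the extra row copies in a plain list and calls `pd.concat` once at the end, and it adds only `n - 1` copies so that a count of 1 adds nothing.

```python
import pandas as pd


def expand_aggregated_rows(df: pd.DataFrame, col: str) -> pd.DataFrame:
    """Return df with each row repeated so it appears `col` times in total.

    Hypothetical helper: the column name and its meaning are assumptions
    made for illustration, not taken from the repository.
    """
    # Treat missing or non-numeric counts as 1 (an assumption of this sketch).
    counts = pd.to_numeric(df[col], errors="coerce").fillna(1).astype(int)

    extra = []  # extra copies are gathered here instead of appended to df
    for pos, n in enumerate(counts):
        # The row is already present once, so only n - 1 copies are needed.
        # A count of 1 therefore adds nothing.
        if n > 1:
            extra.extend([df.iloc[[pos]]] * (n - 1))

    if not extra:
        return df.copy()

    # A single concatenation copies the data once, not once per aggregate row.
    return pd.concat([df, *extra], ignore_index=True)


# Made-up sample data, for illustration only.
sheet = pd.DataFrame(
    {"city": ["Paris", "Rome", "Oslo"], "aggregate_count": ["3", "1", "2"]}
)
expanded = expand_aggregated_rows(sheet, "aggregate_count")
```

Growing a DataFrame inside a loop costs time roughly quadratic in the number of rows, because each step copies everything accumulated so far. Building a list and concatenating once avoids that. Note also that `DataFrame.append` was deprecated in pandas 1.4 and removed in pandas 2.0, so `pd.concat` is the supported route in current versions.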