fatihsolmaz22 / Bachelorarbeit_FS22

Duplicates in fwg_composition_data_IDP.csv #48

Open kunman93 opened 2 years ago

kunman93 commented 2 years ago

row 54760 - 54769:
2022-03-31T21:00:00Z,NA,9.38,1.32,66,0,0,NA,NA,NA,NA,NA,22,NA,NA,NA,fwg-butcher-uster
2022-03-31T22:00:00Z,NA,8.17,1.32,69,0,0,NA,NA,NA,NA,NA,22,NA,NA,NA,fwg-butcher-uster
2022-03-31T23:00:00Z,NA,7.02,1.19,73,0,0.28,NA,NA,NA,NA,NA,22,NA,NA,NA,fwg-butcher-uster

row 1040593 - 1040593:
2022-03-31T21:00:00Z,NA,9.38,1.32,66,0,0,NA,NA,NA,NA,NA,22,NA,NA,NA,fwg-butcher-uster
2022-03-31T22:00:00Z,NA,8.17,1.32,69,0,0,NA,NA,NA,NA,NA,22,NA,NA,NA,fwg-butcher-uster
2022-03-31T23:00:00Z,NA,7.02,1.19,73,0,0.28,NA,NA,NA,NA,NA,22,NA,NA,NA,fwg-butcher-uster

Also, for some restaurants we don't have the data at all (shown as NA in Excel): 'fwg-negishi-archhoefe', 'fwg-butcher-badenerstrasse', 'fwg-nooch-badenerstrasse', ...

mar-wir commented 2 years ago

Yes, bravo! Indeed, two tenants are in there twice. It was a dumb mistake on my end, but it is a nice exercise for you. pandas/dplyr have all the methods you need to deal with duplicates :D Ask if you need help.
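For reference, a minimal pandas sketch of one way to do this, not necessarily how the final cleanup should look. It uses the three rows quoted above, repeated once to mimic the duplication. The column names (`timestamp`, `col_1` ... `col_15`, `tenant`) are placeholders I made up, since the real header of fwg_composition_data_IDP.csv is not shown in this issue.

```python
import io

import pandas as pd

# The three rows quoted in this issue, repeated once to mimic the duplication.
rows = [
    "2022-03-31T21:00:00Z,NA,9.38,1.32,66,0,0,NA,NA,NA,NA,NA,22,NA,NA,NA,fwg-butcher-uster",
    "2022-03-31T22:00:00Z,NA,8.17,1.32,69,0,0,NA,NA,NA,NA,NA,22,NA,NA,NA,fwg-butcher-uster",
    "2022-03-31T23:00:00Z,NA,7.02,1.19,73,0,0.28,NA,NA,NA,NA,NA,22,NA,NA,NA,fwg-butcher-uster",
]
csv_text = "\n".join(rows + rows)

# Placeholder column names (assumption): the real header is not part of this issue.
columns = ["timestamp"] + [f"col_{i}" for i in range(1, 16)] + ["tenant"]
df = pd.read_csv(io.StringIO(csv_text), header=None, names=columns)

# Count rows that are exact repeats of an earlier row (NA cells compare as equal here).
n_dupes = int(df.duplicated().sum())

# Keep the first occurrence of each row and renumber the index.
deduped = df.drop_duplicates(keep="first").reset_index(drop=True)

print(f"duplicate rows: {n_dupes}, rows after cleanup: {len(deduped)}")
```

On the real file you would read the CSV with its own header instead. If only some columns should define a duplicate (for example timestamp plus tenant), `drop_duplicates` accepts a `subset=` argument.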

mar-wir commented 2 years ago

If it's NA, we don't have the data either... If reopened because of that....
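To see quickly which columns are missing for which tenant, something like the sketch below might help. It is only an illustration on the three rows quoted at the top of this issue, and the column names are again made-up placeholders, not the real header.

```python
import io

import pandas as pd

# The three rows quoted in this issue (one tenant only).
csv_text = "\n".join([
    "2022-03-31T21:00:00Z,NA,9.38,1.32,66,0,0,NA,NA,NA,NA,NA,22,NA,NA,NA,fwg-butcher-uster",
    "2022-03-31T22:00:00Z,NA,8.17,1.32,69,0,0,NA,NA,NA,NA,NA,22,NA,NA,NA,fwg-butcher-uster",
    "2022-03-31T23:00:00Z,NA,7.02,1.19,73,0,0.28,NA,NA,NA,NA,NA,22,NA,NA,NA,fwg-butcher-uster",
])

# Placeholder column names (assumption): the real header is not part of this issue.
columns = ["timestamp"] + [f"col_{i}" for i in range(1, 16)] + ["tenant"]
df = pd.read_csv(io.StringIO(csv_text), header=None, names=columns)

# Fraction of NA cells per column, computed separately for each tenant.
values = df.drop(columns=["timestamp", "tenant"])
na_share = values.isna().groupby(df["tenant"]).mean()

# Columns that are entirely NA for a tenant have a share of 1.0.
print(na_share.T)
```

A share of 1.0 means that column has no data at all for that tenant, which is the case we cannot fix on our side.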