datasets / publicbodies

A database of public bodies such as government departments, ministries etc.
http://publicbodies.org
MIT License

Nepal: data has duplicate rows #85

Closed: augusto-herrmann closed this issue 6 years ago

augusto-herrmann commented 6 years ago

Goodtables detected some duplicate rows in Nepal's data:

$ goodtables data/np.csv
DATASET
=======
{'error-count': 2,
 'preset': 'nested',
 'table-count': 1,
 'time': 0.012,
 'valid': False}

TABLE [1]
=========
{'encoding': 'utf-8',
 'error-count': 2,
 'format': 'csv',
 'headers': ['id',
             'name',
             'abbreviation',
             'other_names',
             'description',
             'classification',
             'parent_id',
             'founding_date',
             'dissolution_date',
             'image',
             'url',
             'jurisdiction_code',
             'email',
             'address',
             'contact',
             'tags',
             'source_url'],
 'row-count': 143,
 'scheme': 'file',
 'source': 'data/np.csv',
 'time': 0.01,
 'valid': False}
---------
[55,-] [duplicate-row] Row 55 is duplicated to row(s) 46
[78,-] [duplicate-row] Row 78 is duplicated to row(s) 75

@nikeshbalami, or anyone else who can verify this, is it OK to delete the duplicated rows from the data?
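
For anyone who wants to double-check the report without Goodtables, here is a minimal sketch using only the Python standard library. It is not how Goodtables implements its duplicate-row check; `find_duplicate_rows` is a hypothetical helper, and counting the header as row 1 is an assumption about how the report numbers rows. The sample data is made up and is not taken from data/np.csv.

```python
import csv
import io


def find_duplicate_rows(lines):
    """Return (row_number, first_seen_row_number) pairs for exact duplicate rows.

    Row numbers are 1-based and count the header as row 1 (an assumption
    about how the report above numbers rows).
    """
    seen = {}        # row contents -> row number where first seen
    duplicates = []
    for row_number, row in enumerate(csv.reader(lines), start=1):
        # Skip the header and any completely blank rows.
        if row_number == 1 or not any(cell.strip() for cell in row):
            continue
        key = tuple(row)
        if key in seen:
            duplicates.append((row_number, seen[key]))
        else:
            seen[key] = row_number
    return duplicates


if __name__ == "__main__":
    # Hypothetical sample, not the real contents of data/np.csv.
    sample = io.StringIO("id,name\nnp/a,Alpha\nnp/b,Beta\nnp/a,Alpha\n")
    for row_number, first in find_duplicate_rows(sample):
        print(f"Row {row_number} duplicates row {first}")
```

To run it against the real file, pass an open file object instead of the sample, for example `open("data/np.csv", newline="", encoding="utf-8")`. Only rows that match cell for cell are reported, so rows that differ in whitespace or capitalization would not be flagged.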

augusto-herrmann commented 6 years ago

@todrobbins, I think it's safe to accept this PR, right?

nikeshbalami commented 6 years ago

Hi @augusto-herrmann, so sorry I misplaced the notification email previously. Thank you so much for removing the duplicated rows and submitting the PR. Hope it will be merged soon 👍

augusto-herrmann commented 6 years ago

Thanks for the feedback, @nikeshbalami.