openwashdata / data

The issue tracker on this repository collects ideas for data to be donated, cleaned, and published. Check out current ideas and add your own.
https://github.com/openwashdata/data/issues

(DATASET) Revenue and water production records from seven rural piped water service providers, 2016-2020 #24

Open uptimeandrew opened 1 year ago

uptimeandrew commented 1 year ago

This dataset has been compiled as part of a PhD project from secondary, longitudinal records maintained by seven agencies that operate piped water services in rural areas of ten countries. Three of the agencies are international nongovernmental organisations that operate as social enterprises. Three agencies are private companies that offer a range of engineering, construction, and management services. One agency is a public utility. Operational data have been extracted, with permission, from proprietary online and offline electronic databases maintained by the participating agencies. Deidentified data have been transferred to a master database where additional transformations and aggregations have been performed. The master database contains separate databases for water service areas, geographic regions, financials, and service levels which are linked with a record identification number corresponding to unique piped water service areas. Transformed data are documented with comments that explain relationships between variables. The full dataset covers roughly 5,500 waterpoints and represents services provided to more than half a million people spanning the years 2016 to 2020.

In general, the dataset consists of the following:

Furthermore, the data adhere to the following criteria:

  1. Services cover rural or a mixture of rural and peri-urban areas
  2. Infrastructure consists of small to medium-sized piped schemes covering one or multiple villages with metered on- and off-premises connections
  3. Users make regular financial contributions
  4. At least 12 concurrent monthly revenue and water volume records are available
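
As a rough illustration of criterion 4, the sketch below counts, per service area, the months in which both a revenue and a water volume record are present, and keeps areas with at least 12 such months. It is not taken from the master database: the field names `area_id`, `month`, `revenue`, and `volume`, the function name, and the example rows are all hypothetical, and "concurrent" is read here as "both values recorded for the same month", without checking that the months are consecutive.

```python
from collections import defaultdict

MIN_CONCURRENT_MONTHS = 12  # threshold stated in criterion 4


def eligible_service_areas(records, min_months=MIN_CONCURRENT_MONTHS):
    """Return sorted IDs of service areas with enough concurrent monthly records.

    `records` is an iterable of dicts with hypothetical keys:
    "area_id", "month" (e.g. "2016-01"), "revenue", and "volume".
    A month counts as concurrent when both revenue and volume are present.
    """
    months_by_area = defaultdict(set)
    for rec in records:
        if rec.get("revenue") is not None and rec.get("volume") is not None:
            months_by_area[rec["area_id"]].add(rec["month"])
    return sorted(
        area for area, months in months_by_area.items() if len(months) >= min_months
    )


# Made-up example rows: area "A" has 12 complete months,
# area "B" is missing the volume record for one month.
example = [
    {"area_id": "A", "month": f"2016-{m:02d}", "revenue": 100.0, "volume": 50.0}
    for m in range(1, 13)
] + [
    {"area_id": "B", "month": f"2016-{m:02d}", "revenue": 100.0,
     "volume": None if m == 6 else 50.0}
    for m in range(1, 13)
]

print(eligible_service_areas(example))
```

Using a set of months per area means duplicate records for the same month are counted once. If the criterion is meant to require consecutive months, the check would need to be extended accordingly.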

Since the assembled dataset holds long-term value to the public, the master database containing anonymised data has been preserved under controlled access in the Oxford University Research Archive at https://ora.ox.ac.uk/objects/uuid:85bb166a-1065-4c4c-867d-5823f11831c9.

larnsce commented 1 year ago

Thank you @uptimeandrew for sharing this with us. The dataset sounds very valuable. I had a look and can see that it also has good documentation.

We will discuss in the team how to move forward with a publication process using our R data package workflow.