enram / data-repository

Data quality assessment
https://enram.github.io/data-repository/
MIT License
3 stars 1 forks source link

`ecog-04003` contains data outside the expected range #70

Closed peterdesmet closed 1 year ago

peterdesmet commented 1 year ago

The period of interest for ecog-04003 is 2016-09-19 to 2016-10-09 (21 days). However, some countries (Portugal and Germany, one day of NL), provide more data. Need to investigate if that is part of the Zenodo deposit or if it should be removed to avoid confusion.

peterdesmet commented 1 year ago

Data outside the expected range:

date    country (Number of Rows)
2016-08-01  pt  3
2016-08-02  pt  3
2016-08-03  pt  3
2016-08-04  pt  3
2016-08-05  pt  3
2016-08-06  pt  3
2016-08-07  pt  3
2016-08-08  pt  3
2016-08-09  pt  3
2016-08-10  pt  3
2016-08-11  pt  2
2016-08-12  pt  2
2016-08-13  pt  2
2016-08-14  pt  3
2016-08-15  pt  3
2016-08-16  pt  3
2016-08-17  pt  3
2016-08-18  pt  3
2016-08-19  pt  3
2016-08-20  pt  3
2016-08-21  pt  3
2016-08-22  pt  2
2016-08-23  pt  3
2016-08-24  pt  3
2016-08-25  pt  3
2016-08-26  pt  3
2016-08-27  pt  3
2016-08-28  pt  3
2016-08-29  pt  3
2016-08-30  pt  3
2016-08-31  pt  3
2016-09-01  pt  3
2016-09-02  pt  3
2016-09-03  pt  3
2016-09-04  pt  3
2016-09-05  pt  3
2016-09-06  pt  3
2016-09-07  pt  3
2016-09-08  pt  3
2016-09-09  de  16
2016-09-09  pt  3
2016-09-10  de  16
2016-09-10  pt  3
2016-09-11  de  16
2016-09-11  pt  3
2016-09-12  de  16
2016-09-12  pt  3
2016-09-13  de  16
2016-09-13  pt  3
2016-09-14  de  16
2016-09-14  pt  3
2016-09-15  de  16
2016-09-15  pt  3
2016-09-16  de  16
2016-09-16  pt  3
2016-09-17  de  16
2016-09-17  pt  3
2016-09-18  de  16
2016-09-18  pt  3
2016-10-10  pt  3
2016-10-11  pt  3
2016-10-12  pt  3
2016-10-13  pt  3
2016-10-14  pt  3
2016-10-15  pt  3
2016-10-16  pt  3
2016-10-17  pt  3
2016-10-18  pt  3
2016-10-19  pt  3
2016-10-20  pt  3
2016-10-21  pt  3
2016-10-22  pt  3
2016-10-23  pt  3
2016-10-24  pt  3
2016-10-25  pt  3
2016-10-26  pt  3
2016-10-27  pt  3
2016-10-28  pt  3
2016-10-29  pt  3
2016-10-30  pt  3
2016-10-31  pt  3
2016-11-01  pt  3
2016-11-02  pt  3
2016-11-03  pt  3
2016-11-04  pt  2
2016-11-05  pt  2
2016-11-06  pt  3
2016-11-07  pt  3
2016-11-08  pt  3
2016-11-09  pt  3
2016-11-10  pt  3
2016-11-11  pt  3
2016-11-12  pt  3
2016-11-13  pt  3
2016-11-14  pt  3
2016-11-15  pt  3
2016-11-16  pt  2
2016-11-17  pt  2
2016-11-18  pt  2
2016-11-19  pt  2
2016-11-20  fi  10
2016-11-20  nl  1
2016-11-20  pt  2
2016-11-21  pt  3
2016-11-22  pt  3
2016-11-23  pt  3
2016-11-24  pt  3
2016-11-25  pt  3
2016-11-26  pt  3
2016-11-27  pt  3
2016-11-28  pt  3
2016-11-29  pt  3
2016-11-30  pt  3
peterdesmet commented 1 year ago

I checked the vp.zip deposited on Zenodo and that one only contains data from the range of interest. Should we do the same for the data on the S3 bucket?

CeciliaNilsson709 commented 1 year ago

Ugg, probably cut it to the span of dates on zenodo? Feels wrong to "loose" data, but might be the cleanest solution, and the chance anyone would actually use those extra dates are probably very slim.

peterdesmet commented 1 year ago

Data has been removed outside selected time period. Will select update coverage.csv tomorrow to check I haven't forgotten something.

peterdesmet commented 1 year ago

All data have been removed outside the period of interest.