There are a couple of factual errors in the (as-of-yet unsourced) timeline in monarchs.json posted by @arvind in 2015. Since README.md notes that
Datasets may contain intentional inconsistencies or errors to provide opportunities for data cleaning exercises and to illustrate common data quality issues.
should we leave the dataset as-is and reference the errors in SOURCES.md? Or should we correct the errors? Personally, I favor hosting accurate data as the default. In this case, it would be simple for an instructor to reintroduce errors in the dataset if needed for teaching purposes. Then again, correcting the errors may "break" existing examples that rely on the existence of errors.
There are a couple of factual errors in the (as-of-yet unsourced) timeline in
monarchs.json
posted by @arvind in 2015. Since README.md notes thatshould we leave the dataset as-is and reference the errors in
SOURCES.md
? Or should we correct the errors? Personally, I favor hosting accurate data as the default. In this case, it would be simple for an instructor to reintroduce errors in the dataset if needed for teaching purposes. Then again, correcting the errors may "break" existing examples that rely on the existence of errors.