iho-ohi / S-101-Test-Datasets

A repository of S-101 test datasets which make available for development phases and they will be migrated to the Registry later.
22 stars 6 forks source link

Remove invalid/outdated datasets #43

Closed DavidGrant-NIWC closed 9 months ago

DavidGrant-NIWC commented 1 year ago

Only datasets which are intended to be used for testing should be provided in the repo. Retaining datasets with known errors which can't be loaded makes it cumbersome to install the valid datasets; it's necessary to manually traverse the folder structure and install each dataset one at a time.

For instance, DS0013 contains five folders and an additional dataset which isn't in a folder. The standalone dataset appears to be identical to the dataset in the "5" folder. Three of the datasets are unusable (folders "2", "3", and "4"). One of the datasets is on an older FC ("1"), which might be useful for testing except that it has other known issues.

Other dataset folders have similar issues.

DavidGrant-NIWC commented 1 year ago

Also remove extraneous files from the docs folder: image

kusala9 commented 1 year ago

or just clearly mark folders as historical and not for general consumption. I certainly find access to older versions can be useful and also gives us audit/trail of previous versions. Potentially, we just need a dev folder and a production folder with production only containing the active, most up to date version of the dataset. Of course, if we did this by compiling text files and using branches we could use github to do the change management - but that may be a step too far right now.... I would rather the historical versions of the dataset are archived and available. Agree with the sentiment here though, keep only

prune files like .vld which I don't believe are required.

DavidGrant-NIWC commented 1 year ago

Bad datasets should be updated in place or removed. There is a historical record of changes to the branch even if datasets are deleted.

https://github.com/iho-ohi/S-101-Test-Datasets/commits/main

TomRichardson6 commented 1 year ago

@DavidGrant-NIWC and @kusala9 I have now tidied up these folders and retained a sub directory indicating the edition number.

This doesn't totally address David's point about it being cumbersome to install the cells so I would welcome your view on me removing these edition directories?

DavidGrant-NIWC commented 1 year ago

Much better - I can now drop the dev/cells folder on the ShoreECDIS and have it install all cells without errors (I did remove the zip file though).

Recommend updating the Readme.md on the repo to indicate what all the folders are for: dev vs. S-164 (recommend remove the old S-164 folder) cells / docs / exports exports 1.0 vs 2.0 (recommend remove the old folder)

plebihan29n commented 1 year ago

I have tried to organize the new version S-164_1.1/V2/ to clearly separate TestDataSets and other files (logs and dumps)

DavidGrant-NIWC commented 1 year ago

I have tried to organize the new version S-164_1.1/V2/ to clearly separate TestDataSets and other files (logs and dumps)

That is helpful. Some additional recommendations:

plebihan29n commented 10 months ago

These recommendations have been followed the delivered version 3 of S-164 . Next step is to deliver a S-158 package

DavidGrant-NIWC commented 10 months ago

The S-164_1.1 folder should be removed. The rest looks good.

TomRichardson6 commented 9 months ago

I think this has now been addressed following various changes. Please raise new issues if anything further is identified.