UAL-RE / LD-Cool-P

Python tool to enable data curation
MIT License
4 stars 5 forks source link

[WIP] Feature: Python script and module to check dataset readiness for data preservation #236

Closed astrochun closed 1 year ago

astrochun commented 3 years ago

Description

This is WIP. Do not merge! This is a new feature to support preparation for preservation checks.

Closes #235

ToDo List

@astrochun: As of 07/26/2021, here is the current status of this PR: While most methods have been tested at a minimum level for the preservation check workflow, it is still not fully tested/explored. The methods of Preserve that have been tested include: get_metadata, save_metadata, check_files, make_symbolic_links, delete_old_readme_files, delete_hidden_files, delete_files (used by delete_hidden_files).

I have not tested:

In addition, we should have the script run as a "dry run" by default and to have a --update option. This will ensure that files are not replaced and to not download metadata. This would be a keyword input option that would need to be available to all method, probably through a (self) instance variable in Preserve.

Test plan

Update Changelog

Resources

Screenshots or additional context

zoidy commented 1 year ago

Preservation work is now in the ReBACH repo. Closing this PR