Closed mnabian closed 3 weeks ago
@mnabian So looking through the task list, we are rearchitecting the code to make it cleaner. From a user perspective, what will they be able to do more easily in terms of say what types of data formats can they ingest, does our training code provide configs that make it easy to train with the supported data formats?
Scope of the enhancements:
Re-architect Training and Generation: Significantly refactor training and generation processes to hide unnecessary details and configurations, improving modularity, and removing non-intuitive abstractions for a more user-friendly experience.
Enhance Configuration Management: Improve the handling of configurations for more efficient, generalizable, and user-friendly setup. Also hide unnecessary configurations from the users to improve usability.
Simplify Metrics Logging: Streamline the logging process for training and validation metrics to provide clear and accessible performance tracking.
Develop CorrDiff-Lite: Create a simplified, educational version of CorrDiff training and generation, demonstrated on the HRRR dataset, to facilitate learning and accessibility.
Refactor and Generalize Dataloader: Enhance and generalize the dataloader to be served as a template for the users to implement their own customized dataloaders.
Implement Regression Tests: Introduce regression tests for CorrDiff training and generation to ensure accuracy and reliability in future code updates.
Develop Comprehensive Documentation: Create thorough and user-friendly documentation to support all aspects of CorrDiff, ensuring users have clear guidance and resources. This includes both README documentation as well as API documentation.