prio-data / views_pipeline

VIEWS forecasting pipeline for monthly prediction runs. Includes MLops and QA for all models/ensembles.
Other
3 stars 3 forks source link

Generate `get_data.py` Scripts for CM-Level Models #124

Closed Polichinel closed 2 weeks ago

Polichinel commented 2 weeks ago

Issue: Generate get_data.py Scripts for CM-Level Models

THIS CAN NOT BE DONE BEFORE:

Description
Leverage the scaffolding function to create get_data.py scripts for each CM-level model. Use the model catalog to verify that each script includes the correct querysets, and adjust scripts as needed to meet model-specific requirements. Ensure validation includes alignment with querysets specified in the model catalog to maintain consistency and data integrity.

Objectives and Requirements

  1. Generate Data Loading Scripts:

    • Utilize the scaffolding function from to create get_data.py scripts for all CM-level models, placing them in each model’s src/dataloaders/ directory.
    • Ensure each script loads data using the correct querysets based on the model catalog.
  2. Customization and Validation:

    • Adjust each get_data.py script as needed for specific CM model configurations.
    • Validate each script to ensure it meets model-specific data loading requirements and confirm data integrity.

Tasks

Next Steps
Once all get_data.py scripts are complete, we move on to implementing the actual models

Labels
CM-level, data loading, querysets, validation