stefan-grafberger / mlwhatif

Data-Centric What-If Analysis for Native Machine Learning Pipelines
Apache License 2.0
14 stars 4 forks source link

Add a trial run/execute/estimate option to the fluent API of mlwhatif #25

Closed stefan-grafberger closed 1 year ago

stefan-grafberger commented 1 year ago

Description

Next to the execute method that triggers the analysis execution, there should be an estimate function that only computes the estimated analysis costs.

Example:

data_cleaning = DataCleaning({'education': ErrorType.CAT_MISSING_VALUES, 
                              'age': ErrorType.NUM_MISSING_VALUES,
                              'hours-per-week': ErrorType.OUTLIERS,
                              None: ErrorType.MISLABEL})

estimation_result = PipelineAnalyzer \
    .on_previously_extracted_pipeline(analysis_result.dag_extraction_info) \
    .add_what_if_analysis(data_cleaning) \
    .estimate()
stefan-grafberger commented 1 year ago

Implemented with PR #28