e-mission / e-mission-docs

Repository for docs and issues. If you need help, please file an issue here. Public conversations are better for open source projects than private email.
https://e-mission.readthedocs.io/en/latest
BSD 3-Clause "New" or "Revised" License
15 stars 34 forks source link

Analysis of Emission Pipeline Top 20% and Bottom 80% #1098

Open TeachMeTW opened 6 hours ago

TeachMeTW commented 6 hours ago

Introduces two distinct performance analysis methods to evaluate function-level metrics within our emission dataset. The objective is to identify which functions significantly impact performance and which do not, enabling targeted optimizations and improvements.

Analysis Types

1. Individual Entry Categorization

Features


2. Aggregated Entry Categorization

Features


bottom80_function_level_individual_sorted.csv bottom80_function_level_mean_sorted.csv bottom80_function_level_sum_sorted.csv top20_function_level_individual_sorted.csv top20_function_level_mean_sorted.csv top20_function_level_sum_sorted.csv

TeachMeTW commented 6 hours ago

From both mean and sum, the same functions show up in the top 20% and bottom 80% albeit a different order. I will go ahead and prune them from the pipeline.