synthesized-io / insight

🧿 Metrics & Monitoring of Datasets
BSD 3-Clause "New" or "Revised" License
12 stars 0 forks source link

Speed up KendallTauCorrelation metric calculation #149

Closed tomcarter23 closed 9 months ago

tomcarter23 commented 9 months ago

KendallTauCorrelation was becoming a bottleneck when calculating the metrics for a dataset. This PR switches out the scipy kendall tau function in favour of the Pandas version which seems to be much faster. Tested on the CREDIT dataset.

sonarcloud[bot] commented 9 months ago

Kudos, SonarCloud Quality Gate passed!    Quality Gate passed

Bug A 0 Bugs
Vulnerability A 0 Vulnerabilities
Security Hotspot A 0 Security Hotspots
Code Smell A 0 Code Smells

100.0% 100.0% Coverage
0.0% 0.0% Duplication