salesforce / TransmogrifAI

TransmogrifAI (pronounced trăns-mŏgˈrə-fī) is an AutoML library for building modular, reusable, strongly typed machine learning workflows on Apache Spark with minimal hand-tuning
https://transmogrif.ai
BSD 3-Clause "New" or "Revised" License
2.24k stars 393 forks source link

Remove cardinality computation #438

Closed TuanNguyen27 closed 5 years ago

TuanNguyen27 commented 5 years ago

This will be replaced by a more accurate (albeit approximate) frequency count, such as CountMinSketch. cc @Jauntbox

codecov[bot] commented 5 years ago

Codecov Report

Merging #438 into master will decrease coverage by 0.01%. The diff coverage is 100%.

Impacted file tree graph

@@            Coverage Diff             @@
##           master     #438      +/-   ##
==========================================
- Coverage   86.93%   86.92%   -0.02%     
==========================================
  Files         337      337              
  Lines       11100    11096       -4     
  Branches      366      362       -4     
==========================================
- Hits         9650     9645       -5     
- Misses       1450     1451       +1
Impacted Files Coverage Δ
...om/salesforce/op/filters/FeatureDistribution.scala 98.55% <100%> (-0.08%) :arrow_down:
.../op/features/types/FeatureTypeSparkConverter.scala 98.23% <0%> (-0.89%) :arrow_down:

Continue to review full report at Codecov.

Legend - Click here to learn more Δ = absolute <relative> (impact), ø = not affected, ? = missing data Powered by Codecov. Last update ccc1501...1479746. Read the comment docs.