salesforce / TransmogrifAI

TransmogrifAI (pronounced trăns-mŏgˈrə-fī) is an AutoML library for building modular, reusable, strongly typed machine learning workflows on Apache Spark with minimal hand-tuning
https://transmogrif.ai
BSD 3-Clause "New" or "Revised" License
2.24k stars 392 forks source link

Lift upperBound on # of hash features #477

Closed TuanNguyen27 closed 4 years ago

TuanNguyen27 commented 4 years ago

Related issues Some use cases require a much larger hash space when hashing text features. We are relaxing this limit in HashingVectorizerParams and TextMapHashingVectorizer

codecov[bot] commented 4 years ago

Codecov Report

Merging #477 into master will decrease coverage by 0.00%. The diff coverage is n/a.

Impacted file tree graph

@@            Coverage Diff             @@
##           master     #477      +/-   ##
==========================================
- Coverage   87.01%   87.01%   -0.01%     
==========================================
  Files         345      345              
  Lines       11671    11667       -4     
  Branches      379      384       +5     
==========================================
- Hits        10156    10152       -4     
  Misses       1515     1515              
Impacted Files Coverage Δ
...s/impl/feature/OPCollectionHashingVectorizer.scala 96.50% <ø> (-0.05%) :arrow_down:
...force/op/stages/impl/feature/OPMapVectorizer.scala 97.77% <ø> (-0.04%) :arrow_down:

Continue to review full report at Codecov.

Legend - Click here to learn more Δ = absolute <relative> (impact), ø = not affected, ? = missing data Powered by Codecov. Last update d222438...f8d7ae8. Read the comment docs.